Cloudera Distribution for Hadoop Review

Cloudera Manager is a good tool to administer. Sometimes it gets confusing to follow a single path for installation.


What is most valuable?

  • Cloudera Manager for administering the Hadoop cluster
  • Cloudera specific solutions like Impala
  • Extensive documentation
  • Good user community

How has it helped my organization?

Implementing a Hadoop cluster has become relatively straight-forward using CDH. Administering it is also less complex. As a result, efforts spent in these areas are less than anticipated.

What needs improvement?

  • Some of the UI features seem confusing e.g. charts on the CM Services page
  • Sometimes it gets confusing to follow a single path for installation due to multiple recommended approaches e.g. parcels vs packages

For how long have I used the solution?

We have been using it for the last two years.

What was my experience with deployment of the solution?

Following a single path for installation becomes confusing due to multiple recommended approaches e.g. parcels vs packages.

What do I think about the stability of the solution?

Flume seems unstable and has to be restarted quite often.

What do I think about the scalability of the solution?

None as such

How is customer service and technical support?

We are mostly using Cloudera Express so we did not use their technical support. However, the Cloudera community is an active place and Cloudera representatives participate actively in understanding and resolving issues.

Which solutions did we use previously?

Cloudera is a prominent player in the Hadoop space and we did not have a need to adopt a different solution. However, we are also looking to work on Hadoop and MapR

How was the initial setup?

Following a single path for installation was initially confusing due to multiple recommended approaches e.g. parcels vs. packages. However, after a while, we managed to master it. However, knoweldge of Cloudera Manager and Hadoop architecture is a must.

What about the implementation team?

We have our own team of consultants who are proficient in implementing it. The high level steps about the implementation remain the same; however, it is the environment specific issues which are challenging.

What was our ROI?

We haven't really measured ROI.

What's my experience with pricing, setup cost, and licensing?

Licensing price on per node basis for Cloudera seems to be pretty steep (based on the inputs we have received from Cloudera).

What other advice do I have?

It is user friendly and installation is pretty straightforward. Cloudera Manager is a good tool to administer it. However, configuration for specific requirements is sometimes pretty complex.

You should have a team which is knowledgeable in Hadoop. Do keep in mind that the product is still maturing so there are good chances that you will come across unexpected issues now and then.

Disclosure: My company has a business relationship with this vendor other than being a customer: We're Cloudera partners and regularly install CDH
Add a Comment
Guest
Sign Up with Email