Cloudera Distribution for Hadoop Review

Feature-rich and scalable with good support, but there are performance issues and security could be improved

What is our primary use case?

We are using this solution for storing Big Data in one centralized location.

How has it helped my organization?

It has been helpful in allowing data storage in one centralized location with data lakes and all of the surrounding applications.

All of our data processes store their data in the Big Data lake.

What is most valuable?

It allows us to store huge amounts of data, which is an advantage.

It includes BI (Business Intelligence) tools, and there are many AI tools.

We are able to connect and analyze the data to get reports. The reports are very good.

The main advantage is that storage is less expensive.

What needs improvement?

The performance can be improved. We have experienced some performance issues. It is not as sophisticated as Oracle Sybase.

Currently, we are using many other tools such as Spark and Blade Job to improve the performance.

The setup is complex and could be simplified.

The security needs to be improved.

For how long have I used the solution?

I have been using this solution since 2015.

What do I think about the stability of the solution?

It's a stable solution.

What do I think about the scalability of the solution?

Scalability is good. The data is replicated, and by default there is a replication factor with Big Data.

We have grown over the years. When we started we had 10 nodes, and we have since increased to a large number of nodes.
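
To illustrate what a replication factor means for capacity as a cluster grows, here is a minimal Python sketch. It assumes a replication factor of 3, which is the usual HDFS default; the review does not state the value actually in use. The function name and the data size are hypothetical and chosen only for illustration.

```python
def raw_storage_needed(logical_tb: float, replication_factor: int = 3) -> float:
    """Return the raw disk capacity (in TB) consumed by replicated data.

    Assumption: every block is stored `replication_factor` times, so raw
    usage is simply the logical size multiplied by that factor.
    """
    if logical_tb < 0:
        raise ValueError("logical_tb must be non-negative")
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return logical_tb * replication_factor


if __name__ == "__main__":
    # Hypothetical example: 100 TB of data with the assumed factor of 3.
    print(raw_storage_needed(100))
```

Under these assumptions, every terabyte of data takes up three terabytes of disk across the cluster, which is worth keeping in mind when adding nodes.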

How are customer service and technical support?

Technical support is good. I have been able to learn from them. As a developer, I am learning every day.

I would rate the technical support a ten out of ten.

Which solution did I use previously and why did I switch?

Previously, we were using Oracle Sybase SQL. We switched because we have now introduced Big Data.

How was the initial setup?

The initial setup was complex.

It's not as simple as Oracle Sybase.

It's a complex architecture because you have raw data and many engines.

What's my experience with pricing, setup cost, and licensing?

Compared with Oracle Sybase and SQL, it's cheaper. It's not expensive.

What other advice do I have?

I work in security and software development.

We are currently considering migrating to the cloud, and planning on using Microsoft Azure, mainly for the Big Data component.

I would rate this solution a five out of ten.

Which deployment model are you using for this solution?


Which version of this solution are you currently using?

CDH 5.4

Disclosure: I am a real user, and this review is based on my own experience and opinions.