We performed a comparison between Cloudera Distribution for Hadoop, Hortonworks Data Platform, and IBM InfoSphere BigInsights [EOL] based on real PeerSpot user reviews.
Find out what your peers are saying about Cloudera, Apache, Amazon and others in Hadoop."CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The search function is the most valuable aspect of the solution."
"Customer service and support were able to fix whatever the issue was."
"I don't see any performance issues."
"Very good end-to-end security features."
"We had a data warehouse before all the data. We can process a lot more data structures."
"Distributed computing, secure containerization, and governance capabilities are the most valuable features."
"We use it for data science activities."
"The data platform is pretty neat. The workflow is also really good."
"The upgrades and patches must come from Hortonworks."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"The scalability is the key reason why we are on this platform."
"Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
"The product offers a fairly easy setup process."
"InfoSphere Streams was the one core product from the platform in which we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"The pricing needs to improve."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"The initial setup of Cloudera is difficult."
"The competitors provide better functionalities."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The procedure for operations could be simplified."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"More information could be there to simplify the process of running the product."
"Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS."
"Deleting any service requires a lot of clean up, unlike Cloudera."
"The cost of the solution is high and there is room for improvement."
"It would also be nice if there were less coding involved."
"Security and workload management need improvement."
"I would like to see more support for containers such as Docker and OpenShift."
"It's at end of life and no longer will there be improvements."
"The UI was not interactive: Responses used to be very slow and hang up at times."
More Cloudera Distribution for Hadoop Pricing and Cost Advice →
Earn 20 points