We performed a comparison between Cloudera Distribution for Hadoop, Hortonworks Data Platform, and IBM InfoSphere BigInsights [EOL] based on real PeerSpot user reviews.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop."The tool can be deployed using different container technologies, which makes it very scalable."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The product is completely secure."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"Very good end-to-end security features."
"I don't see any performance issues."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
"The product offers a fairly easy setup process."
"We use it for data science activities."
"Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
"It is a scalable platform."
"Distributed computing, secure containerization, and governance capabilities are the most valuable features."
"Ambari Web UI: user-friendly."
"InfoSphere Streams was the one core product from the platform in which we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The Cloudera training has deteriorated significantly."
"It could be faster and more user-friendly."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"The competitors provide better functionalities."
"This is a very expensive solution."
"The pricing needs to improve."
"Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases."
"Deleting any service requires a lot of clean up, unlike Cloudera."
"It's at end of life and no longer will there be improvements."
"Security and workload management need improvement."
"It would also be nice if there were less coding involved."
"The version control of the software is also an issue."
"I would like to see more support for containers such as Docker and OpenShift."
"The cost of the solution is high and there is room for improvement."
"The UI was not interactive: Responses used to be very slow and hang up at times."
More Cloudera Distribution for Hadoop Pricing and Cost Advice →
Earn 20 points