We performed a comparison between Cloudera Distribution for Hadoop, Hortonworks Data Platform, and IBM InfoSphere BigInsights [EOL] based on real PeerSpot user reviews.
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"The search function is the most valuable aspect of the solution."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"The solution is reliable and stable, it fits our requirements."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"The scalability is the key reason why we are on this platform."
"The data platform is pretty neat. The workflow is also really good."
"Hortonworks should not be expensive at all to those looking into using it."
"Ranger for security; with Ranger we can manage users’ permissions/access controls very easily."
"Ambari Web UI: user-friendly."
"Distributed computing, secure containerization, and governance capabilities are the most valuable features."
"Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"InfoSphere Streams was the one core product from the platform that we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"The solution is not fit for on-premise distributions."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve."
"The Cloudera training has deteriorated significantly."
"The dashboard could be improved."
"Cloudera's support is extremely bad and cannot be relied on."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"It's at end of life and no longer will there be improvements."
"More information could be there to simplify the process of running the product."
"Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases."
"Since Cloudera acquired HDP, it's been bundled with CDH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS."
"Security and workload management need improvement."
"I would like to see more support for containers such as Docker and OpenShift."
"The cost of the solution is high and there is room for improvement."
"The version control of the software is also an issue."
"The UI was not interactive: Responses used to be very slow and hang up at times."