We performed a comparison between Cloudera DataFlow and Cloudera Distribution for Hadoop based on real PeerSpot user reviews.
"This solution is very scalable and robust."
"The initial setup was not so difficult."
"DataFlow's performance is okay."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"The product as a whole is good."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"The data science aspect of the solution is valuable."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"The solution's most valuable feature is the enterprise data platform."
"Very good end-to-end security features."
"It's an outdated legacy product that doesn't meet the needs of modern data analysts and scientists."
"Although their workflow is pretty neat, it still requires a lot of transformation coding; especially when it comes to Python and other demanding programming languages."
"It is not easy to use the R language. Though I don't know if it's possible, I believe it is possible, but it is not the best language for machine learning."
"The initial setup of Cloudera is difficult."
"They should focus on upgrading their technical capabilities in the market."
"Cloudera's support is extremely bad and cannot be relied on."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The pricing needs to improve."
"There are better solutions out there that have more features than this one."
"The procedure for operations could be simplified."
Cloudera DataFlow is ranked 13th in Streaming Analytics with 3 reviews while Cloudera Distribution for Hadoop is ranked 2nd in Hadoop with 47 reviews. Cloudera DataFlow is rated 6.6, while Cloudera Distribution for Hadoop is rated 8.0. The top reviewer of Cloudera DataFlow writes "A scalable and robust platform for analyzing data". On the other hand, the top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". Cloudera DataFlow is most compared with Databricks, Confluent, Amazon MSK, Spring Cloud Data Flow and Informatica Data Engineering Streaming, whereas Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark, MongoDB and Cassandra.
We monitor all Streaming Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.