Cassandra vs Cloudera Distribution for Hadoop comparison

Cassandra: 3,266 views | 2,594 comparisons | 89% willing to recommend
Cloudera Distribution for Hadoop: 730 views | 563 comparisons | 91% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cassandra and Cloudera Distribution for Hadoop based on real PeerSpot user reviews.

Find out in this report how the two NoSQL database solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed Cassandra vs. Cloudera Distribution for Hadoop Report (Updated: March 2024).
772,649 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros

Cassandra:
  • "Our primary use case for the solution is testing."
  • "Can achieve continuous data without a single downtime because of node to node ring architecture."
  • "I am satisfied with the performance."
  • "The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount."
  • "The most valuable features are the counter features and the NoSQL schema. It also has good scalability. You can scale Cassandra to any finite level."
  • "We can add almost one million columns to the solution."
  • "Some of the valued features of this solution are it has good performance and failover."
  • "Cassandra has some features that are more useful for specific use cases where you have time series where you have huge amounts of writes. That should be quick, but not specifically the reads. We needed to have quicker reads and writes and this is why we are using Cassandra right now."

Cloudera Distribution for Hadoop:
  • "CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
  • "With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
  • "The main advantage is the storage is less expensive."
  • "The solution's most valuable feature is the enterprise data platform."
  • "Cloudera is a very manageable solution with good support."
  • "The scalability of Cloudera Distribution for Hadoop is excellent."
  • "The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
  • "The solution is stable."

Cons

Cassandra:
  • "The solution is limited to a linear performance."
  • "The initial setup of Cassandra can be difficult in the configuration. There might be a need to have assistance. The implementation process can take six months for connecting to certain databases."
  • "There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using the Java SDK."
  • "There could be more integration, and it could be more user-friendly."
  • "Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product."
  • "Doesn't support a solution that can give aggregation."
  • "Cassandra can improve by adding more built-in tools. For example, if you want to do some maintenance activities in the cluster, we have to depend on third-party tools. Having these tools built in would be a benefit."
  • "The solution doesn't have joins between tables so you need other tools for that."

Cloudera Distribution for Hadoop:
  • "Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
  • "Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
  • "I would like to see an improvement in how the solution helps me to handle the whole cluster."
  • "Cloudera's support is extremely bad and cannot be relied on."
  • "Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
  • "There are multiple bugs when we update."
  • "The competitors provide better functionalities."
  • "There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
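
Several of the Cassandra reviewers quoted above say that joins, and some ORDER BY, GROUP BY, and aggregation queries, had to be handled outside the database. As a rough illustration of what that can mean in practice, the sketch below joins and aggregates rows in application code. It is a minimal sketch, not taken from any review: the table names, column names, and the join_and_sum helper are all hypothetical, and the in-memory lists stand in for the results of two separate single-table queries.

```python
from collections import defaultdict

# Hypothetical rows, standing in for the results of two separate
# single-table queries (table and column names are assumptions).
customers = [
    {"customer_id": 1, "name": "Ana"},
    {"customer_id": 2, "name": "Ben"},
]
orders = [
    {"order_id": 10, "customer_id": 1, "total": 25.0},
    {"order_id": 11, "customer_id": 1, "total": 15.0},
    {"order_id": 12, "customer_id": 2, "total": 50.0},
]


def join_and_sum(customers, orders):
    """Hash-join orders to customers, then sum order totals per customer name."""
    # Build a lookup from customer_id to name (the "join" side).
    names = {c["customer_id"]: c["name"] for c in customers}
    totals = defaultdict(float)
    for order in orders:
        name = names.get(order["customer_id"])
        if name is not None:  # skip orders with no matching customer
            totals[name] += order["total"]  # the "GROUP BY ... SUM" step
    # Sort in the application (the "ORDER BY" step), largest total first.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


result = join_and_sum(customers, orders)
print(result)
```

This approach only suits result sets that fit in application memory; for larger data, reviewers mention relying on other tools for this kind of work.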

Pricing and Cost Advice

Cassandra:
  • "Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
  • "There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
  • "We are using the open-source version of Cassandra, the solution is free."
  • "We pay for a license."
  • "I don't have the specific numbers on pricing, but it was fairly priced."

Cloudera Distribution for Hadoop:
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."

    Questions from the Community
    Top Answer: The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time…
    Top Answer: There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using…
    Top Answer: The tool can be deployed using different container technologies, which makes it very scalable.
    Top Answer: The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its…
    Top Answer: The tool's ability to be deployed on a cloud model is an area of concern where improvements are required. The tool works very well when deployed on an on-premises model. The deployment on a cloud…
    Ranking (NoSQL Databases)
                                  Cassandra        Cloudera Distribution for Hadoop
    Rank                          4th out of 18    5th out of 18
    Views                         3,266            730
    Comparisons                   2,594            563
    Reviews                       6                13
    Average Words per Review      353              424
    Rating                        7.2              8.2
    Overview

    Cassandra is a distributed and scalable database management system used for real-time data processing. 

    It is highly valued for its ability to handle large amounts of data, scalability, high availability, fault tolerance, and flexible data model. 

    It is commonly used in finance, e-commerce, and social media industries.

    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
    Sample Customers
    Cassandra: Apple, Netflix, Facebook, Instagram, Twitter, eBay, Spotify, Uber, Airbnb, Adobe, Cisco, IBM, Microsoft, Yahoo, Reddit, Pinterest, Salesforce, LinkedIn, Hulu, Walmart, Target, Sony, Intel, HP, Oracle, SAP, GE, Siemens, Volkswagen, Toyota
    Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    Top Industries
    Cassandra
      Reviewers: Comms Service Provider 25%, Computer Software Company 13%, University 13%, Financial Services Firm 13%
      Visitors reading reviews: Financial Services Firm 20%, Computer Software Company 16%, Comms Service Provider 6%, Healthcare Company 6%
    Cloudera Distribution for Hadoop
      Reviewers: Financial Services Firm 25%, Computer Software Company 21%, Insurance Company 14%, Comms Service Provider 11%
      Visitors reading reviews: Financial Services Firm 22%, Computer Software Company 15%, Educational Organization 8%, Manufacturing Company 8%
    Company Size
    Cassandra
      Reviewers: Small Business 39%, Large Enterprise 61%
      Visitors reading reviews: Small Business 19%, Midsize Enterprise 13%, Large Enterprise 69%
    Cloudera Distribution for Hadoop
      Reviewers: Small Business 28%, Midsize Enterprise 17%, Large Enterprise 55%
      Visitors reading reviews: Small Business 15%, Midsize Enterprise 9%, Large Enterprise 75%

    Cassandra is ranked 4th in NoSQL Databases with 19 reviews, while Cloudera Distribution for Hadoop is ranked 5th in NoSQL Databases with 47 reviews. Both are rated 8.0. The top reviewer of Cassandra writes "Well-equipped to handle a massive influx of data and billions of requests". The top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". Cassandra is most compared with Couchbase, MongoDB, ScyllaDB, InfluxDB and Vertica, whereas Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark, ScyllaDB and MongoDB.


    We monitor all NoSQL Databases reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.