Cassandra vs Cloudera Distribution for Hadoop comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
3,406 views|2,666 comparisons
89% willing to recommend
Cloudera Logo
752 views|571 comparisons
91% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cassandra and Cloudera Distribution for Hadoop based on real PeerSpot user reviews.

Find out in this report how the two NoSQL Databases solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Cassandra vs. Cloudera Distribution for Hadoop Report (Updated: March 2024).
772,567 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The most valuable features are the counter features and the NoSQL schema. It also has good scalability. You can scale Cassandra to any finite level.""Our primary use case for the solution is testing.""The most valuable feature of Cassandra is its fast retrieval. Additionally, the solution can handle large amounts of data. It is the quickest application we use.""Can achieve continuous data without a single downtime because of node to node ring architecture.""The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount.""The most valuable features of Cassandra are the NoSQL database, high performance, and zero-copy streaming.""I am satisfied with the performance.""Cassandra is good. It's better than CouchDB, and we are using it in parallel with CouchDB. Cassandra looks better and is more user-friendly."

More Cassandra Pros →

"We had a data warehouse before all the data. We can process a lot more data structures.""Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis.""The product is completely secure.""We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.""The solution is stable.""The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on.""The most valuable feature is Kubernetes.""The main advantage is the storage is less expensive."

More Cloudera Distribution for Hadoop Pros →

Cons
"It can be difficult to analyze what's going on inside of the database relative to other databases. It can also be difficult to troubleshoot sometimes.""Fine-tuning was a bit of a challenge.""Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product.""There could be more integration, and it could be more user-friendly.""There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using the Java SDK.""The disc space is lacking. You need to free it up as you are working.""Cassandra can improve by adding more built-in tools. For example, if you want to do some maintenance activities in the cluster, we have to depend on third-party tools. Having these tools build-in would be e benefit.""The secondary index in Cassandra was a bit problematic and could be improved."

More Cassandra Cons →

"Cloudera's support is extremely bad and cannot be relied on.""Currently, we are using many other tools such as Spark and Blade Job to improve the performance.""Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment.""The governance aspect of the solution should be improved.""They should focus on upgrading their technical capabilities in the market.""The procedure for operations could be simplified.""The Cloudera training has deteriorated significantly.""Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."

More Cloudera Distribution for Hadoop Cons →

Pricing and Cost Advice
  • "Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
  • "There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
  • "We are using the open-source version of Cassandra, the solution is free."
  • "We pay for a license."
  • "I don't have the specific numbers on pricing, but it was fairly priced."
  • More Cassandra Pricing and Cost Advice →

  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."
  • More Cloudera Distribution for Hadoop Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
    772,567 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time… more »
    Top Answer:There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using… more »
    Top Answer:The tool can be deployed using different container technologies, which makes it very scalable.
    Top Answer:The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its… more »
    Top Answer:The tool's ability to be deployed on a cloud model is an area of concern where improvements are required. The tool works very well when deployed on an on-premises model. The deployment on a cloud… more »
    Ranking
    4th
    out of 18 in NoSQL Databases
    Views
    3,406
    Comparisons
    2,666
    Reviews
    7
    Average Words per Review
    358
    Rating
    7.3
    5th
    out of 18 in NoSQL Databases
    Views
    752
    Comparisons
    571
    Reviews
    14
    Average Words per Review
    443
    Rating
    8.1
    Comparisons
    Learn More
    Overview

    Cassandra is a distributed and scalable database management system used for real-time data processing. 

    It is highly valued for its ability to handle large amounts of data, scalability, high availability, fault tolerance, and flexible data model. 

    It is commonly used in finance, e-commerce, and social media industries.

    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
    Sample Customers
    1. Apple 2. Netflix 3. Facebook 4. Instagram 5. Twitter 6. eBay 7. Spotify 8. Uber 9. Airbnb 10. Adobe 11. Cisco 12. IBM 13. Microsoft 14. Yahoo 15. Reddit 16. Pinterest 17. Salesforce 18. LinkedIn 19. Hulu 20. Airbnb 21. Walmart 22. Target 23. Sony 24. Intel 25. Cisco 26. HP 27. Oracle 28. SAP 29. GE 30. Siemens 31. Volkswagen 32. Toyota
    37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    Top Industries
    REVIEWERS
    Comms Service Provider25%
    Computer Software Company13%
    University13%
    Financial Services Firm13%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company15%
    Healthcare Company6%
    Comms Service Provider6%
    REVIEWERS
    Financial Services Firm25%
    Computer Software Company21%
    Insurance Company14%
    Comms Service Provider11%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company15%
    Educational Organization9%
    Manufacturing Company9%
    Company Size
    REVIEWERS
    Small Business39%
    Large Enterprise61%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise68%
    REVIEWERS
    Small Business28%
    Midsize Enterprise17%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise9%
    Large Enterprise75%
    Buyer's Guide
    Cassandra vs. Cloudera Distribution for Hadoop
    March 2024
    Find out what your peers are saying about Cassandra vs. Cloudera Distribution for Hadoop and other solutions. Updated: March 2024.
    772,567 professionals have used our research since 2012.

    Cassandra is ranked 4th in NoSQL Databases with 19 reviews while Cloudera Distribution for Hadoop is ranked 5th in NoSQL Databases with 47 reviews. Cassandra is rated 8.0, while Cloudera Distribution for Hadoop is rated 8.0. The top reviewer of Cassandra writes "Well-equipped to handle a massive influx of data and billions of requests". On the other hand, the top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". Cassandra is most compared with Couchbase, MongoDB, InfluxDB, ScyllaDB and Apache HBase, whereas Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark, MongoDB and ScyllaDB. See our Cassandra vs. Cloudera Distribution for Hadoop report.

    See our list of best NoSQL Databases vendors.

    We monitor all NoSQL Databases reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.