Cassandra vs Cloudera Distribution for Hadoop comparison

Cassandra: 3,406 views | 2,666 comparisons | 89% willing to recommend
Cloudera Distribution for Hadoop: 752 views | 571 comparisons | 91% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cassandra and Cloudera Distribution for Hadoop based on real PeerSpot user reviews.

Find out in this report how the two NoSQL database solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed Cassandra vs. Cloudera Distribution for Hadoop Report (Updated: March 2024).
770,616 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
Cassandra:
  • "We can add almost one million columns to the solution."
  • "The most valuable feature of Cassandra is its fast retrieval. Additionally, the solution can handle large amounts of data. It is the quickest application we use."
  • "Cassandra is good. It's better than CouchDB, and we are using it in parallel with CouchDB. Cassandra looks better and is more user-friendly."
  • "Some of the valued features of this solution are it has good performance and failover."
  • "I am getting much better performance than relational databases."
  • "The solution's database capabilities are very good."
  • "Can achieve continuous data without a single downtime because of node to node ring architecture."
  • "The most valuable features of this solution are its speed and distributed nature."

Cloudera Distribution for Hadoop:
  • "The solution's most valuable feature is the enterprise data platform."
  • "The most valuable feature is Kubernetes."
  • "The main advantage is the storage is less expensive."
  • "The search function is the most valuable aspect of the solution."
  • "The data science aspect of the solution is valuable."
  • "The product as a whole is good."
  • "The feature I find most valuable is that the solution is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized."
  • "The file system is a valuable feature."

Cons
Cassandra:
  • "It can be difficult to analyze what's going on inside of the database relative to other databases. It can also be difficult to troubleshoot sometimes."
  • "There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using the Java SDK."
  • "The solution is not easy to use because it is a big database and you have to learn the interface. This is the case though in most of these solutions."
  • "The initial setup of Cassandra can be difficult in the configuration. There might be a need to have assistance. The implementation process can take six months for connecting to certain databases."
  • "There could be more integration, and it could be more user-friendly."
  • "The solution is limited to a linear performance."
  • "Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product."
  • "The disc space is lacking. You need to free it up as you are working."

Cloudera Distribution for Hadoop:
  • "The initial setup of Cloudera is difficult."
  • "Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
  • "This is a very expensive solution."
  • "The Cloudera training has deteriorated significantly."
  • "The user infrastructure and user interface need to be improved, as well as the performance. The GUI needs to be better."
  • "The solution is not fit for on-premise distributions."
  • "While the deployed product is generally functional, there are instances where it presents difficulties."
  • "Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."

Pricing and Cost Advice
Cassandra:
  • "Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
  • "There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
  • "We are using the open-source version of Cassandra, the solution is free."
  • "We pay for a license."
  • "I don't have the specific numbers on pricing, but it was fairly priced."

Cloudera Distribution for Hadoop:
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."

    Questions from the Community
    Top Answer: The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time…
    Top Answer: There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using…
    Top Answer: The tool can be deployed using different container technologies, which makes it very scalable.
    Top Answer: The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its…
    Top Answer: The tool's ability to be deployed on a cloud model is an area of concern where improvements are required. The tool works very well when deployed on an on-premises model. The deployment on a cloud…
    Ranking
    Cassandra: 4th out of 18 in NoSQL Databases
    Views: 3,406 | Comparisons: 2,666 | Reviews: 7 | Average Words per Review: 358 | Rating: 7.3
    Cloudera Distribution for Hadoop: 5th out of 18 in NoSQL Databases
    Views: 752 | Comparisons: 571 | Reviews: 14 | Average Words per Review: 443 | Rating: 8.1
    Overview

    Cassandra is a distributed and scalable database management system used for real-time data processing. 

    It is highly valued for its ability to handle large amounts of data, scalability, high availability, fault tolerance, and flexible data model. 

    It is commonly used in finance, e-commerce, and social media industries.

    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
    Sample Customers
    Cassandra: Apple, Netflix, Facebook, Instagram, Twitter, eBay, Spotify, Uber, Airbnb, Adobe, Cisco, IBM, Microsoft, Yahoo, Reddit, Pinterest, Salesforce, LinkedIn, Hulu, Walmart, Target, Sony, Intel, HP, Oracle, SAP, GE, Siemens, Volkswagen, Toyota
    Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    Top Industries
    Cassandra
    Reviewers: Comms Service Provider 25%, Computer Software Company 13%, University 13%, Financial Services Firm 13%
    Visitors reading reviews: Financial Services Firm 20%, Computer Software Company 15%, Comms Service Provider 7%, Healthcare Company 6%
    Cloudera Distribution for Hadoop
    Reviewers: Financial Services Firm 25%, Computer Software Company 21%, Insurance Company 14%, Comms Service Provider 11%
    Visitors reading reviews: Financial Services Firm 22%, Computer Software Company 16%, Educational Organization 9%, Manufacturing Company 8%
    Company Size
    Cassandra
    Reviewers: Small Business 39%, Large Enterprise 61%
    Visitors reading reviews: Small Business 18%, Midsize Enterprise 13%, Large Enterprise 69%
    Cloudera Distribution for Hadoop
    Reviewers: Small Business 28%, Midsize Enterprise 17%, Large Enterprise 55%
    Visitors reading reviews: Small Business 16%, Midsize Enterprise 9%, Large Enterprise 75%

    Cassandra is ranked 4th in NoSQL Databases with 19 reviews, while Cloudera Distribution for Hadoop is ranked 5th with 47 reviews. Both are rated 8.0. The top reviewer of Cassandra writes "Well-equipped to handle a massive influx of data and billions of requests". The top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". Cassandra is most compared with Couchbase, MongoDB, InfluxDB, ScyllaDB, and Vertica, whereas Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark, MongoDB, and ScyllaDB.


    We monitor all NoSQL Databases reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.