Cloudera Distribution for Hadoop vs ScyllaDB comparison

Cancel
You must select at least 2 products to compare!
Cloudera Logo
772 views|598 comparisons
91% willing to recommend
ScyllaDB Logo
2,487 views|1,529 comparisons
100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cloudera Distribution for Hadoop and ScyllaDB based on real PeerSpot user reviews.

Find out in this report how the two NoSQL Databases solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Cloudera Distribution for Hadoop vs. ScyllaDB Report (Updated: March 2024).
767,847 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.""The main advantage is the storage is less expensive.""We also really like the Cloudera community. You can have any question and will have your answer within a few hours.""The tool can be deployed using different container technologies, which makes it very scalable.""We had a data warehouse before all the data. We can process a lot more data structures.""The data science aspect of the solution is valuable.""Very good end-to-end security features.""With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."

More Cloudera Distribution for Hadoop Pros →

"It is lightweight, and it requires less infrastructure.""The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively."

More ScyllaDB Pros →

Cons
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it.""I would like to see an improvement in how the solution helps me to handle the whole cluster.""The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case.""The initial setup of Cloudera is difficult.""While the deployed product is generally functional, there are instances where it presents difficulties.""The procedure for operations could be simplified.""There are better solutions out there that have more features than this one.""We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there is a lot of things that need to improve."

More Cloudera Distribution for Hadoop Cons →

"The documentation of Scylla is an area with shortcomings and needs to be improved.""Data export, along with how we can purchase the data periodically, needs to be improved so that the storage is within control. Then, we could optimize it even better."

More ScyllaDB Cons →

Pricing and Cost Advice
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."
  • More Cloudera Distribution for Hadoop Pricing and Cost Advice →

  • "I believe that there is a yearly licensing cost and that it's expensive."
  • More ScyllaDB Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
    767,847 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The tool can be deployed using different container technologies, which makes it very scalable.
    Top Answer:The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its… more »
    Top Answer:The tool's ability to be deployed on a cloud model is an area of concern where improvements are required. The tool works very well when deployed on an on-premises model. The deployment on a cloud… more »
    Top Answer:The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively.
    Top Answer:I believe that there is a yearly licensing cost and that it's expensive.
    Top Answer:It has just been a month or so for me with Scylla. The documentation of Scylla is an area with shortcomings and needs to be improved. Improvement of documentation is needed considering that I work… more »
    Ranking
    5th
    out of 18 in NoSQL Databases
    Views
    772
    Comparisons
    598
    Reviews
    14
    Average Words per Review
    409
    Rating
    8.1
    6th
    out of 18 in NoSQL Databases
    Views
    2,487
    Comparisons
    1,529
    Reviews
    2
    Average Words per Review
    577
    Rating
    7.5
    Comparisons
    Learn More
    Overview
    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.

    ScyllaDB is an open-source, distributed NoSQL wide-column datastore (a highly scalable NoSQL database), known for its compatibility with Apache Cassandra, and for supporting the same protocols as Cassandra (CQL and Thrift) and the same file formats (SSTable). ScyllaDB is designed for high throughput and low latency, making it suitable for data-intensive applications. Its architecture allows it to deliver remarkable performance on a massive scale, utilizing modern multi-core servers to their fullest potential​

    ScyllaDB utilizes a similar architecture, data format, and query language as Apache Cassandra, providing compatibility while dramatically improving speed and scalability.

    The key advantages of ScyllaDB include its rewritten C++ implementation that eliminates Cassandra's expensive Java garbage collection pauses, built-in caching for fast access to frequently used data, and shard-aware drivers for direct routing of requests. This enables it to fully leverage modern multi-core servers for massive parallelism. The community is active and the latest major release, ScyllaDB Enterprise 2023.1.0 LTS, incorporates over 5,000 code commits focused on enhancing capabilities.

    ScyllaDB supports wide-column data modeling for fast read performance at scale. It includes integrated monitoring and management tools to track database health and performance. For organizations looking to boost speed and reduce costs for NoSQL workloads, ScyllaDB offers a drop-in replacement for Cassandra that delivers lower latency, higher throughput, and increased scalability with fewer nodes. Its seamless migration path makes switching from Cassandra seamless, requiring minimal code changes.

    Sample Customers
    37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    IBM, Investing.com, mParticle, Comcast, GE, Fanatics, Ola, CERN, adgear, Samsung
    Top Industries
    REVIEWERS
    Financial Services Firm25%
    Computer Software Company21%
    Insurance Company14%
    Comms Service Provider11%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company16%
    Educational Organization8%
    Manufacturing Company8%
    VISITORS READING REVIEWS
    Computer Software Company17%
    Financial Services Firm14%
    Educational Organization7%
    Comms Service Provider6%
    Company Size
    REVIEWERS
    Small Business28%
    Midsize Enterprise17%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise9%
    Large Enterprise74%
    VISITORS READING REVIEWS
    Small Business26%
    Midsize Enterprise17%
    Large Enterprise57%
    Buyer's Guide
    Cloudera Distribution for Hadoop vs. ScyllaDB
    March 2024
    Find out what your peers are saying about Cloudera Distribution for Hadoop vs. ScyllaDB and other solutions. Updated: March 2024.
    767,847 professionals have used our research since 2012.

    Cloudera Distribution for Hadoop is ranked 5th in NoSQL Databases with 47 reviews while ScyllaDB is ranked 6th in NoSQL Databases with 2 reviews. Cloudera Distribution for Hadoop is rated 8.0, while ScyllaDB is rated 7.6. The top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". On the other hand, the top reviewer of ScyllaDB writes "A solution that offers good performance and flexibility to its users". Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark, MongoDB and SingleStore, whereas ScyllaDB is most compared with MongoDB, Cassandra, Couchbase, Apache HBase and Oracle NoSQL. See our Cloudera Distribution for Hadoop vs. ScyllaDB report.

    See our list of best NoSQL Databases vendors.

    We monitor all NoSQL Databases reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.