Cloudera Distribution for Hadoop Competitors and Alternatives

The top Cloudera Distribution for Hadoop competitors are:
  • Amazon EMR
  • Cassandra
  • Hortonworks Data Platform
  • MapR
  • IBM InfoSphere BigInsights
  • Apache Spark
  • Neo4j
  • BlueData
Read reviews of Cloudera Distribution for Hadoop competitors and alternatives
Matthew Cloney
Real User
Data Science Engineer
Sep 27 2017

What is most valuable?

The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions. You can do it very easily and...

How has it helped my organization?

Well, I've been at two different companies and mostly I'll relate to my experience at HLI, Human Longevity, in San Diego. We used it for...

What needs improvement?

There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange. It could have been...

Which solutions did we use previously?

No, not really. The reason that we used it at that company - when I got there, that's what they were using. It was because my boss was very big...

What other advice do I have?

I would say take advantage of the documentation that exists: there are a lot of tutorials, and there's a really good community. The...
Abhijit Nayak
Consultant
Manager | Data Science Enthusiast | Management Consultant at a consultancy with 5,001-10,000 employees
Dec 10 2017

What do you think of Apache Spark?

How has it helped my organization?

Organisations can now harness richer data sets and benefit from use cases that add value to their business functions.

What is most valuable?

Distributed in-memory processing. Some of the algorithms are resource heavy, and executing them requires a lot of RAM and CPU. With Hadoop-related technologies, we can distribute the workload across multiple commodity hardware machines.

What needs improvement?

Include more machine learning algorithms and the ability to handle true streaming of data rather than micro-batch processing.

For how long have I used the solution?

Three to five years.

What do I think about the stability of the solution?

At times when users do not know how to use Spark and request a lot of resources, then the underlying JVMs...
Cassandra
Apache
Anonymous User
Real User
Sr Manager, Engineering, Reporting & Analytics, Big data at a tech company with 1,001-5,000 employees
May 16 2017

What is most valuable?

I really appreciate the high availability, automated replication, linear scalability, and automated region fail-over.

How has it helped my organization?

We've used Apache Cassandra for solutions that we sell to our customers. It's used as our cloud based backend store as...

What needs improvement?

Out-of-the-box monitoring, troubleshooting, and maintenance are involved. There are several utilities/interfaces...

What's my experience with pricing, setup cost, and licensing?

We use the open source version, so it's free. Costing needs to take into account home grown maintenance and support, as...

Which solutions did we use previously?

This is a new cloud based enterprise product, so there weren't previous solutions.

What other advice do I have?

If you plan to use the open source version, make sure you hire a Cassandra expert or train yourself in the internals of...
