What is most valuable?
We evaluated Cloudera and Hortonworks. Based on that evaluation and on our experience running a 60-node production cluster and a 12-node development cluster, the most valuable features of Hortonworks are:
- 100% open source
- No lock-in, unlike Cloudera
- Fast and accurate support
- The largest number of Hadoop committers by any measure
- Hive performs better and is easier to use than Impala
How has it helped my organization?
It helps a lot with data in motion (real-time ingestion and management). We are able to monetize our data for third parties, delivering it to our end customers within a T+20 minute time frame.
What needs improvement?
- Ease of use
For how long have I used the solution?
I have used it for three years.
What was my experience with deployment of the solution?
I initially encountered deployment issues, but Hortonworks was very good at resolving them.
What do I think about the stability of the solution?
I have not encountered stability issues.
What do I think about the scalability of the solution?
I have not encountered any scalability issues at all. That's the key reason we picked HDP over Cloudera: Cloudera has issues and doesn't support compression for Hive in ORC format. They push only their own products, which is not good.
How are customer service and technical support?
Customer service has been excellent from day one until now, and our admin is comfortable with the SLA and turnaround time.
Technical support is very good and proactive with SmartSense.
Which solution did I use previously and why did I switch?
We switched from Cloudera. Initially, we went with Cloudera because it was a popular choice in the market, then realized it was a bad choice. Before we scaled from 6 nodes to 12 nodes and before we went live in production, we scrapped it due to Impala's performance and lock-in.
How was the initial setup?
Using Ambari, it was easy to set up, and we even tried AWS for a test cluster.
What about the implementation team?
An in-house team implemented it: two admins, seven developers, one data scientist and one PM, plus 22 business users on the customer (end-user) side.
What was our ROI?
What's my experience with pricing, setup cost, and licensing?
Hortonworks is the best of all three flavors. If all goes well, we might run on open source alone within the next three years; with the others you can't, due to lock-in.
Which other solutions did I evaluate?
Before choosing this product, we also evaluated Cloudera.
What other advice do I have?
It is the best in terms of product vision and actual delivery.
Which version of this solution are you currently using?