Find out what your peers are saying about Apache, Cloudera, Hortonworks and others in Hadoop.
279,296 professionals have used our research since 2012.
Chart Key
Average Rating: Average rating based on reviews
Views: Number of total page views
Comparisons: Number of times compared to another product
Reviews: Total number of reviews on IT Central Station
Followers: Number of followers on IT Central Station
The total ranking of a product, represented by the bar length, is based on a weighted aggregate score. The score is calculated as follows: The product with the highest count in each area gets the highest available score (20 points for Reviews; 16 points each for Views, Comparisons, and Followers). Every other product is assigned points based on its total in proportion to the #1 product in that area. For example, if a product has 80% of the number of reviews of the product with the most reviews, then the product's score for reviews would be 20 points (weighting factor) * 80% = 16 points. For Average Rating, the maximum score is 32 points, awarded linearly based on our rating scale of 1-10. If a product has fewer than ten reviews, the point contribution for Average Rating is reduced (one-third reduction in points for products with 5-9 reviews; two-thirds reduction for products with fewer than five reviews). Reviews that are more than 24 months old, as well as those written by resellers, are completely excluded from the ranking algorithm.
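
The scoring rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the site's actual implementation: the function and field names are hypothetical, "awarded linearly" is read as rating / 10 (the text does not say whether a rating of 1 earns zero points), and the exclusion of old or reseller reviews is assumed to have happened before the counts are passed in.

```python
# Weights taken from the description above; together with the rating weight they sum to 100.
COUNT_WEIGHTS = {"reviews": 20, "views": 16, "comparisons": 16, "followers": 16}
RATING_WEIGHT = 32


def rating_points(avg_rating, review_count):
    """Points for Average Rating, reduced for products with few reviews."""
    # Assumption: "awarded linearly" on the 1-10 scale is read as rating / 10.
    points = RATING_WEIGHT * avg_rating / 10
    if review_count < 5:
        return points / 3        # two-thirds reduction
    if review_count < 10:
        return points * 2 / 3    # one-third reduction
    return points


def ranking_score(product, leaders):
    """Weighted aggregate score for one product.

    `leaders` holds the highest count in each area across all products
    (hypothetical structure; counts are assumed to be already filtered).
    """
    score = 0.0
    for area, weight in COUNT_WEIGHTS.items():
        # Each area contributes its weight scaled by the share of the #1 product.
        score += weight * product[area] / leaders[area]
    return score + rating_points(product["avg_rating"], product["reviews"])
```

With this reading, a product that has 80 reviews against a leader with 100 earns 20 * 80 / 100 = 16 points for Reviews, matching the worked example in the text.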

Hadoop Reviews

Read top reviews of Hadoop solutions from the IT Central Station community:
Your trust is our top concern, so companies can't alter or remove reviews.
Real User
Data Science Engineer
Sep 27 2017

What is most valuable?

The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions. You can do it very easily and quickly. It is a managed service from AWS Amazon so it removes a lot of the headaches of configuring... more»

How has it helped my organization?

Well, I've been at two different companies and mostly I'll relate to my experience at HLI, Human Longevity, in San Diego. We used it for genomics. Genomics is a perfect use case for big data. We manage literally terabytes of data using some... more»

What needs improvement?

There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange. It could have been a red herring, it could have been that something else changed in our environment that we never... more»
Consultant
Manager-Projects, Data Analyst & DBA at a tech services company with 10,001+ employees
Oct 04 2017

What is most valuable?

Currently, we are using Netezza to the utmost. We first used it for our data warehouse. Then we moved on to doing analytics. Also, we are now doing some packages on it. Currently we are using it for multiple purposes, but, mostly it's used... more»

How has it helped my organization?

I'm working for a retail client. We have a lot of reports, which are for store managers. The store managers used to get all the reports using the SSIS package, which was built upon our back end and they would have a lot of performance issues.... more»

What needs improvement?

Administration of this product is too tough. It's very complex because of the tools which it's missing. We would require better tools for doing administration. For example, if I need to get a permission of a user on a particular object, it is... more»
MapR
Real User
Founder at Chicago area Hadoop User Group (CHUG)
Mar 02 2018

What is most valuable?

MapR’s strength comes from their file system. Because they start with the raw disk, they are able to expose the storage through various APIs and have the ability to lock down and secure the file system better than the Apache derivatives, which... more»

How has it helped my organization?

To be clear, all three main vendors are capable of supporting multiple big data use cases. Where they differ is in scalability, supported tools, and the stability of the tools. MapR has MapR-DB, which supports most... more»

What needs improvement?

All products have room for improvement. Because of MapR-FS, they have an incredible advantage in terms of stability, cross cluster replication, and extensibility to create products like MapR-DB (Binary and JSON tables) and MapR Streams. One... more»
Real User
Vice President - Big Data and Delivery at a software R&D company with 51-200 employees
Nov 22 2016

What is most valuable?

* Cloudera Manager for administering the Hadoop cluster
* Cloudera-specific solutions like Impala
* Extensive documentation
* Good user community

How has it helped my organization?

Implementing a Hadoop cluster has become relatively straightforward using CDH. Administering it is also less complex. As a result, the effort spent in these areas is less than anticipated.

What needs improvement?

* Some of the UI features seem confusing, e.g. charts on the CM Services page
* Sometimes it gets confusing to follow a single path for installation due to multiple recommended approaches, e.g. parcels vs packages
Real User
Big Data - Senior Solutions Architect at a tech vendor with 10,001+ employees
Mar 29 2017

What is most valuable?

We evaluated Cloudera and Hortonworks. Based on our evaluation and actual experience in production of 60 nodes and development of 12 nodes, the most valuable features of Hortonworks are:
* 100% open
* No lock-in like Cloudera
* Fast and... more»

How has it helped my organization?

It helps a lot with data in motion (ingestion and management in real time). We are able to do 3rd-party monetization of our data within a t+20 minute time frame for our end customers.

What needs improvement?

* Cost
* Reliability
* Speed
* Ease of use
Real User
Business Unit technical Lead at a tech services company with 1,001-5,000 employees
Jul 17 2016

What is most valuable?

The things that I have found of value are those that make database management easier to deal with. I have been using Netezza and Oracle DBA for several years so the big data is a bit of a mind shift for me. The thing that I have found most... more»

How has it helped my organization?

The customer that I am working with has had a portion of the database that they will be shifting from Netezza to BigInsights. This will remove all of the queries and overhead on Netezza around those tables. This will be a big improvement and... more»

What needs improvement?

I have found a lot of issues in Fluid Query and BigInsights Applications to move data in the enterprise version. I have several tickets on these with IBM as they need to be addressed to have a solid implementation. Fortunately none of these... more»
MapR
Consultant
Technical Architect at a tech services company with 10,001+ employees
Jun 29 2017

What do you think of MapR?

Valuable Features: MapR-DB is a NoSQL datastore on top of MapR-FS. The data can be updated and random data can be picked very fast. MapR-DB stores data in MapR-FS and it does not have region servers like HBase does.
Improvements to My Organization: We have a couple of Hadoop development projects actively going on. Some of them involve time series data and others involve customer interaction and marketing data. A NoSQL data store is ideal for such applications.
Room for Improvement: It would be great if it were completely compatible with HBase.
Use of Solution: 9 months.
Stability Issues: No.
Scalability Issues: No.
Customer Service: Our customer purchased a paid support service and so far MapR has addressed our issues well. They even provided a fix...
Real User
Senior Enterprise Architect at a retailer with 10,001+ employees
Sep 28 2017

What is most valuable?

* Its ability to process and query large amounts of structured data.
* Low administrative support in terms of query optimization and indexing support. Indexing and data partitioning are built into the firmware.
* Data compression. It was... more»

How has it helped my organization?

I can't say, because we're using it in such a fashion that a traditional RDBMS could have been used in its place. Our data was relatively small, so we didn't see a huge benefit in transitioning new subject areas into production.

What needs improvement?

Community support. There was none, or very little, especially when using add-on software (e.g. building functions, MapReduce, R, Lua, etc.). Driver support for Windows-based applications. Disaster recovery support. Because it was an... more»
Consultant
Big Data Consultant at a tech services company with 501-1,000 employees
Aug 25 2017

What do you think of Apache Spark?

Valuable Features: The good performance. The nice graphical management console. The long list of ML algorithms.
Improvements to My Organization: We are able to solve problems, e.g., reporting on big data, that we were not able to tackle in the past.
Room for Improvement: Apache Spark provides very good performance, but the tuning phase is still tricky.
Use of Solution: I've used it for 2 years.
Deployment Issues: We didn't have an issue with the deployment.
Stability Issues: In the past we deployed Spark 1.3 to use Spark SQL, but unfortunately one of our queries failed because of a bug that was fixed in later releases. Then we moved to Spark 1.6, but some queries were still failing when run against huge datasets. Now we are using version 2.1: it is more stable, it...
Consultant
BigData Consultant at a tech services company with 10,001+ employees
Sep 27 2017

What is most valuable?

InfoSphere Streams was the one core product from the platform that we were using. We were building a real-time response system and we built it on InfoSphere Streams.

How has it helped my organization?

The solution we built using InfoSphere was mainly aiming to:
* Collect telematics data from vehicle engines.
* Do root cause analysis (RCA) on any engine failure.
* Give recommendations to the customer and the service center, in real time.... more»

What needs improvement?

* The UI was not interactive: Responses used to be very slow and hang up at times.
* The UI was not really helping to track the real-time jobs and their logs.
* You can bring in a better UI for job management and health checks.
* Developer API... more»
MapR
Real User
Director at a tech services company with 51-200 employees
Oct 06 2016

What is most valuable?

The fact that the heavy computation required on Big Data can be distributed across many nodes in a cluster makes this solution a winner. Of course the same concept is available across any solution based on the Hadoop architecture, but... more»

How has it helped my organization?

We implemented this for our client where data from sensors was to be analyzed and sense to be made of this data. Sensor data is huge, and earlier there was no meaningful early warning raised against the discrepancies observed on Sensor’s... more»

What needs improvement?

* Installations and setups are still a bit cryptic and can be improved.
* Skilled resource base in Big Data Tools is generally low and hence project costing is that much higher.
Real User
Solution Architect at MIMOS Berhad
Feb 26 2017

What is most valuable?

* It's the one and only complete open source big data platform
* Ambari-managed admin configuration for HDFS, YARN, Hive, HBase, etc.
* Customized dashboards
* Web-based HDFS browser
* SQL editor for Hive
* Apache Phoenix - OLTP and... more»

How has it helped my organization?

* Maintenance of our own data lake at the enterprise level
* Storage and analysis of server logs
* Applying Operational Intelligence at the enterprise level based on the analysis of various department units' data
* Semantic analysis based on... more»

What needs improvement?

* Rolling upgrade
* Disaster recovery features such as mirroring should be supported
Real User
User at a comms service provider with 1,001-5,000 employees
Sep 28 2017

What is most valuable?

A few of them, namely: Hive/Tez, HBase, Ranger, YARN and Ambari. Ambari helps manage the platform, and Hive is very easy to use. Ranger is for security; with Ranger we can manage users' permissions/access controls very easily.

How has it helped my organization?

We have successfully ported a Microsoft SSIS product application into Hadoop, which saved millions of dollars for the company and, at the same time, they are getting better performance. Also, we implemented fraud detection, as quickly as... more»

What needs improvement?

Hive performance. If Hive performance increased, Hadoop would replace, which would save a lot of money for the company.
Real User
Sr. Software Engineer at a tech vendor with 1-10 employees
Oct 01 2017

What is most valuable?

The most valuable features are the fault tolerance and the easy binding with other processes like machine learning and graph analytics. The community is growing and hence executing ML in a distributed fashion is quite good.

How has it helped my organization?

Previously we were using Hadoop MapReduce to reduce the Google Ngrams (3TB), which took us approximately five days on our cluster. After using Spark, we were able to accomplish this task within hours.

What needs improvement?

This product is already improving as the community is developing it rapidly. More ML-based algorithms should be added to it, to make it algorithm-rich for developers.
Real User
Project Manager
Sep 28 2017

What do you think of Netezza Analytics?

Valuable Features: Speed, storage, and RAM, all of which contribute to large capacity.
Improvements to My Organization: Faster data processing compared to commodity servers.
Room for Improvement: In-DB processing with SAS Analytics. Since this is supposed to be an analytics server, the expectation is there.
Use of Solution: Three-plus years.
Stability Issues: No.
Scalability Issues: No.
Customer Service and Technical Support: Eight out of 10.
Previous Solutions: Unix, commodity server. Switched to check this technology.
Initial Setup: SAS server (Unix) and data storage only.
Pricing, Setup Cost and Licensing: Expensive to maintain compared to other solutions.
Other Solutions Considered: Teradata and Greenplum.
Other...
Consultant
Senior Information Management Consultant at a retailer with 51-200 employees
Sep 25 2016

What do you think of Netezza Analytics?

Valuable Features: There is minimal on-going maintenance.
Improvements to My Organization: There were some reports we couldn't get because of performance, but with Netezza we could get more of them.
Room for Improvement: It needs a better in-house development tool.
Use of Solution: I've used it for three years.
Deployment Issues: Functions are difficult to deploy.
Stability Issues: No issues encountered.
Scalability Issues: No issues encountered.
Customer Service and Technical Support: Customer Service: 7/10. Technical Support: 7/10.
Previous Solutions: Our customers mostly switch from traditional RDBMSs to appliances.
Initial Setup: It is very easy to set up.
Implementation Team: We used an in-house vendor who were 8/10. ...
Real User
Architect at a healthcare company with 51-200 employees
Sep 27 2017

What do you think of Apache Spark?

Valuable Features: ETL and streaming capabilities.
Improvements to My Organization: Made Big Data processing more convenient. A uniform framework adds to efficiency of usage, since the same framework can be used for batch and stream processing.
Room for Improvement: Stability in terms of API (things were difficult when transitioning from RDD to DataFrames, then to DataSet).
Use of Solution: I have used Spark since its inception in March 2015, from Spark 1.1 onwards. Currently, I use 2.2 extensively.
Stability Issues: Yes, occasionally with different APIs.
Scalability Issues: No.
Customer Service and Technical Support: Since we were using the Open Source version of Apache Spark, without the Databricks support, we never used technical support from...
Consultant
BigData(QA & RnD) with 51-200 employees
Sep 27 2017

What do you think of Hortonworks Data Platform?

Valuable Features: Ambari Web UI: user-friendly Views for Hive, Tez, Pig, Spark and Ranger.
Improvements to My Organization: It has helped our organisation cater to clients who are using Big Data for data storage and analysis combined with our security product.
Room for Improvement: Deleting any service requires a lot of clean up, unlike Cloudera.
Use of Solution: Five years.
Stability Issues: Not until now.
Scalability Issues: No.
Customer Service and Technical Support: Very supportive, prompt responses.
Previous Solutions: We didn't use a previous solution.
Initial Setup: The Ambari upgrade is not very user-friendly.
Pricing, Setup Cost and Licensing: Not applicable.
Other Solutions Considered: Cloudera and MapR.
Other...
Consultant
Manager | Data Science Enthusiast | Management Consultant at a consultancy with 5,001-10,000 employees
Dec 10 2017

What do you think of Apache Spark?

Improvements to My Organization: Organisations can now harness richer data sets and benefit from use cases which add value to their business functions.
Valuable Features: Distributed in-memory processing. Some of the algorithms are resource heavy and executing them requires a lot of RAM and CPU. With Hadoop-related technologies, we can distribute the workload across multiple commodity machines.
Room for Improvement: Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing.
Use of Solution: Three to five years.
Stability Issues: At times, when users do not know how to use Spark and request a lot of resources, the underlying JVMs can crash, which is a big source of worry.
Scalability Issues: No...
Consultant
Big Data and Cloud Solution Consultant at a financial services firm with 10,001+ employees
Oct 02 2017

What do you think of Apache Spark?

Valuable Features: DataFrame: Spark SQL gives the leverage to create applications more easily and with less coding effort.
Improvements to My Organization: We developed a tool for data ingestion from HDFS->Raw->L1 layer with data quality checks, putting data into Elasticsearch, and performing CDC.
Room for Improvement: Dynamic DataFrame options are not yet available.
Use of Solution: One and a half years.
Stability Issues: No.
Scalability Issues: No.
Other Advice: Spark gives the flexibility for developing custom applications.
191
Associate Consultant
UC Santa Cruz Banana Slug alumni currently working as a Big Data Administrator for Saama Technologies at our client site CSAA Insurance Group in Phoenix, AZ. Providing support for various big data infrastructure needs, including Hadoop and Splunk. Big Data and Analytics is a field I have enjoyed... more>>
Reviewed Hortonworks Data Platform: The Ambari UI is valuable for cluster monitoring,...
689
ICT Consultant (Advanced Infrastructure)
I'm looking for real big challenges in Big Data and Networking!
Reviewed Hortonworks Data Platform: The Ambari server provides the user an easy way to...
210
Big Data Consultant
I am a data aficionado with an interest for tech that challenges the status quo.
Reviewed Hortonworks Data Platform: It allows us to provide our customers with data...
116
Lead IT Consultant
I have around 7 years of experience in architecting, designing and developing enterprise applications using Core Java, J2EE and BIG DATA technologies. Followed Agile methodologies while delivering the projects and was responsible for Sprint Planning, Demo, and Capacity Planning. Used cutting... more>>
231
Senior Information Management Consultant
For the past nine years I have been working as an Information Management Consultant. I am both a certified IBM DataStage Professional and a certified IBM Netezza Professional. I have worked on many DataStage projects, along with projects in other platforms such as ODI, SSIS, and PL/SQL. I have... more>>
