Compare Apache Hadoop vs. Pivotal Greenplum

Apache Hadoop is ranked 4th in Data Warehouse with 7 reviews while Pivotal Greenplum is ranked 6th in Data Warehouse with 5 reviews. Apache Hadoop is rated 7.6, while Pivotal Greenplum is rated 7.8. The top reviewer of Apache Hadoop writes "We are able to ingest huge volumes/varieties of data, but it needs a data visualization tool and enhanced Ambari for management". On the other hand, the top reviewer of Pivotal Greenplum writes "Handles complex queries and report production efficiently, integrates with Hadoop". Apache Hadoop is most compared with Snowflake, Pivotal Greenplum and Oracle Exadata, whereas Pivotal Greenplum is most compared with Apache Hadoop, Amazon Redshift and Teradata. See our Apache Hadoop vs. Pivotal Greenplum report.
Cancel
You must select at least 2 products to compare!
Apache Hadoop Logo
11,979 views|10,438 comparisons
Pivotal Greenplum Logo
11,662 views|8,217 comparisons
Most Helpful Review
Find out what your peers are saying about Apache Hadoop vs. Pivotal Greenplum and other solutions. Updated: September 2019.
371,639 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
What comes with the standard setup is what we mostly use, but Ambari is the most important.The best thing about this solution is that it is very powerful and very cheap.The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics.Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.Initially, with RDBMS alone, we had a lot of work and few servers running on-premise and on cloud for the PoC and incubation. With the use of Hadoop and ecosystem components and tools, and managing it in Amazon EC2, we have created a Big Data "lab" which helps us to centralize all our work and solutions into a single repository. This has cut down the time in terms of maintenance, development and, especially, data processing challenges.Since both Apache Hadoop and Amazon EC2 are elastic in nature, we can scale and expand on demand for a specific PoC, and scale down when it's done.Most valuable features are HDFS and Kafka: Ingestion of huge volumes and variety of unstructured/semi-structured data is feasible, and it helps us to quickly onboard a new Big Data analytics prospect.High throughput and low latency. We start with data mashing on Hive and finally use this for KPI visualization.

Read more »

The most valuable feature for us is horizontal scaling.Scalability is simple because it's an MPP database. If you need more processing power or you need more storage, you just add a few more nodes in the cluster. It works on common commodity hardware. You can use any type of server. You don't need to have proprietary hardware. It's fairly flexible.We chose Greenplum because of the architecture in terms of clustering databases and being able to have, or at least utilize the resources that are sitting on a database.It's one of the fastest databases in the market. It's easy to use. From a maintenance perspective it's a good product. The segmentation, or architecture of the product is different than other databases such as Oracle. So even in 10 years, the data distribution for such segments will not affect other segments. The query performance of the product, for complex queries, is very good. It has good integration with Hadoop.Very fast for query processing.

Read more »

Cons
In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency.The upgrade path should be improved because it is not as easy as it should be.We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.I would like to see more direct integration of visualization applications.Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them.General installation/dependency issues were there, but were not a major, complex issue. While migrating data from MySQL to Hive, things are a little challenging, but we were able to get through that with support from forums and a little trial and error.It needs better user interface (UI) functionalities.

Read more »

I saw some limitation with respect to the column store, and removing this would be an improvement.Some integration with other platforms like design tools, and ETL development tools, that will enable some advanced functionality, like fully down processing, etc.The installation is difficult and should be made easier.Implementation takes a long time.One of the disadvantages, not a disadvantage with the product itself, but overall, is the expertise in the marketplace. It's not easy to find a Greenplum administrator in the market, compared to other products such as Oracle.they need to interact more with customers. They need to explain the features, especially when there are new releases of Greenplum. I know just from information I've found that it has other features, it can be used to for analytics, for integration with Big Data, Hadoop. They need to focus on this part with the customer.They need to enhance integration with other Big Data products... to integrate with Big Data platforms, and to open a bi-directional connection between Greenplum and Big Data.It will be very useful if we could communicate with other database types from Greenplum (using a database link).

Read more »

Pricing and Cost Advice
This is a low cost and powerful solution.​There are no licensing costs involved, hence money is saved on the software infrastructure​.

Read more »

We are using the open-source version of this solution.Pricing is good compared to other products. It's fine.It is the best product with best fit for price/performance customer objectives.

Read more »

report
Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
371,639 professionals have used our research since 2012.
Ranking
4th
out of 30 in Data Warehouse
Views
11,979
Comparisons
10,438
Reviews
7
Average Words per Review
478
Avg. Rating
7.6
6th
out of 30 in Data Warehouse
Views
11,662
Comparisons
8,217
Reviews
5
Average Words per Review
447
Avg. Rating
7.8
Top Comparisons
Compared 31% of the time.
Compared 30% of the time.
Compared 13% of the time.
Compared 43% of the time.
Compared 11% of the time.
Compared 11% of the time.
Also Known As
Greenplum
Learn
Apache
Pivotal
Overview
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

Advanced analytics meets traditional business intelligence with Pivotal Greenplum, the world’s first fully-featured, multi-cloud, massively parallel processing (MPP) data platform based on the open source Greenplum Database. Pivotal Greenplum provides comprehensive and integrated analytics on multi-structured data. Powered by one of the world’s most advanced cost-based query optimizers, Pivotal Greenplum delivers unmatched analytical query performance on massive volumes of data.

Offer
Learn more about Apache Hadoop
Learn more about Pivotal Greenplum
Sample Customers
Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web LabGeneral Electric, Conversant, China CITIC Bank, Aridhia, Purdue University
Top Industries
VISITORS READING REVIEWS
Software R&D Company29%
Financial Services Firm20%
Comms Service Provider12%
Government7%
REVIEWERS
Financial Services Firm44%
Marketing Services Firm19%
Comms Service Provider19%
Software R&D Company6%
VISITORS READING REVIEWS
Software R&D Company25%
Financial Services Firm20%
Comms Service Provider12%
Insurance Company6%
Find out what your peers are saying about Apache Hadoop vs. Pivotal Greenplum and other solutions. Updated: September 2019.
371,639 professionals have used our research since 2012.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Sign Up with Email