Compare Apache Hadoop vs. IBM Db2 Warehouse

Cancel
You must select at least 2 products to compare!
Most Helpful Review
Find out what your peers are saying about Apache Hadoop vs. IBM Db2 Warehouse and other solutions. Updated: July 2021.
523,535 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
"What comes with the standard setup is what we mostly use, but Ambari is the most important.""The ability to add multiple nodes without any restriction is the solution's most valuable aspect.""It's good for storing historical data and handling analytics on a huge amount of data.""The most valuable feature is the database.""The most valuable features are powerful tools for ingestion, as data is in multiple systems.""The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so.""The performance is pretty good.""Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability."

More Apache Hadoop Pros »

"Some of the best features are stored procedures, parallelism, and different indexing strategies.""I think it scales really well and as long as you take enough time to learn a little bit about it, it works really well."

More IBM Db2 Warehouse Pros »

Cons
"In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency.""There is a lack of virtualization and presentation layers, so you can't take it and implement it like a radio solution.""The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment.""It would be good to have more advanced analytics tools.""It would be helpful to have more information on how to best apply this solution to smaller organizations, with less data, and grow the data lake.""The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.""The solution is very expensive.""From the Apache perspective or the open-source community, they need to add more capabilities to make life easier from a configuration and deployment perspective."

More Apache Hadoop Cons »

"The areas of the solution that is needing the most improvement are separating compute from storage, elasticity, which means scaling up and then retracting.""The biggest problems we have is when the backup solution is failing or slow and we run out of log space, which has happened probably a couple of times in the last four years."

More IBM Db2 Warehouse Cons »

Pricing and Cost Advice
Information Not Available
"In a traditional on-prem database, in a data warehouse, the solution is probably on the expensive side."

More IBM Db2 Warehouse Pricing and Cost Advice »

report
Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
523,535 professionals have used our research since 2012.
Questions from the Community
Top Answer: Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability.
Top Answer: I don't have any concerns because each part of Hadoop has its use cases. To date, I haven't implemented a huge product or project using Hadoop, but on the level of POCs, it's fine. The community of… more »
Top Answer: Some of the best features are stored procedures, parallelism, and different indexing strategies.
Top Answer: In a traditional on-prem database, in a data warehouse, the solution is probably on the expensive side.
Top Answer: The areas of the solution that is needing the most improvement are separating compute from storage, elasticity, which means scaling up and then retracting.
Ranking
7th
out of 30 in Data Warehouse
Views
8,838
Comparisons
7,187
Reviews
8
Average Words per Review
429
Rating
7.5
16th
out of 30 in Data Warehouse
Views
335
Comparisons
246
Reviews
2
Average Words per Review
468
Rating
8.0
Popular Comparisons
Also Known As
InfoSphere Warehouse, IBM InfoSphere Warehouse
Learn More
Overview
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.
InfoSphere Warehouse is a suite of products that combines the strength of DB2 with a data warehousing infrastructure from IBM. You can use InfoSphere Warehouse to build a complete data warehousing solution that includes a highly scalable relational database, data access capabilities, and front-end analysis tools.
Offer
Learn more about Apache Hadoop
Learn more about IBM Db2 Warehouse
Sample Customers
Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
Alameda County Social Services Agency, Sui Southern Gas Company Limited
Top Industries
VISITORS READING REVIEWS
Computer Software Company31%
Comms Service Provider17%
Financial Services Firm13%
Energy/Utilities Company5%
VISITORS READING REVIEWS
Computer Software Company23%
Comms Service Provider23%
Financial Services Firm13%
Energy/Utilities Company11%
Company Size
REVIEWERS
Small Business37%
Midsize Enterprise21%
Large Enterprise42%
No Data Available
Find out what your peers are saying about Apache Hadoop vs. IBM Db2 Warehouse and other solutions. Updated: July 2021.
523,535 professionals have used our research since 2012.

Apache Hadoop is ranked 7th in Data Warehouse with 8 reviews while IBM Db2 Warehouse is ranked 16th in Data Warehouse with 2 reviews. Apache Hadoop is rated 7.6, while IBM Db2 Warehouse is rated 8.0. The top reviewer of Apache Hadoop writes "Great micro-partitions, helpful technical support and quite stable". On the other hand, the top reviewer of IBM Db2 Warehouse writes "If you have good people designing how the data is stored, this is a marvelous tool". Apache Hadoop is most compared with Snowflake, Microsoft Azure Synapse Analytics, VMware Tanzu Greenplum and Oracle Exadata, whereas IBM Db2 Warehouse is most compared with Oracle Exadata, Microsoft Azure Synapse Analytics, Oracle Autonomous Data Warehouse, Teradata and Microsoft Parallel Data Warehouse. See our Apache Hadoop vs. IBM Db2 Warehouse report.

See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.

We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.