We performed a comparison between Apache Hadoop and IBM Db2 Warehouse based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so."
"I liked that Apache Hadoop was powerful, had a lot of tools, and the fact that it was free and community-developed."
"It's good for storing historical data and handling analytics on a huge amount of data."
"Most valuable features are HDFS and Kafka: Ingestion of huge volumes and variety of unstructured/semi-structured data is feasible, and it helps us to quickly onboard a new Big Data analytics prospect."
"Initially, with RDBMS alone, we had a lot of work and few servers running on-premise and on cloud for the PoC and incubation. With the use of Hadoop and ecosystem components and tools, and managing it in Amazon EC2, we have created a Big Data "lab" which helps us to centralize all our work and solutions into a single repository. This has cut down the time in terms of maintenance, development and, especially, data processing challenges."
"Hadoop is extensible — it's elastic."
"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database."
"The most valuable feature is the database."
"Some of the best features are stored procedures, parallelism, and different indexing strategies."
"Provides good security and reliability."
"The standout feature of IBM Db2 Warehouse, which is particularly valuable for large enterprises, is its ability to handle big data."
"It can be mounted on the cloud, which is a huge plus. If the client, for example, starts small with on-premise deployment and then it rapidly needs to grow, we can transfer this to the cloud easily."
"The analytics engine is not bad at forecasting predictions."
"I think it scales really well and as long as you take enough time to learn a little bit about it, it works really well."
"In certain cases, the configurations for dealing with data skewness do not make any sense."
"The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks."
"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it."
"I would like to see more direct integration of visualization applications."
"It would be helpful to have more information on how to best apply this solution to smaller organizations, with less data, and grow the data lake."
"Hadoop's security could be better."
"The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment."
"The stability of the solution needs improvement."
"The areas of the solution that need the most improvement are separating compute from storage and elasticity, which means scaling up and then retracting."
"In terms of improvement, IBM Db2 Warehouse should be more scalable."
"Lacks sufficient documentation, particularly in Spanish."
"The biggest challenge anyone could have with Db2 Warehouse is their references or online resources and documentation. They are very, very, very limited on the web."
"There should be more material available for training and training should be free."
"IBM Db2 Warehouse needs to improve its interface."
"The biggest problem we have is when the backup solution fails or is slow and we run out of log space, which has happened probably a couple of times in the last four years."
Apache Hadoop is ranked 5th in Data Warehouse with 32 reviews, while IBM Db2 Warehouse is ranked 14th in Data Warehouse with 8 reviews. Apache Hadoop is rated 7.8, while IBM Db2 Warehouse is rated 7.6. The top reviewer of Apache Hadoop writes "A file system for data collection that contains needed information and files". On the other hand, the top reviewer of IBM Db2 Warehouse writes "Useful for ETL process and has good documentation". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and Dremio, whereas IBM Db2 Warehouse is most compared with Oracle Exadata, Snowflake, Amazon Redshift, IBM Db2 Warehouse on Cloud and BigQuery. See our Apache Hadoop vs. IBM Db2 Warehouse report.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.