We performed a comparison between Apache Hadoop and IBM Netezza Performance Server based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Initially, with RDBMS alone, we had a lot of work and few servers running on-premise and on cloud for the PoC and incubation. With the use of Hadoop and ecosystem components and tools, and managing it in Amazon EC2, we have created a Big Data "lab" which helps us to centralize all our work and solutions into a single repository. This has cut down the time in terms of maintenance, development and, especially, data processing challenges."
"Hadoop File System is compatible with almost all the query engines."
"The tool's stability is good."
"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database."
"The most valuable features are powerful tools for ingestion, as data is in multiple systems."
"What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies."
"Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing."
"The scalability of Apache Hadoop is very good."
"The data governance prospect... from what I've seen, that is a really powerful tool as well, to help with data lineage and keeping track of that."
"We are able to execute very complex queries. Over 90 percent of our query executions are one second or less. We do millions of queries everyday."
"The most valuable features of the IBM Netezza Performance Server are the NPS server because of the reduced maintenance and overall good performance."
"The underlying hardware that IBM provides with this appliance is made for a specific purpose, to serve performance on a large amount of data, and to do analytics as well. It is faster, when you compare it to any other product."
"The benefit is really because of the additional speed that we have and, truth be told, the more updated ETL processes and the revamped scheduler in general."
"Distribution concurrency control."
"The most valuable feature would be the fact that it has been running for awhile in an appliance format."
"IBM Netezza Performance Server is a cost-effective solution."
"In certain cases, the configurations for dealing with data skewness do not make any sense."
"It needs better user interface (UI) functionalities."
"Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them."
"In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency."
"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it."
"The solution is not easy to use. The solution should be easy to use and suitable for almost any case connected with the use of big data or working with large amounts of data."
"There is a lack of virtualization and presentation layers, so you can't take it and implement it like a radio solution."
"The price could be better. I think we would use it more, but the company didn't want to pay for it. Hortonworks doesn't exist anymore, and Cloudera killed the free version of Hadoop."
"Concurrency limit needs to be increased somewhat."
"The scalability is not as expected. The capacity in the black box is not enough."
"Our main problem with it is concurrency. When there are too many users running Netezza at the same time, this is when we have the most complaints."
"Oracle Exadata's security features, like TDE encryption, are missing in IBM Netezza Performance Server."
"In terms of features that I would like to see, one is the ability to actually scale out an architecture. Right now, if you buy one, it's fixed. There is no scale-up availability at all."
"We are not able to scale. The only way to scale is to get another appliance, but we have a customers who would need us to hydrate the data between the two appliances, and Netezza does not do that."
"LIke Teradata, we can’t add a node/SPU to the existing appliance."
"IBM Netezza Performance Server could improve its interface, support for big data, and APA-based connectivity should be available."
More IBM Netezza Performance Server Pricing and Cost Advice →
Apache Hadoop is ranked 5th in Data Warehouse with 32 reviews while IBM Netezza Performance Server is ranked 10th in Data Warehouse with 33 reviews. Apache Hadoop is rated 7.8, while IBM Netezza Performance Server is rated 8.0. The top reviewer of Apache Hadoop writes "A file system for data collection that contains needed information and files". On the other hand, the top reviewer of IBM Netezza Performance Server writes "A cost-effective data warehousing tool, but security features like TDE encryption are missing". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata and Snowflake, whereas IBM Netezza Performance Server is most compared with Oracle Exadata, Snowflake, Oracle Database, Teradata and IBM Db2 Warehouse on Cloud. See our Apache Hadoop vs. IBM Netezza Performance Server report.
See our list of best Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.