We performed a comparison between Amazon Redshift and Apache Hadoop based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The most valuable features are that it's easy to set up and easy to connect the many tools that connect to it."
"I have primarily used the Redshift Spectrum feature and found it most valuable."
"If the analyst knows SQL, which is comfortable and easy to use to go between all of these tool stacks, I think it's reliable. It's a secure and reliable data warehouse."
"This service can merge and integrate well with all databases."
"Amazon Redshift offers a relatively flexible structure...I rate the technical support a nine out of ten."
"The ability to reload data multiple times at different times."
"The product is relatively easy to use because there is no indexing and no partitions."
"Redshift's Excel features are handy. Redshift spectrum allows you to directly query the data on an Excel sheet. Now, SQL Server also allows this, but Redshift has many more features."
"It's open-source, so it's very cost-effective."
"The most valuable features are powerful tools for ingestion, as data is in multiple systems."
"It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming."
"Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial."
"I liked that Apache Hadoop was powerful, had a lot of tools, and the fact that it was free and community-developed."
"Data ingestion: It has rapid speed, if Apache Accumulo is used."
"What comes with the standard setup is what we mostly use, but Ambari is the most important."
"Since both Apache Hadoop and Amazon EC2 are elastic in nature, we can scale and expand on demand for a specific PoC, and scale down when it's done."
"It would be nice if we could turn off an instance. However, it would retain the instance in history, thus allowing us to restart without beginning from scratch."
"It takes a lot of time to ingest and update the data."
"One area where Amazon Redshift could improve is in adopting the compute-separate, data-separate architecture, which Delta, Snowflake are adopting, and a few others in the cloud data warehouse spectrum."
"The customer support could be more responsive."
"There is some missing functionality and sometimes it's so difficult to work in. We need to convert these functionalities using VACUUM inside Amazon Redshift and then it causes some complexity."
"This solution lacks integration with non-AWS sources."
"The initial setup is a complex process, especially for someone who is not familiar with nodes and configuring terms like RPUs."
"It lacks a few features which can be very useful, such as stored procedures"
"The price could be better. I think we would use it more, but the company didn't want to pay for it. Hortonworks doesn't exist anymore, and Cloudera killed the free version of Hadoop."
"The upgrade path should be improved because it is not as easy as it should be."
"It could be more user-friendly."
"Hadoop's security could be better."
"The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support."
"It would be good to have more advanced analytics tools."
"The solution is not easy to use. The solution should be easy to use and suitable for almost any case connected with the use of big data or working with large amounts of data."
"I would like to see more direct integration of visualization applications."
Amazon Redshift is ranked 4th in Cloud Data Warehouse with 59 reviews while Apache Hadoop is ranked 5th in Data Warehouse with 33 reviews. Amazon Redshift is rated 7.8, while Apache Hadoop is rated 7.8. The top reviewer of Amazon Redshift writes "Provides one place where we can store data, and allows us to easily connect to other services with AWS". On the other hand, the top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". Amazon Redshift is most compared with Snowflake, Teradata, AWS Lake Formation, Vertica and Microsoft Azure Synapse Analytics, whereas Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and Dremio. See our Amazon Redshift vs. Apache Hadoop report.
See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.