We performed a comparison between Apache Hadoop and Snowflake based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform."
"The ability to add multiple nodes without any restriction is the solution's most valuable aspect."
"The scalability of Apache Hadoop is very good."
"The most valuable features are powerful tools for ingestion, as data is in multiple systems."
"Hadoop is extensible — it's elastic."
"Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial."
"Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing."
"What comes with the standard setup is what we mostly use, but Ambari is the most important."
"Snowflake is an enormously useful platform. The Snowpipe feature is valuable because it allows us to load terabytes and petabytes of data into the data mart at a very low cost."
"The most valuable feature is the clone copy."
"Time travel is one feature that really helps us out."
"The most valuable feature of Snowflake is its performance. We can access the data quickly. Additionally, it handles structured and non-structured data."
"I like Snowflake's data exchange capabilities. It can exchange data with downstream systems and other vendor partners as well."
"It is a cloud solution with many useful features. It has the data science capability. It can transform data and prepare data for a data science project with scalability."
"The most valuable features of Snowflake are its performance and power."
"Snowflake is faster than on-premise systems and allows for variable compute power based on need."
"The solution is very expensive."
"We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it."
"The stability of the solution needs improvement."
"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it."
"It would be good to have more advanced analytics tools."
"It could be more user-friendly."
"I would like to see more direct integration of visualization applications."
"Hadoop's security could be better."
"They have a new console, but I couldn't figure out anything in the new console. So, if I shift to the old console, I can figure out where to create the database schema and other things, but I have no idea where to go in the new console. That's one thing they can improve. I don't know why they created a new console to confuse. The old, classic console is much better."
"Maybe there could be some more connectors to other systems, but this is what they are constantly developing anyway."
"There are always a few operation updates here and there that can be made."
"The solution could improve by allowing non-structured data, such as PDFs, images, or videos. We cannot see the data."
"The user interface continues to be an issue, especially when we need to get data out of Snowflake. It's very easy to get data in, but it's not too easy to get it out or extract it."
"The cost of the solution could be reduced."
"Sometimes it can be tricky to manage multiple environments if you're purely using Snowflake as your scripting and pipeline environment."
"The solution needs more connectors."
Apache Hadoop is ranked 5th in Data Warehouse with 32 reviews while Snowflake is ranked 1st in Data Warehouse with 92 reviews. Apache Hadoop is rated 7.8, while Snowflake is rated 8.4. The top reviewer of Apache Hadoop writes "A file system for data collection that contains needed information and files". On the other hand, the top reviewer of Snowflake writes "Good usability, good data sharing and elastic compute features, and requires less DBA involvement". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Teradata and BigQuery, whereas Snowflake is most compared with BigQuery, Azure Data Factory, Teradata, Vertica and Teradata Cloud Data Warehouse. See our Apache Hadoop vs. Snowflake report.
See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Apache Hadoop is for data lake use cases. But getting data out of Hadoop for meaningful analytics is indeed need quite an amount of work. by either using spark/Hive/presto and so on. The way i look at Snowflake and Hadoop is they complement each other. For data lake you can use hadoop and then for datawarehouse companies can use snowflake. Depending on the size of the company you can turn snowflake into a data lake use case too. Snowflake is SQL friendly and you don't need to carry out any circus to get the data in and out of snowflake.