We performed a comparison between Apache Hadoop and Snowflake based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform."
"Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability."
"One valuable feature is that we can download data."
"What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies."
"The best thing about this solution is that it is very powerful and very cheap."
"The scalability of Apache Hadoop is very good."
"It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming."
"The most valuable feature is scalability and the possibility to work with major information and open source capability."
"I like Snowflake's data exchange capabilities. It can exchange data with downstream systems and other vendor partners as well."
"Scaling is a big plus point of Snowflake."
"The tool is very easy to use. The solution’s desktop features are also very easy to use. Also, the product’s SQL-based connectivity is also good. It can connect with any tool."
"It helped us to build MVP (minimum viable product) for our idea of building a data warehouse model for small businesses."
"This is the advanced version of the cloud version, so it's really a flexible tool. If you have it implemented at home, you can access it from anywhere."
"Its performance is most valuable. As compared to SQL Server, we are able to see a significant improvement in performance with Snowflake."
"My company wanted to have all our data in one single place and this what we use Snowflake for. Snowflake also allows us to build connectors to different data sources."
"The Mbps they have established is quite a bit faster than any other data warehouse."
"The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support."
"The upgrade path should be improved because it is not as easy as it should be."
"Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them."
"The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment."
"From the Apache perspective or the open-source community, they need to add more capabilities to make life easier from a configuration and deployment perspective."
"The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning."
"I mentioned it definitely, and this is probably the only feature we can improve a little bit because the terminal and coding screen on Hadoop is a little outdated, and it looks like the old C++ bio screen. If the UI and UX can be improved slightly, I believe it will go a long way toward increasing adoption and effectiveness."
"The stability of the solution needs improvement."
"It would be better if they had a data profile tool that tells me where the gaps are in my time series data."
"I think that Snowflake could improve its user interface. The current one is not interactive."
"Its pricing or affordability is one of the big challenges. Pricing was the only thing that we didn't like about Snowflake. In terms of technical features, it is a complete solution."
"Its stability could be better."
"I see room for improvement when it comes to credit performance. The other thing I'd like to be improved is the warehouse facility."
"Currently, Snowflake doesn't support unstructured data."
"The user interface continues to be an issue, especially when we need to get data out of Snowflake. It's very easy to get data in, but it's not too easy to get it out or extract it."
"I am still in the learning stage. It has good security, but it can always be more secure."
Apache Hadoop is ranked 5th in Data Warehouse with 33 reviews while Snowflake is ranked 1st in Data Warehouse with 92 reviews. Apache Hadoop is rated 7.8, while Snowflake is rated 8.4. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of Snowflake writes "Good usability, good data sharing and elastic compute features, and requires less DBA involvement". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Teradata and BigQuery, whereas Snowflake is most compared with BigQuery, Azure Data Factory, Teradata, Vertica and Teradata Cloud Data Warehouse. See our Apache Hadoop vs. Snowflake report.
See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Apache Hadoop is for data lake use cases. But getting data out of Hadoop for meaningful analytics is indeed need quite an amount of work. by either using spark/Hive/presto and so on. The way i look at Snowflake and Hadoop is they complement each other. For data lake you can use hadoop and then for datawarehouse companies can use snowflake. Depending on the size of the company you can turn snowflake into a data lake use case too. Snowflake is SQL friendly and you don't need to carry out any circus to get the data in and out of snowflake.