Apache Hadoop vs BigQuery comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,387 views|2,021 comparisons
87% willing to recommend
Google Logo
3,751 views|2,734 comparisons
100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and BigQuery based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. BigQuery Report (Updated: May 2024).
772,679 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Apache Hadoop is crucial in projects that save and retrieve data daily. Its valuable features are scalability and stability. It is easy to integrate with the existing infrastructure.""Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.""Hadoop is extensible — it's elastic.""High throughput and low latency. We start with data mashing on Hive and finally use this for KPI visualization.""Since both Apache Hadoop and Amazon EC2 are elastic in nature, we can scale and expand on demand for a specific PoC, and scale down when it's done.""The most valuable features are powerful tools for ingestion, as data is in multiple systems.""The ability to add multiple nodes without any restriction is the solution's most valuable aspect.""It's open-source, so it's very cost-effective."

More Apache Hadoop Pros →

"BigQuery is a powerful tool for managing and analyzing large datasets. The versatility of BigQuery extends to its compatibility with external data visualization tools like Power BI and Tableau. This means you not only get query results but can also seamlessly integrate and visualize your data for better insights.""It has a proprietary way of storing and accessing data in its own data store and is 100% managed without you needing to install anything. There is no need to arrange for any infrastructure to be able to use this solution.""The product is serverless. We only need to write SQL queries to analyze the data. We need to pay based on the number of queries. The retrieval time is very less. Even if you write large queries, the tool is able to bring back data in a few seconds.""The initial setup process is easy.""We basically used it to store server data and generate reports for enterprise architects. It was a valuable tool for our enterprise design architect.""The most valuable features of BigQuery is that it supports standard SQL and provides good performance.""The query tool is scalable and allows for petabytes of data.""It's similar to a Hadoop cluster, except it's managed by Google."

More BigQuery Pros →

Cons
"There is a lack of virtualization and presentation layers, so you can't take it and implement it like a radio solution.""We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.""The upgrade path should be improved because it is not as easy as it should be.""I think more of the solution needs to be focused around the panel processing and retrieval of data.""The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks.""It could be more user-friendly.""The solution is very expensive.""Real-time data processing is weak. This solution is very difficult to run and implement."

More Apache Hadoop Cons →

"The main challenges are in the areas of performance and cost optimizations.""We'd like to have more integrations with other technologies.""They could enhance the platform's user accessibility.""The solution hinges on Google patterns so continued improvement is important.""The primary hurdle in this migration lies in the initial phase of moving substantial volumes of data to cloud-based platforms.""It would be helpful if they could provide some dashboards where you can easily view charts and information.""The initial setup could be improved making it easier to deploy.""As a product, BigQuery still requires a lot of maturity to accommodate other use cases and to be widely acceptable across other organizations."

More BigQuery Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "I have tried my own setup using my Gmail ID, and I think it had a $300 limit for free for a new user. That's what Google is offering, and we can register and create a project."
  • "BigQuery is inexpensive."
  • "One terabyte of data costs $20 to $22 per month for storage on BigQuery and $25 on Snowflake. Snowflake is costlier for one terabyte, but BigQuery charges based on how much data is inserted into the tables. BigQuery charges you based on the amount of data that you handle and not the time in which you handle it. This is why the pricing models are different and it becomes a key consideration in the decision of which platform to use."
  • "The price is a bit high but the technology is worth it."
  • "The price could be better. Usually, you need to buy the license for a year. Whenever you want more, you can subscribe to it, and you can use it. Otherwise, you can terminate the license. You can use it daily or monthly, and we use it based on a project's requirements."
  • "The solution is pretty affordable and quite cheap in comparison to PDP or Cloudera."
  • "BigQuery pricing can increase quickly. It's a high-priced solution."
  • "The pricing is good and there are no additional costs involved."
  • More BigQuery Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
    772,679 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
    Top Answer:Since it is an open-source product, there won't be much support. So, you have to have deeper knowledge. You need to improvise based on that.
    Top Answer:The initial setup process is easy.
    Top Answer:They could enhance the platform's user accessibility. Currently, the structure of BigQuery leans more towards catering to hard-code developers, making it less user-friendly for data analysts or… more »
    Ranking
    6th
    out of 35 in Data Warehouse
    Views
    2,387
    Comparisons
    2,021
    Reviews
    13
    Average Words per Review
    530
    Rating
    7.8
    5th
    Views
    3,751
    Comparisons
    2,734
    Reviews
    29
    Average Words per Review
    485
    Rating
    8.1
    Comparisons
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    BigQuery is an enterprise data warehouse that solves this problem by enabling super-fast SQL queries using the processing power of Google's infrastructure. ... You can control access to both the project and your data based on your business needs, such as giving others the ability to view or query your data.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Information Not Available
    Top Industries
    REVIEWERS
    Financial Services Firm35%
    Comms Service Provider24%
    Hospitality Company6%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm29%
    Computer Software Company11%
    University6%
    Manufacturing Company5%
    REVIEWERS
    Financial Services Firm11%
    Comms Service Provider11%
    Computer Software Company11%
    Wellness & Fitness Company6%
    VISITORS READING REVIEWS
    Computer Software Company17%
    Financial Services Firm13%
    Manufacturing Company11%
    Retailer7%
    Company Size
    REVIEWERS
    Small Business33%
    Midsize Enterprise19%
    Large Enterprise47%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business31%
    Midsize Enterprise21%
    Large Enterprise48%
    VISITORS READING REVIEWS
    Small Business21%
    Midsize Enterprise13%
    Large Enterprise67%
    Buyer's Guide
    Apache Hadoop vs. BigQuery
    May 2024
    Find out what your peers are saying about Apache Hadoop vs. BigQuery and other solutions. Updated: May 2024.
    772,679 professionals have used our research since 2012.

    Apache Hadoop is ranked 6th in Data Warehouse with 34 reviews while BigQuery is ranked 5th in Cloud Data Warehouse with 31 reviews. Apache Hadoop is rated 7.8, while BigQuery is rated 8.2. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of BigQuery writes "Expandable and easy to set up but needs more local data residency". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and VMware Tanzu Data Services, whereas BigQuery is most compared with Snowflake, Teradata, Oracle Autonomous Data Warehouse, Vertica and AWS Lake Formation. See our Apache Hadoop vs. BigQuery report.

    See our list of best Cloud Data Warehouse vendors.

    We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.