Apache Hadoop vs Teradata comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,387 views|2,021 comparisons
87% willing to recommend
Teradata Logo
5,643 views|4,710 comparisons
87% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.

Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. Teradata Report (Updated: May 2024).
772,679 professionals have used our research since 2012.
Q&A Highlights
Question: Which data catalog can provide support for BI data sources such as SAP BO and Tableau?
Answer: Dear Community, Many thanks for yor support and help!
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The most valuable feature is scalability and the possibility to work with major information and open source capability.""One valuable feature is that we can download data.""The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics.""It's open-source, so it's very cost-effective.""The performance is pretty good.""It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.""Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform.""The scalability of Apache Hadoop is very good."

More Apache Hadoop Pros →

"Teradata's capabilities enhance data management efficiency, support scalability, and contribute to faster query performance.""Things have started moving faster in my company, such as data retrieval happens more quickly.​""Auto-partitioning and indexing, and resource allocation on the fly are key features.""Teradata's most valuable feature is that it's easy to use.""I like this solution's ease of design and the fact that its performance is quite good. It is stable as well.""The solution scales well on the cloud.""Teradata solutions help organizations reduce IT, operations, and maintenance costs; enhance on-time delivery of products and services.""We did performance testing. We had a set of real life MicroStrategy reports. Our conditions were: Not allowed to redesign data model, not allowed to rewrite the queries, all queries should be generated by MicroStrategy, no aggregates. Teradata appeared to be way faster than a similarly configured (in terms of hardware) Oracle server."

More Teradata Pros →

Cons
"The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.""The stability of the solution needs improvement.""I would like to see more direct integration of visualization applications.""The solution is very expensive.""We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.""What could be improved in Apache Hadoop is its user-friendliness. It's not that user-friendly, but maybe it's because I'm new to it. Sometimes it feels so tough to use, but it could be because of two aspects: one is my incompetency, for example, I don't know about all the features of Apache Hadoop, or maybe it's because of the limitations of the platform. For example, my team is maintaining the business glossary in Apache Atlas, but if you want to change any settings at the GUI level, an advanced level of coding or programming needs to be done in the back end, so it's not user-friendly.""The integration with Apache Hadoop with lots of different techniques within your business can be a challenge.""I think more of the solution needs to be focused around the panel processing and retrieval of data."

More Apache Hadoop Cons →

"An additional feature I would you like to see included in the next release, is that it needs to be more cloud-friendly.""It would help to make scaling easier with a reduced cost. ​""Teradata needs to expand the kind of training that's available to customers. Teradata only offers training directly and doesn't delegate to any third-party companies. As a result, it's harder to find people trained on Teradata in our market relative to Oracle.""The solution could improve by having a cloud version or a cloud component. We have to use other solutions, such as Amazon AWS, Microsoft Azure, or Snowflake for the cloud.""I'm not sure about the unstructured data management capabilities. It could be improved.""Data ingestion is done via external utilities and not by the query language itself. It would be more convenient to have that functionality within its SQL dialect.""Teradata's UI could be more user-friendly.""I would like to see more integration with many different types of data."

More Teradata Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "Teradata is not cheap, but you get what you pay for."
  • "Make sure you have the in-house skills to design and support the solution, as relying on external sources is extremely costly and tends to lock you into specific platforms, tools, and paradigms."
  • "In the past, it turned out that other solutions, in order to provide the full range of abilities that the Teradata platform provides plus the migration costs, would end up costing more than Teradata does."
  • "The initial cost may seem high, but the TCO is low."
  • "Teradata is currently making improvements in this area."
  • "It is still a very expensive solution. While I very much like the pure technological supremacy of the software itself, I believe Teradata as a company needs to become more affordable. They are already losing the market to more flexible or cheaper competitors."
  • "Teradata is expensive but gives value for money, especially if you don't want to move your data to the cloud."
  • "Price is quite high, so if it is really possible to use other solutions (e.g. you do not have strict requirements for performance and huge data volumes), it might be better to look at alternatives from the RDBMS world."
  • More Teradata Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    772,679 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Answers from the Community
    Tomasz Rabong
    InitZero - PeerSpot reviewerInitZero
    Real User

    Hi Tomasz,


    Collibra can scan all these sources. See this link: https://marketplace.collibra.c...


    Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:


    https://www.erwin.com/products...

    Leandro Sodré - PeerSpot reviewerLeandro Sodré
    User

    Hi Tomasz Rabong


    I believe that if you have a developer team in Amundsen it would be possible. 


    Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).

    Ritesh Misra - PeerSpot reviewerRitesh Misra
    User

    @Tomasz Rabong, it depends upon the actual requirements of the data catalog. 


    As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems. 


    As a team, we prefer SAP BO for billions of data.

    Delmar Assis - PeerSpot reviewerDelmar Assis
    Real User

    Hi @Tomasz Rabong, I hope you're well and safe. 


    Specifically, if you need any help regarding Infogix Data360 Govern, please let me know. 


    Cheers.

    Questions from the Community
    Top Answer:It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
    Top Answer:Since it is an open-source product, there won't be much support. So, you have to have deeper knowledge. You need to improvise based on that.
    Top Answer: I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite… more »
    Top Answer: Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the… more »
    Top Answer:Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example… more »
    Ranking
    6th
    out of 35 in Data Warehouse
    Views
    2,387
    Comparisons
    2,021
    Reviews
    13
    Average Words per Review
    530
    Rating
    7.8
    3rd
    out of 35 in Data Warehouse
    Views
    5,643
    Comparisons
    4,710
    Reviews
    18
    Average Words per Review
    454
    Rating
    7.6
    Comparisons
    SQL Server logo
    Compared 19% of the time.
    Snowflake logo
    Compared 12% of the time.
    Oracle Exadata logo
    Compared 11% of the time.
    MySQL logo
    Compared 10% of the time.
    Teradata IntelliFlex logo
    Compared 2% of the time.
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    Teradata is a multi-cloud data platform company that provides data warehouse and relational database tools. The brand has an extensive portfolio of big data analytic solutions, integrated marketing applications, and services. The products that Teradata offers can be categorized in the following ways:

    • Software: The products in this category include Teradata Vantage, an advanced SQL engine and Teradata database.

    • Cloud: The solutions can work with popular cloud services such as Amazon AWS, Microsoft Azure, Google Cloud Platform, and VMware. Teradata Cloud and customer cloud are also included in this product category.

    • Ecosystem management: The products in this section include IntelliSphere, business continuity manager, data lab, data mover, data stream, QueryGrid, and viewpoint.

    • Hardware: This category includes backup, archive, and restore (BAR) and IntelliFlex.

    • Applications: Master data management (MDM) and Teradata analytics for enterprise applications are the two products in this category.

    One of the most popular and commonly used products by Teradata is Teradata Vantage. This is a connected multi-cloud data platform for enterprise analytics. The product unifies data lakes, analytics, and data warehouses, as well as data sources and types. The solution offers its customers scalability, which allows them to scale dimensions to handle massive workloads of data. Teradata Vantage utilizes artificial intelligence (AI) and machine learning (ML) to power more models and enhance quality.

    Through Vantage Console, businesses can benefit from role-based, no-code software to organize and manage their data. The product offers deployment on many popular public clouds, on premises, and on commodity hardware. Teradata Vantage unifies and integrates all data types from sources and provides companies with a single source of information. It achieves this by supporting all common data types and formats. The product also provides organizations with tools for monitoring, analyzing, and connecting data throughout the organizations.

    Part of this product is also VantageCloud, which offers a modern cloud-native architecture as well as hybrid and multi-cloud deployment options. This solution offers new ways for clients to deploy their platforms. These include the next-generation, cloud-native architecture of Teradata VantageCloud Lake.

    Teradata Features

    The different products that Teradata offers have various features which facilitate data management for customers. Some of the key capabilities of the solutions offered by Teradata include:

    • Connectivity: Users can benefit from connections to the mainframe or network-attached systems. The product provides its own extension for interaction with data stored in the tables. It also supports SQL.

    • Linear scalability: Teradata solutions are highly scalable and linear. The solution can handle large volumes of data effectively while also supporting the ability to scale up to 2048 nodes.

    • Load and unload utilities: The product offers features to move data in and out of the product's system effectively.

    • Mature optimizer: Part of this feature is Teradata Optimizer, which is an advanced product that can handle up to 64 joins in a single query.

    • Robust utilities: The solution includes multiple robust utilities in this category of features that facilitate data handling in and out of the product's systems. The tools include MultiLoad, FastLoad, and FastExport.

    • Shared nothing architecture: This set of features, connected to the architecture of Teradata, is known as “shared nothing architecture” because the nodes, processors, and disks all work independently. These capacities ensure better value for a given task.

    • Unlimited parallelism: These features divide large volumes of data into smaller processes that are executed in parallel. This contributes to the execution of complex tasks in a timely manner.

    Teradata Benefits

    Through its multiple solutions, Teradata offers various benefits for its clients. Some of these include:

    • The solution facilitates prototype creation and offers faster times for completion.

    • Through Teradata, users are able to use faster, simpler, and easier solutions for data-related tasks.

    • The product eliminates the need for unnecessary data movement, which contributes to performance improvements.

    • Teradata offers developers the ability to decide where in the architecture different parts of an application run.

    • Teradata allows database administrators to manage databases from a single point of control.

    • The solution provides the option to get the same data on multiple deployment options.

    • The product supports Online Analytical Programming (OLAP) functions, which allows it to perform a complex analytical process of data.

    • Teradata uses the Service Workstation to provide a single operation view for the multi-node system of the product.

    Reviews from Real Users

    Martin P., a services manager at Bytes Systems Integration, describes Teradata as a product that is very fast with good database control and excellent support.

    Blaine V., principal at Insight Data Consulting, rates Teradata highly because of its excellent native features, highly stable, and impressive automation.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Netflix
    Top Industries
    REVIEWERS
    Financial Services Firm35%
    Comms Service Provider24%
    Hospitality Company6%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm29%
    Computer Software Company11%
    University6%
    Manufacturing Company5%
    REVIEWERS
    Comms Service Provider29%
    Computer Software Company21%
    Financial Services Firm8%
    Energy/Utilities Company8%
    VISITORS READING REVIEWS
    Financial Services Firm25%
    Computer Software Company10%
    Manufacturing Company8%
    Healthcare Company7%
    Company Size
    REVIEWERS
    Small Business33%
    Midsize Enterprise19%
    Large Enterprise47%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business32%
    Midsize Enterprise11%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise9%
    Large Enterprise77%
    Buyer's Guide
    Apache Hadoop vs. Teradata
    May 2024
    Find out what your peers are saying about Apache Hadoop vs. Teradata and other solutions. Updated: May 2024.
    772,679 professionals have used our research since 2012.

    Apache Hadoop is ranked 6th in Data Warehouse with 34 reviews while Teradata is ranked 3rd in Data Warehouse with 54 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.2. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of Teradata writes "Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, Oracle Exadata, MySQL and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.

    See our list of best Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.