Apache Hadoop vs Teradata comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,467 views|2,109 comparisons
87% willing to recommend
Teradata Logo
5,898 views|4,926 comparisons
87% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.

Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. Teradata Report (Updated: May 2024).
770,141 professionals have used our research since 2012.
Q&A Highlights
Question: Which data catalog can provide support for BI data sources such as SAP BO and Tableau?
Answer: Dear Community, Many thanks for yor support and help!
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.""We selected Apache Hadoop because it is not dependent on third-party vendors.""What comes with the standard setup is what we mostly use, but Ambari is the most important.""Most valuable features are HDFS and Kafka: Ingestion of huge volumes and variety of unstructured/semi-structured data is feasible, and it helps us to quickly onboard a new Big Data analytics prospect.""The most valuable feature is the database.""It's open-source, so it's very cost-effective.""Since both Apache Hadoop and Amazon EC2 are elastic in nature, we can scale and expand on demand for a specific PoC, and scale down when it's done.""As compared to Hive on MapReduce, Impala on MPP returns results of SQL queries in a fairly short amount of time, and is relatively fast when reading data into other platforms like R."

More Apache Hadoop Pros →

"Teradata can be deployed on-premise, on the cloud, or in a virtual machine, which means customers can move without having to create their architecture all over again.""It's very, very fast""Viewpoint, the detailed query logs and performance statistics are valuable features.""It's a pre-configured appliance that requires very little in terms of setting-up.""Teradata solutions help organizations reduce IT, operations, and maintenance costs; enhance on-time delivery of products and services.""Teradata is a great, industry-leading data warehousing product that has MPP architecture.""I found all parts --loading, transformation, processing & querying work in parallel, and end-to-end-- to be valuable.""Teradata's best feature is its speed with historical data."

More Teradata Pros →

Cons
"The upgrade path should be improved because it is not as easy as it should be.""The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.""In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency.""The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support.""From the Apache perspective or the open-source community, they need to add more capabilities to make life easier from a configuration and deployment perspective.""It would be good to have more advanced analytics tools.""General installation/dependency issues were there, but were not a major, complex issue. While migrating data from MySQL to Hive, things are a little challenging, but we were able to get through that with support from forums and a little trial and error.""The stability of the solution needs improvement."

More Apache Hadoop Cons →

"The following could be better: licensing, architecture openness, integration with other tools.""The increasing volumes of data demand more and more performance.""It's primarily designed for big projects and therefore, the pricing is pretty high. It's not suitable for smaller companies.""Teradata should focus on functionality for building predictive models because, in that regard, it can definitely improve.""Data ingestion is done via external utilities and not by the query language itself. It would be more convenient to have that functionality within its SQL dialect.""I've been using the same UI for 20 years in Teradata. It could use some updating. Adding more stability around Teradata Studio would be outstanding. Teradata Studio is a Java-based version of their tool. It's much better now, but it still has some room for improvement.""Teradata is an expensive tool. Like, if you're already using Microsoft products like Windows, they'll market all their products together. And with the rise of cloud technologies, companies will adopt solutions that offer them some privileges or facilities. Similar to how SAP does it in the market, so do Microsoft and other companies. Even Oracle and other such tools are quite commonly seen compared to Teradata's competitors in everyday solutions.""Teradata's pricing is quite high compared to Redshift, Synapse, or GCP alternatives."

More Teradata Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "Teradata is not cheap, but you get what you pay for."
  • "Make sure you have the in-house skills to design and support the solution, as relying on external sources is extremely costly and tends to lock you into specific platforms, tools, and paradigms."
  • "In the past, it turned out that other solutions, in order to provide the full range of abilities that the Teradata platform provides plus the migration costs, would end up costing more than Teradata does."
  • "The initial cost may seem high, but the TCO is low."
  • "Teradata is currently making improvements in this area."
  • "It is still a very expensive solution. While I very much like the pure technological supremacy of the software itself, I believe Teradata as a company needs to become more affordable. They are already losing the market to more flexible or cheaper competitors."
  • "Teradata is expensive but gives value for money, especially if you don't want to move your data to the cloud."
  • "Price is quite high, so if it is really possible to use other solutions (e.g. you do not have strict requirements for performance and huge data volumes), it might be better to look at alternatives from the RDBMS world."
  • More Teradata Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    770,141 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Answers from the Community
    Tomasz Rabong
    InitZero - PeerSpot reviewerInitZero
    Real User

    Hi Tomasz,


    Collibra can scan all these sources. See this link: https://marketplace.collibra.c...


    Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:


    https://www.erwin.com/products...

    Leandro Sodré - PeerSpot reviewerLeandro Sodré
    User

    Hi Tomasz Rabong


    I believe that if you have a developer team in Amundsen it would be possible. 


    Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).

    Ritesh Misra - PeerSpot reviewerRitesh Misra
    User

    @Tomasz Rabong, it depends upon the actual requirements of the data catalog. 


    As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems. 


    As a team, we prefer SAP BO for billions of data.

    Delmar Assis - PeerSpot reviewerDelmar Assis
    Real User

    Hi @Tomasz Rabong, I hope you're well and safe. 


    Specifically, if you need any help regarding Infogix Data360 Govern, please let me know. 


    Cheers.

    Questions from the Community
    Top Answer:Tools like Apache Hadoop are knowledge-intensive in nature. Unlike other tools in the market currently, we cannot understand knowledge-intensive products straight away. To use Apache Hadoop, a person… more »
    Top Answer: I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite… more »
    Top Answer: Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the… more »
    Top Answer:Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example… more »
    Ranking
    5th
    out of 35 in Data Warehouse
    Views
    2,467
    Comparisons
    2,109
    Reviews
    11
    Average Words per Review
    573
    Rating
    7.9
    3rd
    out of 35 in Data Warehouse
    Views
    5,898
    Comparisons
    4,926
    Reviews
    21
    Average Words per Review
    469
    Rating
    7.8
    Comparisons
    SQL Server logo
    Compared 19% of the time.
    Snowflake logo
    Compared 12% of the time.
    Oracle Exadata logo
    Compared 11% of the time.
    MySQL logo
    Compared 10% of the time.
    Teradata IntelliFlex logo
    Compared 2% of the time.
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    Teradata is a multi-cloud data platform company that provides data warehouse and relational database tools. The brand has an extensive portfolio of big data analytic solutions, integrated marketing applications, and services. The products that Teradata offers can be categorized in the following ways:

    • Software: The products in this category include Teradata Vantage, an advanced SQL engine and Teradata database.

    • Cloud: The solutions can work with popular cloud services such as Amazon AWS, Microsoft Azure, Google Cloud Platform, and VMware. Teradata Cloud and customer cloud are also included in this product category.

    • Ecosystem management: The products in this section include IntelliSphere, business continuity manager, data lab, data mover, data stream, QueryGrid, and viewpoint.

    • Hardware: This category includes backup, archive, and restore (BAR) and IntelliFlex.

    • Applications: Master data management (MDM) and Teradata analytics for enterprise applications are the two products in this category.

    One of the most popular and commonly used products by Teradata is Teradata Vantage. This is a connected multi-cloud data platform for enterprise analytics. The product unifies data lakes, analytics, and data warehouses, as well as data sources and types. The solution offers its customers scalability, which allows them to scale dimensions to handle massive workloads of data. Teradata Vantage utilizes artificial intelligence (AI) and machine learning (ML) to power more models and enhance quality.

    Through Vantage Console, businesses can benefit from role-based, no-code software to organize and manage their data. The product offers deployment on many popular public clouds, on premises, and on commodity hardware. Teradata Vantage unifies and integrates all data types from sources and provides companies with a single source of information. It achieves this by supporting all common data types and formats. The product also provides organizations with tools for monitoring, analyzing, and connecting data throughout the organizations.

    Part of this product is also VantageCloud, which offers a modern cloud-native architecture as well as hybrid and multi-cloud deployment options. This solution offers new ways for clients to deploy their platforms. These include the next-generation, cloud-native architecture of Teradata VantageCloud Lake.

    Teradata Features

    The different products that Teradata offers have various features which facilitate data management for customers. Some of the key capabilities of the solutions offered by Teradata include:

    • Connectivity: Users can benefit from connections to the mainframe or network-attached systems. The product provides its own extension for interaction with data stored in the tables. It also supports SQL.

    • Linear scalability: Teradata solutions are highly scalable and linear. The solution can handle large volumes of data effectively while also supporting the ability to scale up to 2048 nodes.

    • Load and unload utilities: The product offers features to move data in and out of the product's system effectively.

    • Mature optimizer: Part of this feature is Teradata Optimizer, which is an advanced product that can handle up to 64 joins in a single query.

    • Robust utilities: The solution includes multiple robust utilities in this category of features that facilitate data handling in and out of the product's systems. The tools include MultiLoad, FastLoad, and FastExport.

    • Shared nothing architecture: This set of features, connected to the architecture of Teradata, is known as “shared nothing architecture” because the nodes, processors, and disks all work independently. These capacities ensure better value for a given task.

    • Unlimited parallelism: These features divide large volumes of data into smaller processes that are executed in parallel. This contributes to the execution of complex tasks in a timely manner.

    Teradata Benefits

    Through its multiple solutions, Teradata offers various benefits for its clients. Some of these include:

    • The solution facilitates prototype creation and offers faster times for completion.

    • Through Teradata, users are able to use faster, simpler, and easier solutions for data-related tasks.

    • The product eliminates the need for unnecessary data movement, which contributes to performance improvements.

    • Teradata offers developers the ability to decide where in the architecture different parts of an application run.

    • Teradata allows database administrators to manage databases from a single point of control.

    • The solution provides the option to get the same data on multiple deployment options.

    • The product supports Online Analytical Programming (OLAP) functions, which allows it to perform a complex analytical process of data.

    • Teradata uses the Service Workstation to provide a single operation view for the multi-node system of the product.

    Reviews from Real Users

    Martin P., a services manager at Bytes Systems Integration, describes Teradata as a product that is very fast with good database control and excellent support.

    Blaine V., principal at Insight Data Consulting, rates Teradata highly because of its excellent native features, highly stable, and impressive automation.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Netflix
    Top Industries
    REVIEWERS
    Financial Services Firm38%
    Comms Service Provider25%
    Hospitality Company6%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm28%
    Computer Software Company10%
    Comms Service Provider6%
    University6%
    REVIEWERS
    Comms Service Provider26%
    Computer Software Company22%
    Financial Services Firm9%
    Energy/Utilities Company9%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Computer Software Company10%
    Manufacturing Company8%
    Healthcare Company7%
    Company Size
    REVIEWERS
    Small Business34%
    Midsize Enterprise20%
    Large Enterprise46%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business33%
    Midsize Enterprise11%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise9%
    Large Enterprise77%
    Buyer's Guide
    Apache Hadoop vs. Teradata
    May 2024
    Find out what your peers are saying about Apache Hadoop vs. Teradata and other solutions. Updated: May 2024.
    770,141 professionals have used our research since 2012.

    Apache Hadoop is ranked 5th in Data Warehouse with 33 reviews while Teradata is ranked 3rd in Data Warehouse with 54 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.2. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of Teradata writes "Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, Oracle Exadata, MySQL and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.

    See our list of best Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.