Apache Hadoop vs Teradata comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,765 views|2,378 comparisons
Teradata Logo
6,277 views|5,267 comparisons
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.

Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. Teradata Report (Updated: March 2024).
763,955 professionals have used our research since 2012.
Q&A Highlights
Question: Which data catalog can provide support for BI data sources such as SAP BO and Tableau?
Answer: Dear Community, Many thanks for yor support and help!
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial.""It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.""​​Data ingestion: It has rapid speed, if Apache Accumulo is used.""Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.""The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics.""What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies.""The ability to add multiple nodes without any restriction is the solution's most valuable aspect.""The most valuable features are powerful tools for ingestion, as data is in multiple systems."

More Apache Hadoop Pros →

"Auto-partitioning and indexing, and resource allocation on the fly are key features.""​Building a data warehouse with Teradata has definitely helped a lot of our downstream applications to more easily access information.""Teradata's most valuable feature is that it's easy to use.""I've never had any issues with scalability.""Teradata can be deployed on-premise, on the cloud, or in a virtual machine, which means customers can move without having to create their architecture all over again.""It's very, very fast""Teradata features high productivity and reliability because it has several redundancy options, so the system is always up and running.""The initial setup was straightforward."

More Teradata Pros →

Cons
"The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks.""We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.""In certain cases, the configurations for dealing with data skewness do not make any sense.""The solution is not easy to use. The solution should be easy to use and suitable for almost any case connected with the use of big data or working with large amounts of data.""It needs better user interface (UI) functionalities.""The upgrade path should be improved because it is not as easy as it should be.""The stability of the solution needs improvement.""In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency."

More Apache Hadoop Cons →

"There is a need to improve performance in high transaction processes, as well as the reporting system.""It could use some more advanced analytics relating to structured and semi-structured data.""Apart from Control-M, it would be nice if it could integrate with other tools.""GUI of administrative tools is really outdated.""The following could be better: licensing, architecture openness, integration with other tools.""An additional feature I would you like to see included in the next release, is that it needs to be more cloud-friendly.""Teradata is a bit late for the cloud.""I'm not sure about the unstructured data management capabilities. It could be improved."

More Teradata Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "Teradata is not cheap, but you get what you pay for."
  • "Make sure you have the in-house skills to design and support the solution, as relying on external sources is extremely costly and tends to lock you into specific platforms, tools, and paradigms."
  • "In the past, it turned out that other solutions, in order to provide the full range of abilities that the Teradata platform provides plus the migration costs, would end up costing more than Teradata does."
  • "The initial cost may seem high, but the TCO is low."
  • "Teradata is currently making improvements in this area."
  • "It is still a very expensive solution. While I very much like the pure technological supremacy of the software itself, I believe Teradata as a company needs to become more affordable. They are already losing the market to more flexible or cheaper competitors."
  • "Teradata is expensive but gives value for money, especially if you don't want to move your data to the cloud."
  • "Price is quite high, so if it is really possible to use other solutions (e.g. you do not have strict requirements for performance and huge data volumes), it might be better to look at alternatives from the RDBMS world."
  • More Teradata Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    763,955 professionals have used our research since 2012.
    Answers from the Community
    Tomasz Rabong
    InitZero - PeerSpot reviewerInitZero
    Real User

    Hi Tomasz,


    Collibra can scan all these sources. See this link: https://marketplace.collibra.c...


    Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:


    https://www.erwin.com/products...

    Leandro Sodré - PeerSpot reviewerLeandro Sodré
    User

    Hi Tomasz Rabong


    I believe that if you have a developer team in Amundsen it would be possible. 


    Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).

    Ritesh Misra - PeerSpot reviewerRitesh Misra
    User

    @Tomasz Rabong, it depends upon the actual requirements of the data catalog. 


    As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems. 


    As a team, we prefer SAP BO for billions of data.

    Delmar Assis - PeerSpot reviewerDelmar Assis
    Real User

    Hi @Tomasz Rabong, I hope you're well and safe. 


    Specifically, if you need any help regarding Infogix Data360 Govern, please let me know. 


    Cheers.

    Questions from the Community
    Top Answer:It's open-source, so it's very cost-effective.
    Top Answer:The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support. And then there's the server issue. You have to create and… more »
    Top Answer: I have spoken to my colleagues about this comparison and in our collective opinion, the reason why some people may declare Teradata better than Oracle is the pricing. Both solutions are quite… more »
    Top Answer: Before my organization implemented this solution, we researched which big brands were using Teradata, so we knew if it would be compatible with our field. According to the product's site, the… more »
    Top Answer:Teradata is not a difficult product to work with, especially since they offer you technical support at all levels if you just ask. There are some features that may cause difficulties - for example… more »
    Ranking
    5th
    out of 33 in Data Warehouse
    Views
    2,765
    Comparisons
    2,378
    Reviews
    10
    Average Words per Review
    539
    Rating
    8.0
    3rd
    out of 33 in Data Warehouse
    Views
    6,277
    Comparisons
    5,267
    Reviews
    19
    Average Words per Review
    451
    Rating
    8.1
    Comparisons
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    Teradata is a multi-cloud data platform company that provides data warehouse and relational database tools. The brand has an extensive portfolio of big data analytic solutions, integrated marketing applications, and services. The products that Teradata offers can be categorized in the following ways:

    • Software: The products in this category include Teradata Vantage, an advanced SQL engine and Teradata database.

    • Cloud: The solutions can work with popular cloud services such as Amazon AWS, Microsoft Azure, Google Cloud Platform, and VMware. Teradata Cloud and customer cloud are also included in this product category.

    • Ecosystem management: The products in this section include IntelliSphere, business continuity manager, data lab, data mover, data stream, QueryGrid, and viewpoint.

    • Hardware: This category includes backup, archive, and restore (BAR) and IntelliFlex.

    • Applications: Master data management (MDM) and Teradata analytics for enterprise applications are the two products in this category.

    One of the most popular and commonly used products by Teradata is Teradata Vantage. This is a connected multi-cloud data platform for enterprise analytics. The product unifies data lakes, analytics, and data warehouses, as well as data sources and types. The solution offers its customers scalability, which allows them to scale dimensions to handle massive workloads of data. Teradata Vantage utilizes artificial intelligence (AI) and machine learning (ML) to power more models and enhance quality.

    Through Vantage Console, businesses can benefit from role-based, no-code software to organize and manage their data. The product offers deployment on many popular public clouds, on premises, and on commodity hardware. Teradata Vantage unifies and integrates all data types from sources and provides companies with a single source of information. It achieves this by supporting all common data types and formats. The product also provides organizations with tools for monitoring, analyzing, and connecting data throughout the organizations.

    Part of this product is also VantageCloud, which offers a modern cloud-native architecture as well as hybrid and multi-cloud deployment options. This solution offers new ways for clients to deploy their platforms. These include the next-generation, cloud-native architecture of Teradata VantageCloud Lake.

    Teradata Features

    The different products that Teradata offers have various features which facilitate data management for customers. Some of the key capabilities of the solutions offered by Teradata include:

    • Connectivity: Users can benefit from connections to the mainframe or network-attached systems. The product provides its own extension for interaction with data stored in the tables. It also supports SQL.

    • Linear scalability: Teradata solutions are highly scalable and linear. The solution can handle large volumes of data effectively while also supporting the ability to scale up to 2048 nodes.

    • Load and unload utilities: The product offers features to move data in and out of the product's system effectively.

    • Mature optimizer: Part of this feature is Teradata Optimizer, which is an advanced product that can handle up to 64 joins in a single query.

    • Robust utilities: The solution includes multiple robust utilities in this category of features that facilitate data handling in and out of the product's systems. The tools include MultiLoad, FastLoad, and FastExport.

    • Shared nothing architecture: This set of features, connected to the architecture of Teradata, is known as “shared nothing architecture” because the nodes, processors, and disks all work independently. These capacities ensure better value for a given task.

    • Unlimited parallelism: These features divide large volumes of data into smaller processes that are executed in parallel. This contributes to the execution of complex tasks in a timely manner.

    Teradata Benefits

    Through its multiple solutions, Teradata offers various benefits for its clients. Some of these include:

    • The solution facilitates prototype creation and offers faster times for completion.

    • Through Teradata, users are able to use faster, simpler, and easier solutions for data-related tasks.

    • The product eliminates the need for unnecessary data movement, which contributes to performance improvements.

    • Teradata offers developers the ability to decide where in the architecture different parts of an application run.

    • Teradata allows database administrators to manage databases from a single point of control.

    • The solution provides the option to get the same data on multiple deployment options.

    • The product supports Online Analytical Programming (OLAP) functions, which allows it to perform a complex analytical process of data.

    • Teradata uses the Service Workstation to provide a single operation view for the multi-node system of the product.

    Reviews from Real Users

    Martin P., a services manager at Bytes Systems Integration, describes Teradata as a product that is very fast with good database control and excellent support.

    Blaine V., principal at Insight Data Consulting, rates Teradata highly because of its excellent native features, highly stable, and impressive automation.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Netflix
    Top Industries
    REVIEWERS
    Financial Services Firm40%
    Comms Service Provider27%
    Hospitality Company7%
    Consumer Goods Company7%
    VISITORS READING REVIEWS
    Financial Services Firm27%
    Computer Software Company10%
    Comms Service Provider6%
    Educational Organization6%
    REVIEWERS
    Comms Service Provider26%
    Computer Software Company22%
    Financial Services Firm9%
    Energy/Utilities Company9%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Computer Software Company10%
    Manufacturing Company8%
    Healthcare Company7%
    Company Size
    REVIEWERS
    Small Business33%
    Midsize Enterprise24%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise10%
    Large Enterprise75%
    REVIEWERS
    Small Business33%
    Midsize Enterprise11%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise9%
    Large Enterprise77%
    Buyer's Guide
    Apache Hadoop vs. Teradata
    March 2024
    Find out what your peers are saying about Apache Hadoop vs. Teradata and other solutions. Updated: March 2024.
    763,955 professionals have used our research since 2012.

    Apache Hadoop is ranked 5th in Data Warehouse with 11 reviews while Teradata is ranked 3rd in Data Warehouse with 21 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.4. The top reviewer of Apache Hadoop writes "Has good processing power and speed and is capable of handling large volumes of data and doing online analysis". On the other hand, the top reviewer of Teradata writes "Very fast with good database control and excellent support". Apache Hadoop is most compared with Microsoft Azure Synapse Analytics, Azure Data Factory, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, MySQL, Oracle Exadata and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.

    See our list of best Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.