We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial."
"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database."
"Data ingestion: It has rapid speed, if Apache Accumulo is used."
"Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing."
"The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics."
"What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies."
"The ability to add multiple nodes without any restriction is the solution's most valuable aspect."
"The most valuable features are powerful tools for ingestion, as data is in multiple systems."
"Auto-partitioning and indexing, and resource allocation on the fly are key features."
"Building a data warehouse with Teradata has definitely helped a lot of our downstream applications to more easily access information."
"Teradata's most valuable feature is that it's easy to use."
"I've never had any issues with scalability."
"Teradata can be deployed on-premise, on the cloud, or in a virtual machine, which means customers can move without having to create their architecture all over again."
"It's very, very fast"
"Teradata features high productivity and reliability because it has several redundancy options, so the system is always up and running."
"The initial setup was straightforward."
"The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks."
"We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it."
"In certain cases, the configurations for dealing with data skewness do not make any sense."
"The solution is not easy to use. The solution should be easy to use and suitable for almost any case connected with the use of big data or working with large amounts of data."
"It needs better user interface (UI) functionalities."
"The upgrade path should be improved because it is not as easy as it should be."
"The stability of the solution needs improvement."
"In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency."
"There is a need to improve performance in high transaction processes, as well as the reporting system."
"It could use some more advanced analytics relating to structured and semi-structured data."
"Apart from Control-M, it would be nice if it could integrate with other tools."
"GUI of administrative tools is really outdated."
"The following could be better: licensing, architecture openness, integration with other tools."
"An additional feature I would you like to see included in the next release, is that it needs to be more cloud-friendly."
"Teradata is a bit late for the cloud."
"I'm not sure about the unstructured data management capabilities. It could be improved."
Apache Hadoop is ranked 5th in Data Warehouse with 11 reviews while Teradata is ranked 3rd in Data Warehouse with 21 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.4. The top reviewer of Apache Hadoop writes "Has good processing power and speed and is capable of handling large volumes of data and doing online analysis". On the other hand, the top reviewer of Teradata writes "Very fast with good database control and excellent support". Apache Hadoop is most compared with Microsoft Azure Synapse Analytics, Azure Data Factory, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, MySQL, Oracle Exadata and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.
See our list of best Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Hi Tomasz,
Collibra can scan all these sources. See this link: https://marketplace.collibra.c...
Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:
https://www.erwin.com/products...
Hi Tomasz Rabong,
I believe that if you have a developer team in Amundsen it would be possible.
Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).
@Tomasz Rabong, it depends upon the actual requirements of the data catalog.
As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems.
As a team, we prefer SAP BO for billions of data.
Hi @Tomasz Rabong, I hope you're well and safe.
Specifically, if you need any help regarding Infogix Data360 Govern, please let me know.
Cheers.