We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."It's open-source, so it's very cost-effective."
"The best thing about this solution is that it is very powerful and very cheap."
"Hadoop File System is compatible with almost all the query engines."
"Most valuable features are HDFS and Kafka: Ingestion of huge volumes and variety of unstructured/semi-structured data is feasible, and it helps us to quickly onboard a new Big Data analytics prospect."
"The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable."
"The most valuable feature is the database."
"Since both Apache Hadoop and Amazon EC2 are elastic in nature, we can scale and expand on demand for a specific PoC, and scale down when it's done."
"Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability."
"Teradata's capabilities enhance data management efficiency, support scalability, and contribute to faster query performance."
"The functionality of the solution is excellent."
"It's the same as your visual database. I like the fast load feature for data, the BTQ solution is very good, and storage procedures are very fast."
"Teradata's pretty fast."
"The ability to handle machine data parallel processing is the most valuable feature of Teradata."
"The most valuable feature of Teradata is the quick processing of large data."
"Teradata has good performance, the response times are very fast. Overall the solution is easy to use. When we do the transformation, we have all of our staging and aggregation data available."
"The feature that we find most valuable is its ability to perform Massive Parallel Processing."
"I mentioned it definitely, and this is probably the only feature we can improve a little bit because the terminal and coding screen on Hadoop is a little outdated, and it looks like the old C++ bio screen. If the UI and UX can be improved slightly, I believe it will go a long way toward increasing adoption and effectiveness."
"The solution is very expensive."
"The price could be better. I think we would use it more, but the company didn't want to pay for it. Hortonworks doesn't exist anymore, and Cloudera killed the free version of Hadoop."
"The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks."
"Hadoop's security could be better."
"We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it."
"The stability of the solution needs improvement."
"What could be improved in Apache Hadoop is its user-friendliness. It's not that user-friendly, but maybe it's because I'm new to it. Sometimes it feels so tough to use, but it could be because of two aspects: one is my incompetency, for example, I don't know about all the features of Apache Hadoop, or maybe it's because of the limitations of the platform. For example, my team is maintaining the business glossary in Apache Atlas, but if you want to change any settings at the GUI level, an advanced level of coding or programming needs to be done in the back end, so it's not user-friendly."
"Data synchronization to the DR site."
"Teradata needs to expand the kind of training that's available to customers. Teradata only offers training directly and doesn't delegate to any third-party companies. As a result, it's harder to find people trained on Teradata in our market relative to Oracle."
"We tried to use case Teradata for a data warehouse system, but we had some problems in relation to the Teradata system, CDC tools, and source databases. We were unable to transfer data from HPE Integrity mainframe to Teradata."
"I would like to see more integration with many different types of data."
"I think the UI is not there yet. It could be improved by being more user-friendly."
"Sometimes the large injestion takes days to load data, and some of our stored procedures take two to three days."
"Limited interest and success in some areas make us hesitate about upgrading."
"Apart from Control-M, it would be nice if it could integrate with other tools."
Apache Hadoop is ranked 5th in Data Warehouse with 32 reviews while Teradata is ranked 3rd in Data Warehouse with 54 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.2. The top reviewer of Apache Hadoop writes "A file system for data collection that contains needed information and files". On the other hand, the top reviewer of Teradata writes "Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, Oracle Exadata, MySQL and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.
See our list of best Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Hi Tomasz,
Collibra can scan all these sources. See this link: https://marketplace.collibra.c...
Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:
https://www.erwin.com/products...
Hi Tomasz Rabong,
I believe that if you have a developer team in Amundsen it would be possible.
Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).
@Tomasz Rabong, it depends upon the actual requirements of the data catalog.
As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems.
As a team, we prefer SAP BO for billions of data.
Hi @Tomasz Rabong, I hope you're well and safe.
Specifically, if you need any help regarding Infogix Data360 Govern, please let me know.
Cheers.