We performed a comparison between Apache Hadoop and Teradata based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The most valuable feature is scalability and the possibility to work with major information and open source capability."
"One valuable feature is that we can download data."
"The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics."
"It's open-source, so it's very cost-effective."
"The performance is pretty good."
"It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming."
"Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform."
"The scalability of Apache Hadoop is very good."
"Teradata's capabilities enhance data management efficiency, support scalability, and contribute to faster query performance."
"Things have started moving faster in my company, such as data retrieval happens more quickly."
"Auto-partitioning and indexing, and resource allocation on the fly are key features."
"Teradata's most valuable feature is that it's easy to use."
"I like this solution's ease of design and the fact that its performance is quite good. It is stable as well."
"The solution scales well on the cloud."
"Teradata solutions help organizations reduce IT, operations, and maintenance costs; enhance on-time delivery of products and services."
"We did performance testing. We had a set of real life MicroStrategy reports. Our conditions were: Not allowed to redesign data model, not allowed to rewrite the queries, all queries should be generated by MicroStrategy, no aggregates. Teradata appeared to be way faster than a similarly configured (in terms of hardware) Oracle server."
"The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning."
"The stability of the solution needs improvement."
"I would like to see more direct integration of visualization applications."
"The solution is very expensive."
"We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it."
"What could be improved in Apache Hadoop is its user-friendliness. It's not that user-friendly, but maybe it's because I'm new to it. Sometimes it feels so tough to use, but it could be because of two aspects: one is my incompetency, for example, I don't know about all the features of Apache Hadoop, or maybe it's because of the limitations of the platform. For example, my team is maintaining the business glossary in Apache Atlas, but if you want to change any settings at the GUI level, an advanced level of coding or programming needs to be done in the back end, so it's not user-friendly."
"The integration with Apache Hadoop with lots of different techniques within your business can be a challenge."
"I think more of the solution needs to be focused around the panel processing and retrieval of data."
"An additional feature I would you like to see included in the next release, is that it needs to be more cloud-friendly."
"It would help to make scaling easier with a reduced cost. "
"Teradata needs to expand the kind of training that's available to customers. Teradata only offers training directly and doesn't delegate to any third-party companies. As a result, it's harder to find people trained on Teradata in our market relative to Oracle."
"The solution could improve by having a cloud version or a cloud component. We have to use other solutions, such as Amazon AWS, Microsoft Azure, or Snowflake for the cloud."
"I'm not sure about the unstructured data management capabilities. It could be improved."
"Data ingestion is done via external utilities and not by the query language itself. It would be more convenient to have that functionality within its SQL dialect."
"Teradata's UI could be more user-friendly."
"I would like to see more integration with many different types of data."
Apache Hadoop is ranked 6th in Data Warehouse with 34 reviews while Teradata is ranked 3rd in Data Warehouse with 54 reviews. Apache Hadoop is rated 7.8, while Teradata is rated 8.2. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of Teradata writes "Offers seamless integration capabilities and performance optimization features, including extensive indexing and advanced tuning capabilities". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and BigQuery, whereas Teradata is most compared with SQL Server, Snowflake, Oracle Exadata, MySQL and Teradata IntelliFlex. See our Apache Hadoop vs. Teradata report.
See our list of best Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Hi Tomasz,
Collibra can scan all these sources. See this link: https://marketplace.collibra.c...
Also, Erwin Data Intelligence Suite can harvest most (if not all) of these sources:
https://www.erwin.com/products...
Hi Tomasz Rabong,
I believe that if you have a developer team in Amundsen it would be possible.
Alternatively, you can look at Informatica EDC or at Data Virtualization Data Catalog (from Denodo).
@Tomasz Rabong, it depends upon the actual requirements of the data catalog.
As far as we have experienced SAP BO 4.0 is way ahead in solving architectural, clustering, warehousing and mining complex problems whereas Tableau server 2022.1 is really awesome and has recently included features to solve complex problems.
As a team, we prefer SAP BO for billions of data.
Hi @Tomasz Rabong, I hope you're well and safe.
Specifically, if you need any help regarding Infogix Data360 Govern, please let me know.
Cheers.