Azure Data Factory vs SAP Data Hub comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Azure Data Factory
Average Rating
8.0
Number of Reviews
81
Ranking in other categories
Data Integration (1st), Cloud Data Warehouse (3rd)
SAP Data Hub
Average Rating
7.6
Number of Reviews
3
Ranking in other categories
Data Governance (26th), Metadata Management (11th)
 

Market share comparison

As of June 2024, in the Data Integration category, the market share of Azure Data Factory is 9.6% and it decreased by 29.4% compared to the previous year. The market share of SAP Data Hub is 0.6% and it decreased by 11.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
Unique Categories:
Cloud Data Warehouse
13.6%
Data Governance
1.0%
Metadata Management
0.9%
 

Featured Reviews

KR
Mar 6, 2024
Offers good integration with SQL pools and serverless architecture
We have multiple banking applications running on SSIS pipelines. We're in the process of upgrading them to a hybrid cloud architecture. For that, we use Azure Data Factory to move data from on-premises to the cloud – mainly for back-end database operations and ETL transformations. We primarily use it to load data from an on-premises SQL Server to either Blob storage or an Azure SQL data warehouse. For other integrations, especially those outside of Azure, we tend to use Informatica Cloud Services (ICS). For structured data loading, we use it However, we use Informatica for unstructured or semi-structured data. We also use Snowflake for ETL processes and sometimes for streaming. In my opinion, ADF isn't as suitable for streaming – for streaming, Snowflake streamlets or Informatica structured streaming are more reliable. ADF works well for batch processing, though.
VM
Sep 22, 2023
The solution is seamless, but the database sometimes leads to confusion
We used to have multiple different kinds of databases, which internally, had different compliance levels. Retention management is very different now. If the policy is live and the claim has been completed, I couldn't archive the claim. I needed to keep a reference integrity of that claim and understand which policy paid out the claim. With this solution, the policy came in six months ago and qualified for archiving. The claim had been paid and in every environment, the claim had been closed, including the reporting system, the claims system, etc. With the payment set gateway, I can just go and archive. But, we had a hard time during this process. I rate the overall solution a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of this solution would be ease of use."
"The most valuable features of the solution are its ease of use and the readily available adapters for connecting with various sources."
"What I like best about Azure Data Factory is that it allows you to create pipelines, specifically ETL pipelines. I also like that Azure Data Factory has connectors and solves most of my company's problems."
"The function of the solution is great."
"The two most valuable features of Azure Data Factory are that it's very scalable and that it's also highly reliable."
"Allows more data between on-premises and cloud solutions"
"The user interface is very good. It makes me feel very comfortable when I am using the tool."
"Its integrability with the rest of the activities on Azure is most valuable."
"SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database."
"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have."
"The most valuable feature is the S/4HANA 1909 On-Premise"
 

Cons

"If the user interface was more user friendly and there was better error feedback, it would be helpful."
"It's a good idea to take a Microsoft course. Because they are really helpful when you start from your journey with Data Factory."
"The user interface could use improvement. It's not a major issue but it's something that can be improved."
"It can improve from the perspective of active logging. It can provide active logging information."
"Some of the optimization techniques are not scalable."
"Data Factory has so many features that it can be a little difficult or confusing to find some settings and configurations. I'm sure there's a way to make it a little easier to navigate."
"I have not found any real shortcomings within the product."
"Snowflake connectivity was recently added and if the vendor provided some videos on how to create data then that would be helpful."
"Nowadays there are some inconsistencies in data bases, however, they upgrade and release the versions to market."
"In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."
"The company has everything offshore."
 

Pricing and Cost Advice

"The solution's pricing is competitive."
"The solution's fees are based on a pay-per-minute use plus the amount of data required to process."
"The licensing cost is included in the Synapse."
"The price you pay is determined by how much you use it."
"The pricing is pay-as-you-go or reserve instance. Of the two options, reserve instance is much cheaper."
"I am aware of the pricing of Azure Data Factory, but I prefer not to disclose specific details."
"There's no licensing for Azure Data Factory, they have a consumption payment model. How often you are running the service and how long that service takes to run. The price can be approximately $500 to $1,000 per month but depends on the scaling."
"I don't see a cost; it appears to be included in general support."
"The Cloud is very expensive, but SAP HANA previous service is okay."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
787,383 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
13%
Financial Services Firm
13%
Manufacturing Company
8%
Healthcare Company
7%
Computer Software Company
15%
Manufacturing Company
13%
Financial Services Firm
12%
Government
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up an...
How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
What do you like most about SAP Data Hub?
SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.
What needs improvement with SAP Data Hub?
We moved from Oracle. If you're aware of your monitoring system, the RPU market, and the managed system, you should move to HANA, which is an innovative database built by SAP itself. However, this ...
What is your primary use case for SAP Data Hub?
I technically handle the database, like cycle management projects. When transaction data comes in, we see it based on the retention periods. We have to move the data to some secure storage rather t...
 

Learn More

 

Overview

 

Sample Customers

1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
Kaeser Kompressoren, HARTMANN
Find out what your peers are saying about Microsoft, Informatica, Oracle and others in Data Integration. Updated: May 2024.
787,383 professionals have used our research since 2012.