Azure Data Factory vs IBM Cloud Pak for Data comparison

Cancel
You must select at least 2 products to compare!
Microsoft Logo
25,660 views|20,160 comparisons
91% willing to recommend
IBM Logo
4,032 views|2,639 comparisons
84% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Azure Data Factory and IBM Cloud Pak for Data based on real PeerSpot user reviews.

Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Azure Data Factory vs. IBM Cloud Pak for Data Report (Updated: May 2024).
772,649 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"I like the basic features like the data-based pipelines.""The workflow automation features in GitLab, particularly its low code/no code approach, are highly beneficial for accelerating development speed. This feature allows for quick creation of pipelines and offers customization options for integration needs, making it versatile for various use cases. GitLab supports a wide range of connectors, catering to a majority of integration needs. Azure Data Factory's virtual enterprise and monitoring capabilities, the visual interface of GitLab makes it user-friendly and easy to teach, facilitating adoption within teams. While the monitoring capabilities are sufficient out of the box, they may not be as comprehensive as dedicated enterprise monitoring tools. GitLab's monitoring features are manageable for production use, with the option to integrate log analytics or create custom dashboards if needed. The data flow feature in Azure Data Factory within GitLab is valuable for data transformation tasks, especially for those who may not have expertise in writing complex code. It simplifies the process of data manipulation and is particularly useful for individuals unfamiliar with Spark coding. While there could be improvements for more flexibility, overall, the data flow feature effectively accomplishes its purpose within GitLab's ecosystem.""The most important feature is that it can help you do the multi-threading concepts.""In terms of my personal experience, it works fine.""I think it makes it very easy to understand what data flow is and so on. You can leverage the user interface to do the different data flows, and it's great. I like it a lot.""I like its integration with SQL pools, its ability to work with Databricks, its pipelines, and the serverless architecture are the most effective features.""On the tool itself, we've never experienced any bugs or glitches. There haven't been crashes. Stability has been good.""It's extremely consistent."

More Azure Data Factory Pros →

"One of Cloud Pak's best features is the Watson Knowledge Catalog, which helps you implement data governance.""It is a scalable solution, and we have had no issues with its scalability in our company. I rate the solution's scalability a nine out of ten.""Cloud Pak's most valuable features are IBM MQ, IBM App Connect, IBM API Connect, and ISPF.""Its data preparation capabilities are highly valuable.""Scalability-wise, I rate the solution a nine or ten out of ten.""The most valuable feature of IBM Cloud Pak for Data is the Modeler flows. The ability to develop models using a graphical approach and the capability to connect to various sources, as well as the data virtualization capabilities, allow me to easily access and utilize data that is dispersed across different sources.""The most valuable features of IBM Cloud Pak for Data are the Watson Studio, where we can initiate more groups and write code. Additionally, Watson Machine Learning is available with many other services, such as APIs which you can plug the machine learning models.""You can model the data there, connect the data models with the business processes and create data lineage processes."

More IBM Cloud Pak for Data Pros →

Cons
"The support and the documentation can be improved.""The pricing scheme is very complex and difficult to understand.""Azure Data Factory could benefit from improvements in its monitoring capabilities to provide a more robust feature set. Enhancing the ease of deployment to higher environments within Azure DevOps would be beneficial, as the current process often requires extensive scripting and pipeline development. It is also known for the flexibility of the data flow feature, particularly in supporting more dynamic data-driven architectures. These enhancements would contribute to a more seamless and efficient workflow within GitLab.""DataStage is easier to learn than Data Factory because it's more visual. Data Factory has some drag-and-drop options, but it's not as intuitive as DataStage. It would be better if they added more drag-and-drop features. You can start using DataStage without knowing the code. You don't need to learn how the code works before using the solution.""The thing we missed most was data update, but this is now available as of two weeks ago.""Azure Data Factory's pricing in terms of utilization could be improved.""There is no built-in pipeline exit activity when encountering an error.""Azure Data Factory should be cheaper to move data to a data center abroad for calamities in case of disasters."

More Azure Data Factory Cons →

"The interface could improve because sometimes it becomes slow. Sometimes there is a delay between clicks when using the software, which can make the development process slow. It can take a few seconds to complete one action, and then a few more seconds to do the next one.""One thing that bugs me is how much infrastructure Cloud Pak requires for the initial deployment. It doesn't allow you to start small. The smallest permitted deployment is too big. It's a huge problem that prevents us from implementing the solution in many scenarios.""The solution could have more connectors.""One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output. Too many changes have been made, and my company has around one hundred thousand mappings, so my team has been putting more effort into alternative ways to do things. Another area for improvement in IBM Cloud Pak for Data is that it's more complicated to shift from on-premise to the cloud. Other vendors provide secure agents that easily connect with your existing setup. Still, with IBM Cloud Pak for Data, you have to perform connection migration steps, upgrade to the latest version, etc., which makes it more complicated, especially as my company has XML-based mappings. Still, the XML input and output capabilities of IBM Cloud Pak for Data have been discontinued, so I'd like IBM to bring that back.""The tool depends on the control plane, an OpenShift container platform utilized as an orchestration layer...So, we have communicated this issue to IBM and asked if it is feasible to adapt the solution to work on a Kubernetes platform that we support.""The product must improve its performance.""The product is trying to be more maturity in terms of connectors. That, I believe, is an area where Cloud Pak can improve.""Cloud Pak would be improved with integration with cloud service providers like Cloudera."

More IBM Cloud Pak for Data Cons →

Pricing and Cost Advice
  • "In terms of licensing costs, we pay somewhere around S14,000 USD per month. There are some additional costs. For example, we would have to subscribe to some additional computing and for elasticity, but they are minimal."
  • "This is a cost-effective solution."
  • "The price you pay is determined by how much you use it."
  • "Understanding the pricing model for Data Factory is quite complex."
  • "I would not say that this product is overly expensive."
  • "The licensing is a pay-as-you-go model, where you pay for what you consume."
  • "Our licensing fees are approximately 15,000 ($150 USD) per month."
  • "The licensing cost is included in the Synapse."
  • More Azure Data Factory Pricing and Cost Advice →

  • "I think that this product is too expensive for smaller companies."
  • "I don't have the exact licensing cost for IBM Cloud Pak for Data, as my company is still finalizing requirements, including monthly, yearly, and three-year licensing fees. Still, on a scale of one to five, I'd rate it a three because, compared to other vendors, it's more complicated."
  • "Cloud Pak's cost is a little high."
  • "IBM Cloud Pak for Data is expensive. If we include the training time and the machine learning, it's expensive. The cost of the execution is more reasonable."
  • "For the licensing of the solution, there is a yearly payment that needs to be made. Also, since it is expensive, cost-wise, I rate the solution an eight or nine out of ten."
  • "It's quite expensive."
  • "The solution is expensive."
  • More IBM Cloud Pak for Data Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
    772,649 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up and… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power… more »
    Top Answer:DataStage allows me to connect to different data sources.
    Top Answer:The product must improve its performance. We see typical cloud-related issues in the solution. IBM can still focus more on keeping the performance up and keeping it 100% available all the time.
    Ranking
    1st
    out of 101 in Data Integration
    Views
    25,660
    Comparisons
    20,160
    Reviews
    47
    Average Words per Review
    509
    Rating
    8.0
    17th
    out of 101 in Data Integration
    Views
    4,032
    Comparisons
    2,639
    Reviews
    9
    Average Words per Review
    500
    Rating
    8.4
    Comparisons
    Also Known As
    Cloud Pak for Data
    Learn More
    Overview

    Azure Data Factory efficiently manages and integrates data from various sources, enabling seamless movement and transformation across platforms. Its valuable features include seamless integration with Azure services, handling large data volumes, flexible transformation, user-friendly interface, extensive connectors, and scalability. Users have experienced improved team performance, workflow simplification, enhanced collaboration, streamlined processes, and boosted productivity.

    IBM Cloud Pak® for Data is a fully-integrated data and AI platform that modernizes how businesses collect, organize and analyze data to infuse AI throughout their organizations. Cloud-native by design, the platform unifies market-leading services spanning the entire analytics lifecycle. From data management, DataOps, governance, business analytics and automated AI, IBM Cloud Pak for Data helps eliminate the need for costly, and often competing, point solutions while providing the information architecture you need to implement AI successfully.

    Building on the streamlined hybrid-cloud foundation of Red Hat® OpenShift®, IBM Cloud Pak for Data takes advantage of the underlying resource and infrastructure optimization and management. The solution fully supports multicloud environments such as Amazon Web Services (AWS), Azure, Google Cloud, IBM Cloud™ and private cloud deployments. Find out how IBM Cloud Pak for Data can lower your total cost of ownership and accelerate innovation.

    Sample Customers
    1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
    Qatar Development Bank, GuideWell, Skanderborg Music Festival
    Top Industries
    REVIEWERS
    Computer Software Company34%
    Insurance Company11%
    Manufacturing Company8%
    Financial Services Firm8%
    VISITORS READING REVIEWS
    Computer Software Company13%
    Financial Services Firm13%
    Manufacturing Company8%
    Healthcare Company7%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Computer Software Company11%
    Manufacturing Company8%
    Government7%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise19%
    Large Enterprise52%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise69%
    REVIEWERS
    Small Business46%
    Large Enterprise54%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise7%
    Large Enterprise76%
    Buyer's Guide
    Azure Data Factory vs. IBM Cloud Pak for Data
    May 2024
    Find out what your peers are saying about Azure Data Factory vs. IBM Cloud Pak for Data and other solutions. Updated: May 2024.
    772,649 professionals have used our research since 2012.

    Azure Data Factory is ranked 1st in Data Integration with 81 reviews while IBM Cloud Pak for Data is ranked 17th in Data Integration with 11 reviews. Azure Data Factory is rated 8.0, while IBM Cloud Pak for Data is rated 8.0. The top reviewer of Azure Data Factory writes "The data factory agent is quite good but pricing needs to be more transparent". On the other hand, the top reviewer of IBM Cloud Pak for Data writes "A scalable data analytics and digital transformation tool that provides useful features and integrations". Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Alteryx Designer, Snowflake and IBM InfoSphere DataStage, whereas IBM Cloud Pak for Data is most compared with IBM InfoSphere DataStage, Informatica Cloud Data Integration, Palantir Foundry, Denodo and IBM InfoSphere Information Server. See our Azure Data Factory vs. IBM Cloud Pak for Data report.

    See our list of best Data Integration vendors.

    We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.