AWS Glue vs Informatica Cloud Data Integration comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
11,729 views|8,292 comparisons
92% willing to recommend
Informatica Logo
3,500 views|2,815 comparisons
88% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Sep 6, 2022

We performed a comparison between AWS Glue and Informatica Cloud Data Integration based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: For the most part, users of both solutions feel they are easy and straightforward to deploy.
  • Features: AWS Glue can easily sync data from the source to the solution phase and users say it provides excellent intuitive automation. They find it is very robust and flexible, enabling them to write their own queries to achieve the desired transformations quickly. However, they say AWS is not very user friendly and only works with other AWS tools and solutions.

    Informatica Cloud Data Integration offers mass ingestion functionality, and users say it is very flexible, elastic, and is for enterprise organizations. The solution makes it easy to create integrations and they provide many connectors. Many users feel the solution has a steep learning curve and performance limitations.
  • Pricing: AWS Glue users tell us the solution is affordable and offers a pay-as-you-use option. Informatica Cloud Data Integration users feel the pricing could be improved.
  • Service and Support: Overall, users are satisfied with the service and support.

Comparison Results: For users vested in the AWS ecosystem, AWS is hands down the best choice. Informatica Cloud Data Integration is flexible and allows users to decide how to distribute their IPUs in their own networks. Data residency laws make it challenging to choose this solution, as their regions are currently very limited.

To learn more, read our detailed AWS Glue vs. Informatica Cloud Data Integration Report (Updated: March 2024).
770,292 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"AWS Glue's best features are scalability and cloud-based features.""AWS Glue is a stable and easy-to-use solution.""AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code.""The most valuable feature of AWS Glue is that it provides a GUI format with a drag-and-drop feature.""The solution is highly user-friendly, and its features are easy to use. The new addition of AWS Glue Data Catalog is also very beneficial, making the tool even more helpful for its users.""The solution helps organizations gain flexibility in defining the structure of the data.""The most valuable feature for me is the visual interface of AWS Glue.""I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."

More AWS Glue Pros →

"The Mapping Designer allows for declarative ETL development (visual scripting) that leverages a wide array of different transformations.""Informatica Cloud Data Integration is stable.""The most valuable feature of Informatica Cloud Data Integration is Pushback. You are able to push the data to the solution itself, and it can handle the queries. It helps us a lot. With other tools, the burden is kept on the server.""Data integration is the most valuable feature. The ability to connect to any of the sources and enterprise applications makes our lives easier.""The solution provides increased efficiency while still being user-friendly and easy to operate.""The solution is straightforward.""It is quite easy to use and flexible.""Their new licensing is very flexible. With Informatica Cloud, you have plenty of items under the same umbrella, such as services, offerings, data quality, and data masking. You have also got master data management and API management. What I really like about them is that you don't need to go to Informatica and say that you need a data integration module. You would say that you need iPaaS or Informatica Cloud. They'll then try to understand your needs and give you IPUs, which are the processing units. If I purchased a hundred IPUs from Informatica as a customer, I can use 70 IPUs for data integration. I would also need data quality, so I can use 10 IPUs for data quality. I can use the remaining 20 IPUs for API management. Down the line, if I see that my initial data integration needs for the development phase are met, then out of the 70 IPUs assigned for data integration, I can use 30 IPUs for data masking. I can shuffle these numbers in any way within the Informatica Cloud umbrella for the tenure for which I have subscribed to these IPUs. I can use all services the way I want. This flexibility is what I really love about Informatica. It also has got good connectors."

More Informatica Cloud Data Integration Pros →

Cons
"Glue could perform better. It sometimes takes too long to test a Glue job. Google Cloud Platform offers more Python scripts than AWS.""Cost-wise, AWS Glue is expensive, so that's an area for improvement. The process for setting up the solution was also complex, which is another area for improvement.""The solution's visual ETL tool is of no use for actual implementation.""The mapping area and the use of the data catalog from Glue could be better.""The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS.""If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data.""There should be more connectors for different databases.""The solution could be cheaper. The price of the solution is an area that needs improvement."

More AWS Glue Cons →

"Its pricing model can be improved. The response time from technical support can also be improved.""Informatica Cloud Data Integration can improve by being more user-friendly. When you're working with the solution a lot of technical knowledge is required. It's not a solution that anyone can use properly, you need knowledge of what's happening at the back end, such as SQL. When you get stuck, you need to look into your logic. For other tools, such as Dell Boomi, anyone can use them.""Performance also needs to be significantly improved, especially when connecting to SFDC for read and write operations.""It could be improved by including a buffer that saves data when there is a connectivity issue.""I would like to see support for more data sources.""The regions in which the data resides are still limited. This could be an issue in terms of the data residency laws of some of the countries. They should get more regions.""The current features are a bit complicated, and we need to write big scripts and test.""Error reporting and debugging need improvement."

More Informatica Cloud Data Integration Cons →

Pricing and Cost Advice
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
  • More AWS Glue Pricing and Cost Advice →

  • "It is cost effective and an easily accessible tool."
  • "The pricing structure is good, but having to pay for extra drivers to be used in an ICS environment makes me a little nervous."
  • "Licensing is difficult to understand, but the team is always available to explain anything. They are very helpful."
  • "My understanding is that Informatica is quite expensive compare to other tools that are available in the market."
  • "Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
  • "Its pricing model can be improved."
  • "I'm not sure about the most recent pricing trends, but I don't believe it's significantly different from PowerCenter. I believe it is nearly the same."
  • "The price of Informatica Cloud Data Integration could be reduced."
  • More Informatica Cloud Data Integration Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    770,292 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge… more »
    Top Answer:When it comes to cloud data integration, this solution can provide you with multiple benefits, including Overhead reduction by integrating data on any cloud in various ways Effective integration of… more »
    Ranking
    1st
    Views
    11,729
    Comparisons
    8,292
    Reviews
    32
    Average Words per Review
    419
    Rating
    7.8
    5th
    Views
    3,500
    Comparisons
    2,815
    Reviews
    18
    Average Words per Review
    459
    Rating
    7.7
    Comparisons
    Learn More
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. The feature controls job resources, removing them depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authority of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.

    Informatica Cloud Data Integration is a cloud-native cloud data integration solution that enables users to connect a large number of applications and data sources across on-premises and integrate the data sources at scale on the cloud. The product is built on microservices-driven management and integration platform as a service (iPaaS) and assists organizations to govern costs, increase productivity and collaboration, and simplify their experience. Informatica Cloud Data Integration allows companies to deliver data and analytics to lines of business in a timely manner, build data warehouses on Amazon Redshift, Google Cloud BigQuery, Snowflake, and Microsoft Azure Synapse Analytics, and utilize the required data integration patterns, including elastic processing, extract, load, and transform (ELT), and extract, transform, and load (ETL).

    The solution allows users to to build enterprise-scale integration workloads within hours while it improves the productivity of development teams by providing them a codeless, drag-and-drop user interface. Companies can benefit from integration features built for data warehousing and optimized connectors for bulk loads of billions of records. Informatica Cloud Data Integration offers organizations the option of going serverless at scale by allowing them to process data integration jobs from cloud-hosted as well as managed environments. The Spark-based engine allows the solution to handle high-volume data demands and complex data integration tasks.

    Informatica Cloud Data Integration Features

    Informatica Cloud Data Integration provides its users with various features and tools. Among the key capacities of the product are:

    • Advanced Pushdown Optimization: Informatica Cloud Data Integration offers a feature that provides users with the benefits of ELT while maintaining their data flow definitions at a logical or abstract level. This feature allows users to choose a runtime option that complies with the workload as well as send their data processing work to cloud ecosystem pushdown, cloud data warehouse pushdown, Spark serverless processing, or traditional ETL.

    • Connectors for all major data sources: This feature provides out-of-the-box connectivity to a large number of cloud and on-premise systems, data stores, analytics and BI tools, and enterprise and middleware applications.

    • Data transformation capabilities: This feature allows users to process data transformation in real time or batch by using a variety of transformation types, such as cleansing, masking, aggregation, fileting, parsing, and ranking.

    • Spark-based complex data integration: Informatica Cloud Data Integration Elastic allows specialists to use elastic clusters to process their data transformation.

    • Codeless integration: This feature facilitates the creation of simple-to-sophisticated data integration projects with a visual mapping designer that speeds up pre-build transformations for development through a variety of endpoints across cloud and on-premises.

    • Serverless data integration: Users can achieve cloud data integration in a mode called Advanced Serverless, where they can benefit from a fully managed environment with no software, no cloud administration, and no servers or clusters to manage.

    • Taskflow orchestration: This feature allows users to combine batch and real-time integration through a taskflow designer in order to create simple-to-sophisticated orchestrations.

    • Intelligent structure discovery: This feature uses the CLAIRE engine to automatically understand the parsing model for complicated files based on their structure.

    • Change data capture: Utilizing the prebuilt task wizards and Change Data Capture tool, users can automatically pull only the updated or incremental data from source systems to the targets on a frequent basis.

    • Security: The product offers various features which ensure the highest level of data and workload security and comply with various policies.

    Informatica Cloud Data Integration Benefits

    Informatica Cloud Data Integration brings multiple benefits to its users. These include:

    • The product offers optimized connectivity to various systems through custom build-connectors.

    • Users can benefit from improved elasticity and performance by utilizing Spark clusters and auto-tuning.

    • The tool allows developers to focus on business logic by facilitating infrastructure management through serverless deployment features.

    • Informatica Data Cloud Integration provides user flexibility by connecting to any database, cloud data lake, on-premise apps, and data warehouses.

    • Through a zero-coding environment and role-appropriate user experience, the solution is suitable for all types of users.

    • The solution offers consistent experience and unified metadata across all cloud services.

    • Users can leverage enterprise-level performance for integration design with no coding required.

    • Informatica Data Cloud Integration scales as a business grows, providing a high level of adaptability.

    Reviews from Real Users

    Divya R., a senior consultant at Deloitte, rates Informatica Cloud Data Integration highly because it is a UI-based tool with great scripting.

    A data architect at a retailer likes Informatica Cloud Data Integration because of its flexible licensing, good connectors, and timely upgrades and patches.

    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    Chicago Cubs, Telegraph Media Group
    Top Industries
    REVIEWERS
    Computer Software Company47%
    Financial Services Firm18%
    Pharma/Biotech Company12%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company13%
    Manufacturing Company7%
    Insurance Company7%
    REVIEWERS
    Computer Software Company37%
    Pharma/Biotech Company21%
    Manufacturing Company11%
    Individual & Family Service5%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    Computer Software Company14%
    Manufacturing Company9%
    Insurance Company8%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise13%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise73%
    REVIEWERS
    Small Business21%
    Midsize Enterprise21%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    Buyer's Guide
    AWS Glue vs. Informatica Cloud Data Integration
    March 2024
    Find out what your peers are saying about AWS Glue vs. Informatica Cloud Data Integration and other solutions. Updated: March 2024.
    770,292 professionals have used our research since 2012.

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Informatica Cloud Data Integration is ranked 5th in Cloud Data Integration with 40 reviews. AWS Glue is rated 7.8, while Informatica Cloud Data Integration is rated 7.8. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Informatica Cloud Data Integration writes "A stable, scalable, and user-friendly solution". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Talend Open Studio and Oracle Integration Cloud Service, whereas Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, Fivetran, Mule Anypoint Platform and SAP Data Services. See our AWS Glue vs. Informatica Cloud Data Integration report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.