AWS Glue vs Informatica Enterprise Data Catalog comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
11,616 views|8,101 comparisons
92% willing to recommend
Informatica Logo
2,169 views|1,493 comparisons
84% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between AWS Glue and Informatica Enterprise Data Catalog based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed AWS Glue vs. Informatica Enterprise Data Catalog Report (Updated: March 2024).
772,679 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages.""What I like best about AWS Glue is its real-time data backup feature. Last week, there was a production push, and what used to take almost ten days to send out around fifty-six thousand emails now takes only two hours.""The most valuable feature of AWS Glue is its ease of use and good documentation. Additionally, we can do all the transformations that we need.""It is a stable and scalable solution.""AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code.""We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure.""We have found it beneficial when moving data from one source to another.""AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3."

More AWS Glue Pros →

"The most valuable feature of Informatica Enterprise Data Catalog is it provides clients with a full view of the enterprise data assets. For example, how many data assets they have and who owns them.""It can automatically connect or associate business terms with various options, providing flexibility beyond general capabilities.""Multifeatured and easily scalable data catalog, with good data domain discovery and data profiling features.""The solution scales well.""We can scan anything.""I rate the technical support a ten out of ten.""Offers data lineage feature""The capability of the tool to scan and capture the metadata from a variety of sources is one of the capabilities that I find most useful. The central repository into which it is going to put that captured metadata is the best."

More Informatica Enterprise Data Catalog Pros →

Cons
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background.""I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features.""In terms of improvement, the performance of AWS Glue could be faster.""While working on AWS Glue, I could not find any training material for it.""The mapping area and the use of the data catalog from Glue could be better.""It is not clear how the partition discovery would have been affected by more data coming in.""The solution could be cheaper. The price of the solution is an area that needs improvement.""I would like to see stable libraries at the moment they are not there."

More AWS Glue Cons →

"This solution is hard to set up and its interface is not user-friendly. It's also not as stable, and the technical support takes a lot of time to solve simple problems.""The model is somewhat flexible. There are certain aspects of the model that are not as flexible as we would like. It doesn't do certain things to a great level of depth. So, in situations where we want to drill in to do something specific, we have to essentially copy that data into our own structures in order to add that additional layer of flexibility.""The scalability is tough.""Informatica Enterprise Data Catalog could improve by having a much better user interface. It is not user-friendly.""The UI is extremely complex""IEDC can improve the comparison of lineages.""They have to improve their relationship discovery tool. They say that they have AI inside, but this AI did not automatically find relationships or suggested relationships between entities.""Currently, there are limitations in processing and the interface."

More Informatica Enterprise Data Catalog Cons →

Pricing and Cost Advice
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
  • More AWS Glue Pricing and Cost Advice →

  • "I have no idea what the price actually is. It is probably not going to be the cheapest, but it is a pretty stable and robust platform from the backend standpoint."
  • "I rate the product's pricing a five on a scale of one to ten, where one is cheap and ten is expensive."
  • "It's a costly solution"
  • More Informatica Enterprise Data Catalog Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    772,679 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or… more »
    Top Answer:It can automatically connect or associate business terms with various options, providing flexibility beyond general capabilities.
    Top Answer:They could improve the product support for the Arabic language. Currently, there are limitations in processing and the interface itself regarding Arabic language support.
    Top Answer:We use the product for building static and automatic data lineages.
    Ranking
    1st
    Views
    11,616
    Comparisons
    8,101
    Reviews
    32
    Average Words per Review
    419
    Rating
    7.8
    1st
    out of 27 in Metadata Management
    Views
    2,169
    Comparisons
    1,493
    Reviews
    9
    Average Words per Review
    582
    Rating
    7.8
    Comparisons
    Also Known As
    Informatica EDC, Informatica Enterprise Information Catalog, Enterprise Information Catalog
    Learn More
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. The feature controls job resources, removing them depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authority of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.

    Informatica Enterprise Information Catalog provides a machine-learning-based discovery engine to collect data assets across the enterprise while increasing the understanding of those data assets through a graph-based enterprise information catalog. Powered by Informatica’s unique metadata services engine, Enterprise Information Catalog enables business analysts and data stewards to find all types of data across the enterprise; discover relationships among them; enrich data with business glossary terms and crowdsourced annotations; and understand the provenance, quality, and usage of their data.

    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    AIA Singapore, Mattel
    Top Industries
    REVIEWERS
    Computer Software Company47%
    Financial Services Firm18%
    Pharma/Biotech Company12%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company14%
    Manufacturing Company8%
    Insurance Company7%
    REVIEWERS
    Computer Software Company25%
    Construction Company13%
    Insurance Company13%
    Manufacturing Company13%
    VISITORS READING REVIEWS
    Financial Services Firm21%
    Computer Software Company14%
    Government9%
    Manufacturing Company9%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise13%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise73%
    REVIEWERS
    Small Business29%
    Midsize Enterprise7%
    Large Enterprise64%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise9%
    Large Enterprise73%
    Buyer's Guide
    AWS Glue vs. Informatica Enterprise Data Catalog
    March 2024
    Find out what your peers are saying about AWS Glue vs. Informatica Enterprise Data Catalog and other solutions. Updated: March 2024.
    772,679 professionals have used our research since 2012.

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Informatica Enterprise Data Catalog is ranked 1st in Metadata Management with 14 reviews. AWS Glue is rated 7.8, while Informatica Enterprise Data Catalog is rated 7.6. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Informatica Enterprise Data Catalog writes "They listen to their customers, so if something is missing or not working, they will put it on their roadmap". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, Informatica Cloud Data Integration, SSIS and SAP Data Services, whereas Informatica Enterprise Data Catalog is most compared with Alation Data Catalog, Collibra Catalog, Informatica PowerCenter, Denodo and Palantir Foundry. See our AWS Glue vs. Informatica Enterprise Data Catalog report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.