AWS Glue Overview

AWS Glue is the #7 ranked solution in our list of top Cloud Data Integration tools. It is most often compared to Talend Open Studio: AWS Glue vs Talend Open Studio

What is AWS Glue?

AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.

AWS Glue Buyer's Guide

Download the AWS Glue Buyer's Guide including reviews and more. Updated: December 2020

AWS Glue Customers
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
AWS Glue Video

Pricing Advice

What users are saying about AWS Glue pricing:
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."

AWS Glue Reviews

Filter by:
Filter Reviews
Industry
Loading...
Filter Unavailable
Company Size
Loading...
Filter Unavailable
Job Level
Loading...
Filter Unavailable
Rating
Loading...
Filter Unavailable
Considered
Loading...
Filter Unavailable
Order by:
Loading...
  • Date
  • Highest Rating
  • Lowest Rating
  • Review Length
Search:
Showingreviews based on the current filters. Reset all filters
Bruno Ramos
CEO and Founder at HartB
Real User
Top 20
Dec 21, 2020
Improved our time to implement a new ETL process and has a good price and scalability, but only works with AWS

What is our primary use case?

It is a good tool for us. All the implementation in our company is done with AWS Glue. We use it to execute all the ETL processes. We have collected more or less five terabytes of information from the internet by now. We process all this data in our cloud platform and normalize the information. We first put it on a data lake that we have here on the AWS tool. After that, we use AWS Glue to transform all the information collected around the internet and put the normalized information into a data warehouse.

Pros and Cons

  • "The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
  • "The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."

What other advice do I have?

I would rate AWS Glue a seven out of ten.
Akash Shanker
Team Lead at a financial services firm with 5,001-10,000 employees
Real User
Top 20
Oct 16, 2020
It can generate the code and has a good user interface, but it lacks Java support

What is our primary use case?

We are using it for file ingestion. Its primary role is to ingest a file from a vendor to a database.

Pros and Cons

  • "Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
  • "Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."

What other advice do I have?

We have just recently started to use this solution. We haven't used all features properly. It is good for the features we are using. We did not find any drawbacks or limitations so far. We are already getting whatever we want from it. I would rate AWS Glue a seven out of ten. It needs improvements in terms of Java support and the turnaround time for our problems.
Find out what your peers are saying about Amazon, Matillion, Denodo and others in Cloud Data Integration. Updated: December 2020.
455,962 professionals have used our research since 2012.
reviewer1412730
Senior Software Engineer at a consumer goods company with 10,001+ employees
Real User
Sep 5, 2020
It comes with its own data catalog and supports triggers for scheduling the ETL process

What is our primary use case?

We are collecting some TV audience data and analyzing it.

Pros and Cons

  • "Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
  • "The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."

What other advice do I have?

I would recommend AWS Glue. It is a great choice. I would rate this solution a nine out of ten.
Product Categories
Cloud Data Integration
Buyer's Guide
Download our free Cloud Data Integration Report and find out what your peers are saying about Amazon, Matillion, Denodo, and more!