Cancel
You must select at least 2 products to compare!
AWS Glue Logo
8,543 views|7,227 comparisons
StreamSets Logo
6,145 views|4,494 comparisons
Top Review
Find out what your peers are saying about MuleSoft, Informatica, Denodo and others in Cloud Data Integration. Updated: September 2021.
534,299 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
"Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process.""Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you.""The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."

More AWS Glue Pros »

"It is really easy to set up and the interface is easy to use."

More StreamSets Pros »

Cons
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3.""Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background.""The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."

More AWS Glue Cons »

"We've seen a couple of cases where it appears to have a memory leak or a similar problem."

More StreamSets Cons »

Pricing and Cost Advice
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes.""It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us.""Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."

More AWS Glue Pricing and Cost Advice »

"We are running the community version right now, which can be used free of charge."

More StreamSets Pricing and Cost Advice »

report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
534,299 professionals have used our research since 2012.
Questions from the Community
Top Answer: AWS Glue and Azure Data factory for ELT best performance cloud services.
Top Answer: The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features.
Top Answer: Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is also good in terms of the financial planning of the company… more »
Top Answer: It is really easy to set up and the interface is easy to use.
Top Answer: We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then we'd have to restart the container, maybe once a month when it gets high.
Top Answer: We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
Ranking
6th
Views
8,543
Comparisons
7,227
Reviews
3
Average Words per Review
591
Rating
7.7
24th
Views
6,145
Comparisons
4,494
Reviews
1
Average Words per Review
399
Rating
8.0
Comparisons
Learn More
StreamSets
Video Not Available
Overview

AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.

StreamSets Dataflow Performance Manager was created to enable enterprises to harness their data in motion. It unifies visibility and control of dataflows, which reduces management costs, improves data quality and enables IT agility.

Offer
Learn more about AWS Glue
Learn more about StreamSets
Sample Customers
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Cisco, Lithium Technologies, Cloudera, Elastic
Top Industries
VISITORS READING REVIEWS
Computer Software Company26%
Media Company15%
Comms Service Provider11%
Financial Services Firm9%
VISITORS READING REVIEWS
Computer Software Company33%
Insurance Company11%
Comms Service Provider11%
Energy/Utilities Company7%
Find out what your peers are saying about MuleSoft, Informatica, Denodo and others in Cloud Data Integration. Updated: September 2021.
534,299 professionals have used our research since 2012.

AWS Glue is ranked 6th in Cloud Data Integration with 3 reviews while StreamSets is ranked 24th in Data Integration Tools with 1 review. AWS Glue is rated 7.6, while StreamSets is rated 8.0. The top reviewer of AWS Glue writes "Improved our time to implement a new ETL process and has a good price and scalability, but only works with AWS". On the other hand, the top reviewer of StreamSets writes "Easy to set up and use, and the functionality for transforming data is good". AWS Glue is most compared with Talend Open Studio, Informatica PowerCenter, AWS Database Migration Service, IBM InfoSphere DataStage and SSIS, whereas StreamSets is most compared with Informatica PowerCenter, Spring Cloud Data Flow, Azure Data Factory, SSIS and Pentaho Data Integration.

We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.