We compared AWS Glue and IBM InfoSphere DataStage across four categories, based on our users’ reviews. After reviewing all of the collected data, we present our conclusion below.
Comparison Results: For users invested in the AWS ecosystem, AWS Glue is hands down the best choice. Users are happier with its pricing, too. IBM InfoSphere DataStage can handle a significant amount of data quickly and easily. Once IBM InfoSphere DataStage fine-tunes its processes and moves toward a greater focus on cloud technologies, it will become a more desirable solution in today’s cloud-focused marketplace.
"We have found it beneficial when moving data from one source to another."
"It is AWS-integrated. There is end-to-end integration with the other AWS services. It is also user-friendly."
"The most valuable features currently are glue studio, jobs, and triggers."
"What I like best about AWS Glue is its real-time data backup feature. Last week, there was a production push, and what used to take almost ten days to send out around fifty-six thousand emails now takes only two hours."
"The most valuable feature of AWS Glue is that it provides a GUI format with a drag-and-drop feature."
"AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3."
"The most valuable feature of AWS Glue is its ease of use and good documentation. Additionally, we can do all the transformations that we need."
"Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs."
"The concept of integration is a valuable feature of the product."
"Offers great flexibility."
"The solution is stable."
"I am impressed with the tool's ETL tracing."
"Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
"The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities."
"The solution has improved the time it takes to perform tasks related to batch applications."
"When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
"I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features."
"The solution could be cheaper. The price of the solution is an area that needs improvement."
"Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult."
"While working on AWS Glue, I could not find any training material for it."
"It is not clear how the partition discovery would have been affected by more data coming in."
"The mapping area and the use of the data catalog from Glue could be better."
"Cost-wise, AWS Glue is expensive, so that's an area for improvement. The process for setting up the solution was also complex, which is another area for improvement."
"One area that could be improved is the ETL view. The drag-and-drop interface is not as user-friendly as some other ETL tools."
"We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great."
"The interface needs improvement."
"It would be useful to provide support for Python, AR, and Java."
"There could be more customization options for the product."
"The template mapping could be easier."
"The documentation and in-application help for this solution need to be improved, especially for new features."
"DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey."
"The setup is extremely difficult."
AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews, while IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews. Both AWS Glue and IBM InfoSphere DataStage are rated 7.8. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Informatica Cloud Data Integration and Matillion ETL, whereas IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Talend Open Studio and SnapLogic.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.