We performed a comparison between AWS Glue and IBM Infosphere DataStage based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.
Comparison Results: For users vested in the AWS ecosystem, AWS is hands down the best choice. Users are happier with the pricing, too. IBM Infosphere can handle a significant amount of data quickly and easily. Once IBM Infosphere DataStage finetunes processes and moves toward a greater focus on cloud technologies, it will become a more desirable solution in today’s cloud-focused marketplace.
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
"The product has a valuable feature for data catalog."
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
"AWS Glue is quite better than other tools, but you have to learn it properly before you start using it."
"AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."
"One of the best features of the solution is its ability to easily integrate with other AWS services."
"The most valuable feature of AWS Glue is scalability."
"It is a stable and scalable solution."
"When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
"IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target."
"Finding logs is very easy on the solution."
"I am impressed with the tool's ETL tracing."
"The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable."
"The product is easy to deploy."
"The most valuable feature for our data processing needs is IBM InfoSphere DataStage's capability to handle ETL tasks with large record volumes."
"The concept of integration is a valuable feature of the product."
"It is not clear how the partition discovery would have been affected by more data coming in."
"The product has only a few built-in transformations."
"In terms of performance, if they can further optimize the execution time for serverless jobs, it would be a welcome improvement."
"In terms of improvement, the performance of AWS Glue could be faster."
"I would like to see stable libraries at the moment they are not there."
"The solution’s stability could be improved."
"While working on AWS Glue, I could not find any training material for it."
"The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
"The interface needs improvement."
"The solution can be a bit more user-friendly, similar to Informatica."
"It would be useful to provide support for Python, AR, and Java."
"The initial setup can be complex."
"The initial setup could be more straightforward."
"What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well."
"The response time from support is slow and needs to be improved."
"The interface needs work to be more user-friendly."
AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews. AWS Glue is rated 7.8, while IBM InfoSphere DataStage is rated 7.8. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, Informatica Cloud Data Integration, SSIS and Matillion ETL, whereas IBM InfoSphere DataStage is most compared with SSIS, IBM Cloud Pak for Data, Azure Data Factory, Talend Open Studio and Oracle GoldenGate. See our AWS Glue vs. IBM InfoSphere DataStage report.
See our list of best Cloud Data Integration vendors.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.