We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"The most valuable feature is the building of mockups and tasks."
"The program is stable and scalable."
"The features I find most valuable is the synchronization, verification, functionalities and all the data integration features."
"The user interface which is very easy to use if we have any problems to solve."
"I do a quite a lot of data transformations, and the fact that I can do them without changing any of my SQL queries from the code, using the inbuilt tools, is very helpful."
"Whether we need data cleansing or data mastering, we get it all in one platform."
"The solution's initial setup is quite straightforward."
"Their new licensing is very flexible. With Informatica Cloud, you have plenty of items under the same umbrella, such as services, offerings, data quality, and data masking. You have also got master data management and API management. What I really like about them is that you don't need to go to Informatica and say that you need a data integration module. You would say that you need iPaaS or Informatica Cloud. They'll then try to understand your needs and give you IPUs, which are the processing units. If I purchased a hundred IPUs from Informatica as a customer, I can use 70 IPUs for data integration. I would also need data quality, so I can use 10 IPUs for data quality. I can use the remaining 20 IPUs for API management. Down the line, if I see that my initial data integration needs for the development phase are met, then out of the 70 IPUs assigned for data integration, I can use 30 IPUs for data masking. I can shuffle these numbers in any way within the Informatica Cloud umbrella for the tenure for which I have subscribed to these IPUs. I can use all services the way I want. This flexibility is what I really love about Informatica. It also has got good connectors."
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
"The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."
"I would like to see more functionality added so that it is a bit closer to how much you can do with Informatica PowerCenter."
"One area that needs to improve is the user experience because it is very complex. The trial version is very complex so it's not easy to start using the program immediately. You must study the rules first."
"I would also like to have profiling functionalities and quality transformations in the cloud."
"A general improvement in icons and the virtual interface would be good."
"I would like to see support for more data sources."
"Connectivity could be improved, it can be a little slow."
"There may be some types of limitations with the performance."
"The regions in which the data resides are still limited. This could be an issue in terms of the data residency laws of some of the countries. They should get more regions."
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"My understanding is that Informatica is quite expensive compare to other tools that are available in the market."
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.
AWS Glue is ranked 7th in Cloud Data Integration with 3 reviews while Informatica Cloud Data Integration is ranked 2nd in Cloud Data Integration with 8 reviews. AWS Glue is rated 7.6, while Informatica Cloud Data Integration is rated 7.6. The top reviewer of AWS Glue writes "Improved our time to implement a new ETL process and has a good price and scalability, but only works with AWS". On the other hand, the top reviewer of Informatica Cloud Data Integration writes "Flexible licensing, good connectors, and timely upgrades and patches". AWS Glue is most compared with Talend Open Studio, Informatica PowerCenter, AWS Database Migration Service, IBM InfoSphere DataStage and Matillion ETL, whereas Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, Matillion ETL, Mule Anypoint Platform and Oracle Integration Cloud Service. See our AWS Glue vs. Informatica Cloud Data Integration report.
See our list of best Cloud Data Integration vendors.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.