We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"It's already very user-friendly and has a good dashboard."
"There are many good features in this solution including the data fields, database integration, support for SQL views, and the lookups for matching information."
"The most valuable features for our company are the flexibility and the quick turn around time in producing simple ETL solutions."
"I have used most of the standard SQL features, but the ones that stand out are the Data Flows and Bulk Import."
"It is easily scheduled and integrates well with SQL Server and SQL Server Agent jobs."
"Its compatibility with Microsoft products has been very valuable to our company. It fits well within the architecture."
"The simplicity of the solution is great. The solution also offers excellent integration."
"The reporting on the solution is perfect. I didn't expect to see reporting features, but they are great."
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
"The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."
"Sometimes when we want to publish to other types of databases it's not easy to publish to those databases. For example, the Jet Database Engine. Before the SSIS supported Jet Database Engine but nowadays it doesn't support the Jet Database Engine. We connect to many databases such as Access database, SparkPros databases and the other types of databases using Jet Database Engines now and SSIS now doesn't seem to support it in our databases."
"The debugging could be improved because when it came to solving the errors that I've experienced in the past, I've had to look at the documentation for more information."
"I come from a coding background and this tool is graphically based. Sometimes I think it's cumbersome to do mapping graphically. If there was a way to provide a simple script, it would be helpful and make it easier to use."
"This solution needs full support for real-time processing."
"Improving the login procedure would make our reporting easier on monitoring our ETL processes."
"We're in the process of switching to Informatica, and we need to work out data lineage and data profiling and to improve the quality of our data. SSIS, however, is not that compatible with Informatica. We managed to connect it to Informatica Metadata Manager, but we don't get good lineage, so we have to redo all our ETLs using the Informatica process in order to accept the proper data lineage."
"The interface could use improvement, as well as the administrative tools. Jobs fail from time to time for different reasons. It's not a problem with Microsoft, or SSIS itself. The problems are external, but to find the problems and analyze them it takes too much time."
"There was also not enough instructions from Microsoft in regards to this application or this technology, which can easily be improved upon."
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"My advice is to look at what your configuration will be because most companies have their own deals with Microsoft."
"This solution is included with the MSSQL server package."
"t's incredibly cost effective, easy to learn the basics quickly (although like all ETL tools requires the traditional learning curve to get good at) and has an immense user base."
"It would be beneficial if the solution had a less costly cloud offering."
"Based on my experience and understanding, Talend comes out to be a little bit expensive as compared to SSIS. The average cost of having Talend with Talend Management Console is around 72K per region, which is much higher than SSIS. SSIS works very well with Microsoft technologies, and if you have Microsoft technologies, it is not really expensive to have SSIS. If you have SQL Server, SSIS is free."
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.
AWS Glue is ranked 7th in Cloud Data Integration with 3 reviews while SSIS is ranked 2nd in Data Integration Tools with 21 reviews. AWS Glue is rated 7.6, while SSIS is rated 7.8. The top reviewer of AWS Glue writes "Improved our time to implement a new ETL process and has a good price and scalability, but only works with AWS". On the other hand, the top reviewer of SSIS writes "SSIS 2016 - The good, the bad, and the ugly". AWS Glue is most compared with Talend Open Studio, Informatica PowerCenter, AWS Database Migration Service, IBM InfoSphere DataStage and Informatica Enterprise Data Catalog, whereas SSIS is most compared with Informatica PowerCenter, Talend Open Studio, Oracle Data Integrator (ODI), IBM InfoSphere DataStage and WhereScape RED.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.