We performed a comparison between AWS Data Pipeline [EOL] and AWS Glue based on real PeerSpot user reviews.
Find out what your peers are saying about Amazon Web Services (AWS), MuleSoft, Matillion and others in Cloud Data Integration."The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"It is a stable solution...It is a scalable solution."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure."
"One of the best features of the solution is its ability to easily integrate with other AWS services."
"Data catalog and triggers are the two best features for me. AWS Glue has its own data catalog, which makes it great and really easy to use. Triggers are also really good for scheduling the ETL process."
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
"The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
"I like its integration and ability to handle all data-related tasks."
"AWS Glue is quite better than other tools, but you have to learn it properly before you start using it."
"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"The user-defined functions have shortcomings in AWS Data Pipeline."
"Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult."
"AWS Glue is more costly compared to other tools like Airflow."
"If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
"I have encountered challenges with multi-region support."
"The solution's visual ETL tool is of no use for actual implementation."
"There is a learning curve to this tool."
"On occasion, the solution's dashboard reports that a project failed due to runtime but it actually succeeded."
"It fails to handle massive databases acquired from various sources."
AWS Data Pipeline [EOL] doesn't meet the minimum requirements to be ranked in Cloud Data Integration with 2 reviews while AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews. AWS Data Pipeline [EOL] is rated 8.0, while AWS Glue is rated 7.8. The top reviewer of AWS Data Pipeline [EOL] writes "A tool with great orchestration and development capabilities but needs to improve its user-defined functions". On the other hand, the top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". AWS Data Pipeline [EOL] is most compared with AWS Database Migration Service, Oracle Data Integrator (ODI), FME, Perspectium DataSync and IBM InfoSphere DataStage, whereas AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Informatica Cloud Data Integration and Talend Open Studio.
See our list of best Cloud Data Integration vendors.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.