We performed a comparison between Amazon EMR and Azure Data Factory based on real PeerSpot user reviews.
Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The project management is very streamlined."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"It has a variety of options and support systems."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"This is the best tool for hosts and it's really flexible and scalable."
"It makes it easy to collect data from different sources."
"The data mapping and the ability to systematically derive data are nice features. It worked really well for the solution we had. It is visual, and it did the transformation as we wanted."
"The most valuable features of the solution are its ease of use and the readily available adapters for connecting with various sources."
"The scalability of the product is impressive."
"Most of our customers are Microsoft shops and prefer Azure Data Factory because they have good licensing options and a trust factor with Microsoft."
"It is easy to integrate."
"We use the solution to move data from on-premises to the cloud."
"It is a complete ETL Solution."
"The product's features for storing data in static clusters could be better."
"The product must add some of the latest technologies to provide more flexibility to the users."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"The legacy versions of the solution are not supported in the new versions."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"Modules and strategies should be better handled and notified early in advance."
"One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases."
"The product's technical support has certain shortcomings, making it an area where improvements are required."
"The pricing scheme is very complex and difficult to understand."
"Snowflake connectivity was recently added and if the vendor provided some videos on how to create data then that would be helpful."
"There are limitations when processing more than one GD file."
"There is room for improvement primarily in its streaming capabilities. For structured streaming and machine learning model implementation within an ETL process, it lags behind tools like Informatica."
"We require Azure Data Factory to be able to connect to Google Analytics."
"The Microsoft documentation is too complicated."
Amazon EMR is ranked 8th in Cloud Data Warehouse with 20 reviews while Azure Data Factory is ranked 3rd in Cloud Data Warehouse with 81 reviews. Amazon EMR is rated 7.8, while Azure Data Factory is rated 8.0. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Azure Data Factory writes "The data factory agent is quite good but pricing needs to be more transparent". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Amazon Redshift, Apache Spark and Microsoft Azure Synapse Analytics, whereas Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Alteryx Designer, Snowflake and IBM InfoSphere DataStage. See our Amazon EMR vs. Azure Data Factory report.
See our list of best Cloud Data Warehouse vendors.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.