Amazon EMR Room for Improvement

Quan Vu - PeerSpot reviewer
Data Governance Manager at VPBFC

The product's stability could be even better.

View full review »
Ilya Afanasyev - PeerSpot reviewer
Senior Software Development Engineer at Yahoo!

The problem for us is it starts very slow. They need to improve the start time. If we use a long-running EMR, it costs a lot of money. However, when we start, for example, a job, if the job runs for one hour, it's normal as it starts in about ten minutes. If we want, for example, to run each five minutes, it's a problem if it takes ten minutes to start. It's a little bit weird that you cannot use the service within a short period. 

The support could be better.

View full review »
Prashant  Singh - PeerSpot reviewer
Vice President -Product Management at a computer software company with 1,001-5,000 employees

The cost is increasing. We are looking into how we can optimize the cost part of EMR. We're doing a comparison between Cloudera running on AWS and running AWS EMR.

We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part. 

View full review »
Buyer's Guide
Amazon EMR
April 2024
Learn what your peers think about Amazon EMR. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
767,667 professionals have used our research since 2012.
RahulJadhav - PeerSpot reviewer
Hadoop Administrator at Capgemini

Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services.

View full review »
CG
AWS / Big Data Engineer at Waste Management, Inc.

There is room for improvement in pricing. 

View full review »
VinayKumar2 - PeerSpot reviewer
Lead Data Engineer at Seven Lakes Enterprises, Inc.

Interdependencies with a third-party or open source solution should be improved. Modules and strategies should be better handled and notified early in advance. Maybe if AWS starts releasing AWS-certified or AWS-verified installations, that will generate even more confidence just like OpenJet, it'll add a specific version. 

View full review »
ArnabChatterjee - PeerSpot reviewer
Development Engineer at Signify

We have had issues with the boolean mathematical operation in 2. X's big version is working in newer versions because the old version of the solution does not support it, which is a compatibility issue that can be improved. In addition, the legacy versions of the solution are not supported in the new versions.

View full review »
Atif Tariq - PeerSpot reviewer
Cloud and Big Data Engineer | Developer at Huawei Cloud Middle East

The product must add some of the latest technologies to provide more flexibility to the users.

View full review »
AO
Lead Data Scientist at a manufacturing company with 10,001+ employees

The product can be improved by automating their up-sizing and downsizing their cluster.

View full review »
Atif Tariq - PeerSpot reviewer
Cloud and Big Data Engineer | Developer at Huawei Cloud Middle East

As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.

View full review »
VV
Technology Analyst at Proodlehospitatilityservicesltd

The product's features for storing data in static clusters could be better. It would be helpful if they released a beta version for limited users to know about the product.

View full review »
AG
Engineering Manager/Solution architect at a computer software company with 201-500 employees

Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana. 

View full review »
it_user744720 - PeerSpot reviewer
Data Science Engineer

There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange. It could have been a red herring, it could have been that something else changed in our environment that we never found out. But all of a sudden one day we couldn't run our scripts to start up clusters, the things we could do the day before. It was because they'd released a new version and we had to change things around.

They have listened to the community quite a bit. So, the things that we had suggested to them - they sometimes have older versions of some of these tools because they're open source and Amazon creates their own version of these. Like, for instance, the version of Hive was pretty far behind for a quite a while.

They've addressed that and I think it's partially because of customers like us telling them, "Hey, there are a lot of new features that should be available but aren't in your distribution."

View full review »
CR
Deputy CTO at a tech company with 51-200 employees

The most complicated thing is configuring to the cluster and to ensure it's running correctly. You need to configure at least three Amazon policies to get authorization for all the instances. And if you're new on the system it's really complicated. It's something that could be simplified for users. For additional features, I'd like to see a better MLOps platform but it's possible that it's already in production. 

View full review »
FN
Responsable sofware factory at BOX AFRICA

The dashboard management could be better. Right now, it's lacking a bit. 

I'd like more of a remote connection between my computer and the solution.

We have multi-factor authentication, and at one point it was an issue due to the fact that I lost my phone. It stopped me from accessing the system.

We have to replicate all the infrastructure and we need to ensure that we have the scalability and to do so in production. We are hoping that Amazon will allow us to scale easily. However, we have not attempted to scale just yet.

View full review »
it_user334827 - PeerSpot reviewer
Big Data Specialist at a media company with 501-1,000 employees

Better monitoring, debugging, and stability are all needed.

View full review »
it_user347568 - PeerSpot reviewer
Big Data Architect at a tech services company with 1,001-5,000 employees

Quicker and offer automation deployment on multiple nodes.

View full review »
it_user1158 - PeerSpot reviewer
Developer at a tech company with 51-200 employees
- Setting up jobs for operations like data mining, web indexing, and machine learning is comparatively easier than log file analysis, financial analysis, etc . - For novice users, there is a bit of a steep learning curve, but things become much easier once you have the basics under your belt. - One of the lacking features is good web support. Though the web interface looks pretty decent, some of the basic features are missing. For example, you will find it a bit difficult to customize a particular map to reduce tasks, which involves a lot of customizations with regard to a given web indexing task. This involves extensive use of the underlying HDFS file system. View full review »
it_user1227 - PeerSpot reviewer
Tech Support Staff at a tech company with 51-200 employees
The web interface for managing all your cloud services is a bit patchy and needs improvement. The way the services and features are intermingled is quite difficult for a new user to get acquainted with. This requires a decent amount of time investment for learning the initial basics. Setting up map reduce tasks for financial analysis, file analysis is very difficult unlike other tasks like data mining etc. View full review »
Buyer's Guide
Amazon EMR
April 2024
Learn what your peers think about Amazon EMR. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
767,667 professionals have used our research since 2012.