Amazon EMR vs Amazon Redshift comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
2,388 views|2,059 comparisons
85% willing to recommend
Amazon Web Services (AWS) Logo
8,203 views|6,066 comparisons
87% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EMR and Amazon Redshift based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Amazon EMR vs. Amazon Redshift Report (Updated: March 2024).
768,857 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"It has a variety of options and support systems.""We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot.""The initial setup is straightforward.""When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark.""The solution is pretty simple to set up.""Amazon EMR is a good solution that can be used to manage big data.""In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance.""The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."

More Amazon EMR Pros →

"Redshift has an advantage when it comes to administration, making it easier to manage and collaborate.""The product is relatively easy to use because there is no indexing and no partitions.""The product offers good support for the data lake.""We have found Machine Learning use cases are very nice.""Redshift Spectrum is the most valuable feature.""The solution's flexibility is its most valuable feature. It's also easy to scale and has relatively painless pricing.""The most valuable feature of Amazon Redshift is its ability to handle really large sets of data.""The feature that we find most useful is the ability to do analytics on the fly."

More Amazon Redshift Pros →

Cons
"The dashboard management could be better. Right now, it's lacking a bit.""The legacy versions of the solution are not supported in the new versions.""There is room for improvement in pricing.""There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange.""The initial setup was time-consuming.""The product must add some of the latest technologies to provide more flexibility to the users.""Modules and strategies should be better handled and notified early in advance.""There is no need to pay extra for third-party software."

More Amazon EMR Cons →

"The solution has four maintenance windows so, when it comes to stability, I think it would be better to decrease their number.""AWS Snowflake has a very good feature for cloning databases. It makes it easy to clone a data warehouse, which is useful. I would like to see this feature in Redshift.""They should provide a better way to work with interim data in a structured way than to store it in parquet files locally.""We are using third-party tools to integrate Amazon Redshift, they should create their own interface on their own for it to be easily connected on the AWS itself.""The refreshment rate of data reaching Redshift from other sources should be faster.""It would be good to see Redshift as a serverless offering.""The technical support should be better in terms of their knowledge, and they should be more customer-friendly.""Amazon should provide more cloud-native tools that can integrate with Redshift like Microsoft's development tools for Azure."

More Amazon Redshift Cons →

Pricing and Cost Advice
  • "You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
  • "The cost of Amazon EMR is very high."
  • "The price of the solution is expensive."
  • "Amazon EMR's price is reasonable."
  • "There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
  • "There is no need to pay extra for third-party software."
  • "Amazon EMR is not very expensive."
  • "The product is not cheap, but it is not expensive."
  • More Amazon EMR Pricing and Cost Advice →

  • "Redshift is very cost effective for a cloud based solution if you need to scale it a lot. For smaller data sizes, I would think about using other products."
  • "If you want a fixed price, an to not worry about every query, but you need to manage your nodes personally, use Redshift."
  • "BI is sold to our customer base as a part of the initial sales bundle. A customer may elect to opt for a white labeled site for an up-charge."
  • "One of my customers went with Google Big Query over Redshift because it was significantly cheaper for their project."
  • "Per hour pricing is helpful to keep the costs of a pilot down, but long-term retention is expensive."
  • "It's around $200 US dollars. There are some data transfer costs but it's minimal, around $20."
  • "The best part about this solution is the cost."
  • "The part that I like best is that you only pay for what you are using."
  • More Amazon Redshift Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
    768,857 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon EMR is a good solution that can be used to manage big data.
    Top Answer:As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.
    Top Answer:Amazon Redshift is very fast, has a very good response time, and is very user-friendly. The initial setup is very straightforward. This solution can merge and integrate well with many different… more »
    Top Answer:Redshift Spectrum is the most valuable feature.
    Ranking
    9th
    Views
    2,388
    Comparisons
    2,059
    Reviews
    12
    Average Words per Review
    346
    Rating
    7.8
    4th
    Views
    8,203
    Comparisons
    6,066
    Reviews
    23
    Average Words per Review
    480
    Rating
    7.7
    Comparisons
    Also Known As
    Amazon Elastic MapReduce
    Learn More
    Overview
    Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances.

    What is Amazon Redshift?

    Amazon Redshift is a fully administered, petabyte-scale cloud-based data warehouse service. Users are able to begin with a minimal amount of gigabytes of data and can easily scale up to a petabyte or more as needed. This will enable them to utilize their own data to develop new intuitions on how to improve business processes and client relations.

    Initially, users start to develop a data warehouse by initiating what is called an Amazon Redshift cluster or a set of nodes. Once the cluster has been provisioned, users can seamlessly upload data sets, and then begin to perform data analysis queries. Amazon Redshift delivers super-fast query performance, regardless of size, utilizing the exact SQL-based tools and BI applications that most users are already working with today.

    The Amazon Redshift service performs all of the work of setting up, operating, and scaling a data warehouse. These tasks include provisioning capacity, monitoring and backing up the cluster, and applying patches and upgrades to the Amazon Redshift engine.

    Amazon Redshift Functionalities

    Amazon Redshift has many valuable key functionalities. Some of its most useful functionalities include:

    • Cluster administration: The Amazon Redshift cluster is a group of nodes that contains a leader node and one (or more) compute node(s). The compute nodes needed are dependent on the data size, amount of queries needed, and the query execution functionality desired.
    • Cluster snapshots: Snapshots are backups of a cluster from an exact point in time. Amazon Redshift offers two types of snapshots: manual and automated. Amazon will store these snapshots internally in the Amazon Simple Storage Service (Amazon S3) utilizing an SSL connection. Whenever a Snapshot restore is needed, Amazon Redshift will create a new cluster and will import data from the snapshot as directed. 
    • Cluster access: Amazon Redshift provides several intuitive features to help define connectivity rules, encrypt data and connections, and control the overall access of your cluster.
    • IAM credentials and AWS accounts: The Amazon Redshift cluster is only accessible by the AWS account that created the cluster. This automatically secures the cluster and keeps it safe. Inside the AWS account, users access the AWS Identity and IAM protocol to create additional user accounts and manage permissions, granting specified users the desired access needed to control cluster performance.
    • Encryption: Users have the option to choose to encrypt the clusters for additional added security once the cluster is provisioned. When encryption is enabled, Amazon Redshift will store all the data in user-created tables in a secure encrypted format. To manage Amazon Redshift encryption keys, users will access AWS Key Management Service (AWS KMS).

    Reviews from Real Users

    Redshift's versioning and data security are the two most critical features. When migrating into the cloud, it's vital to secure the data. The encryption and security are there.” - Kundan A., Senior Consultant at Dynamic Elements AS

    “With the cloud version whenever you want to deploy, you can scale up, and down, and it has a data warehousing capability. Redshift has many features. They have enriched and elaborate documentation that is helpful.”- Aishwarya K., Solution Architect at Capgemini

    Sample Customers
    Yelp
    Liberty Mutual Insurance, 4Cite Marketing, BrandVerity, DNA Plc, Sirocco Systems, Gainsight, Blue 449
    Top Industries
    REVIEWERS
    Computer Software Company27%
    Wholesaler/Distributor18%
    Media Company18%
    Comms Service Provider9%
    VISITORS READING REVIEWS
    Financial Services Firm23%
    Computer Software Company13%
    Manufacturing Company8%
    Educational Organization6%
    REVIEWERS
    Computer Software Company32%
    Comms Service Provider14%
    Manufacturing Company11%
    Retailer11%
    VISITORS READING REVIEWS
    Educational Organization50%
    Financial Services Firm9%
    Computer Software Company7%
    Manufacturing Company4%
    Company Size
    REVIEWERS
    Small Business26%
    Midsize Enterprise26%
    Large Enterprise47%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise11%
    Large Enterprise72%
    REVIEWERS
    Small Business40%
    Midsize Enterprise24%
    Large Enterprise37%
    VISITORS READING REVIEWS
    Small Business10%
    Midsize Enterprise54%
    Large Enterprise36%
    Buyer's Guide
    Amazon EMR vs. Amazon Redshift
    March 2024
    Find out what your peers are saying about Amazon EMR vs. Amazon Redshift and other solutions. Updated: March 2024.
    768,857 professionals have used our research since 2012.

    Amazon EMR is ranked 9th in Cloud Data Warehouse with 20 reviews while Amazon Redshift is ranked 4th in Cloud Data Warehouse with 58 reviews. Amazon EMR is rated 7.8, while Amazon Redshift is rated 7.8. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Amazon Redshift writes "Provides one place where we can store data, and allows us to easily connect to other services with AWS". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Apache Spark and Microsoft Azure Synapse Analytics, whereas Amazon Redshift is most compared with AWS Lake Formation, Snowflake, Teradata, Vertica and SAP BW4HANA. See our Amazon EMR vs. Amazon Redshift report.

    See our list of best Cloud Data Warehouse vendors.

    We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.