Amazon EMR vs Amazon Redshift comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
2,342 views|2,016 comparisons
85% willing to recommend
Amazon Web Services (AWS) Logo
7,785 views|5,798 comparisons
87% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EMR and Amazon Redshift based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Amazon EMR vs. Amazon Redshift Report (Updated: May 2024).
771,212 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions.""It has a variety of options and support systems.""The solution is pretty simple to set up.""It allows users to access the data through a web interface.""The project management is very streamlined.""This is the best tool for hosts and it's really flexible and scalable.""The initial setup is straightforward.""In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."

More Amazon EMR Pros →

"Changing from local servers to the cloud is very easy. It's so nice not to have to worry about physical servers.""It is quite simple to use and there are no issues with creating the tables.""Redshift COPY command, because much of my work involved helping customers migrate large amounts of data into Redshift.""If the analyst knows SQL, which is comfortable and easy to use to go between all of these tool stacks, I think it's reliable. It's a secure and reliable data warehouse.""The solution's flexibility is its most valuable feature. It's also easy to scale and has relatively painless pricing.""The most valuable features of Amazon Redshift are that its fast and efficient. We have lots of TBs of data and it's very fast.""I like it because the usage is very similar to Microsoft SQL server. The structure of the query and the temporary tables are very similar.""It's very easy to migrate from other databases to Redshift. There are migration tools dedicated for this purpose, enabling migration from other databases like MS SQL directly to Redshift. The majority of the scripts will be automatically transposed."

More Amazon Redshift Pros →

Cons
"The most complicated thing is configuring to the cluster and ensure it's running correctly.""The dashboard management could be better. Right now, it's lacking a bit.""As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.""The legacy versions of the solution are not supported in the new versions.""There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange.""There is no need to pay extra for third-party software.""The initial setup was time-consuming.""There is room for improvement in pricing."

More Amazon EMR Cons →

"In our experiments, the handling of unstructured data was not very smooth.""The explain panel in the Redshift database could be better.""The product must become a bit more serverless.""It would be good to see Redshift as a serverless offering.""Pricing is one of the things that it could improve. It should be more competitive.""The initial deployment was complex.""AWS Snowflake has a very good feature for cloning databases. It makes it easy to clone a data warehouse, which is useful. I would like to see this feature in Redshift.""Migrating data from other data sources can be challenging when you are working with multibyte character sets."

More Amazon Redshift Cons →

Pricing and Cost Advice
  • "You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
  • "The cost of Amazon EMR is very high."
  • "The price of the solution is expensive."
  • "Amazon EMR's price is reasonable."
  • "There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
  • "There is no need to pay extra for third-party software."
  • "Amazon EMR is not very expensive."
  • "The product is not cheap, but it is not expensive."
  • More Amazon EMR Pricing and Cost Advice →

  • "Redshift is very cost effective for a cloud based solution if you need to scale it a lot. For smaller data sizes, I would think about using other products."
  • "If you want a fixed price, an to not worry about every query, but you need to manage your nodes personally, use Redshift."
  • "BI is sold to our customer base as a part of the initial sales bundle. A customer may elect to opt for a white labeled site for an up-charge."
  • "One of my customers went with Google Big Query over Redshift because it was significantly cheaper for their project."
  • "Per hour pricing is helpful to keep the costs of a pilot down, but long-term retention is expensive."
  • "It's around $200 US dollars. There are some data transfer costs but it's minimal, around $20."
  • "The best part about this solution is the cost."
  • "The part that I like best is that you only pay for what you are using."
  • More Amazon Redshift Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
    771,212 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon EMR is a good solution that can be used to manage big data.
    Top Answer:As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.
    Top Answer:Amazon Redshift is very fast, has a very good response time, and is very user-friendly. The initial setup is very straightforward. This solution can merge and integrate well with many different… more »
    Top Answer:The tool's most valuable feature is its parallel processing capability. It can handle massive amounts of data, even when pushing hundreds of terabytes, and its scaling capabilities are good.
    Ranking
    8th
    Views
    2,342
    Comparisons
    2,016
    Reviews
    12
    Average Words per Review
    346
    Rating
    7.8
    4th
    Views
    7,785
    Comparisons
    5,798
    Reviews
    25
    Average Words per Review
    497
    Rating
    7.7
    Comparisons
    Also Known As
    Amazon Elastic MapReduce
    Learn More
    Overview
    Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances.

    What is Amazon Redshift?

    Amazon Redshift is a fully administered, petabyte-scale cloud-based data warehouse service. Users are able to begin with a minimal amount of gigabytes of data and can easily scale up to a petabyte or more as needed. This will enable them to utilize their own data to develop new intuitions on how to improve business processes and client relations.

    Initially, users start to develop a data warehouse by initiating what is called an Amazon Redshift cluster or a set of nodes. Once the cluster has been provisioned, users can seamlessly upload data sets, and then begin to perform data analysis queries. Amazon Redshift delivers super-fast query performance, regardless of size, utilizing the exact SQL-based tools and BI applications that most users are already working with today.

    The Amazon Redshift service performs all of the work of setting up, operating, and scaling a data warehouse. These tasks include provisioning capacity, monitoring and backing up the cluster, and applying patches and upgrades to the Amazon Redshift engine.

    Amazon Redshift Functionalities

    Amazon Redshift has many valuable key functionalities. Some of its most useful functionalities include:

    • Cluster administration: The Amazon Redshift cluster is a group of nodes that contains a leader node and one (or more) compute node(s). The compute nodes needed are dependent on the data size, amount of queries needed, and the query execution functionality desired.
    • Cluster snapshots: Snapshots are backups of a cluster from an exact point in time. Amazon Redshift offers two types of snapshots: manual and automated. Amazon will store these snapshots internally in the Amazon Simple Storage Service (Amazon S3) utilizing an SSL connection. Whenever a Snapshot restore is needed, Amazon Redshift will create a new cluster and will import data from the snapshot as directed. 
    • Cluster access: Amazon Redshift provides several intuitive features to help define connectivity rules, encrypt data and connections, and control the overall access of your cluster.
    • IAM credentials and AWS accounts: The Amazon Redshift cluster is only accessible by the AWS account that created the cluster. This automatically secures the cluster and keeps it safe. Inside the AWS account, users access the AWS Identity and IAM protocol to create additional user accounts and manage permissions, granting specified users the desired access needed to control cluster performance.
    • Encryption: Users have the option to choose to encrypt the clusters for additional added security once the cluster is provisioned. When encryption is enabled, Amazon Redshift will store all the data in user-created tables in a secure encrypted format. To manage Amazon Redshift encryption keys, users will access AWS Key Management Service (AWS KMS).

    Reviews from Real Users

    Redshift's versioning and data security are the two most critical features. When migrating into the cloud, it's vital to secure the data. The encryption and security are there.” - Kundan A., Senior Consultant at Dynamic Elements AS

    “With the cloud version whenever you want to deploy, you can scale up, and down, and it has a data warehousing capability. Redshift has many features. They have enriched and elaborate documentation that is helpful.”- Aishwarya K., Solution Architect at Capgemini

    Sample Customers
    Yelp
    Liberty Mutual Insurance, 4Cite Marketing, BrandVerity, DNA Plc, Sirocco Systems, Gainsight, Blue 449
    Top Industries
    REVIEWERS
    Computer Software Company27%
    Wholesaler/Distributor18%
    Media Company18%
    Comms Service Provider9%
    VISITORS READING REVIEWS
    Financial Services Firm23%
    Computer Software Company13%
    Manufacturing Company8%
    Educational Organization6%
    REVIEWERS
    Computer Software Company34%
    Comms Service Provider14%
    Manufacturing Company10%
    Retailer10%
    VISITORS READING REVIEWS
    Educational Organization51%
    Financial Services Firm9%
    Computer Software Company7%
    Manufacturing Company4%
    Company Size
    REVIEWERS
    Small Business26%
    Midsize Enterprise26%
    Large Enterprise47%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise12%
    Large Enterprise72%
    REVIEWERS
    Small Business38%
    Midsize Enterprise25%
    Large Enterprise37%
    VISITORS READING REVIEWS
    Small Business10%
    Midsize Enterprise55%
    Large Enterprise35%
    Buyer's Guide
    Amazon EMR vs. Amazon Redshift
    May 2024
    Find out what your peers are saying about Amazon EMR vs. Amazon Redshift and other solutions. Updated: May 2024.
    771,212 professionals have used our research since 2012.

    Amazon EMR is ranked 8th in Cloud Data Warehouse with 20 reviews while Amazon Redshift is ranked 4th in Cloud Data Warehouse with 61 reviews. Amazon EMR is rated 7.8, while Amazon Redshift is rated 7.8. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Amazon Redshift writes "Provides one place where we can store data, and allows us to easily connect to other services with AWS". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Apache Spark and Microsoft Azure Synapse Analytics, whereas Amazon Redshift is most compared with Teradata, Snowflake, AWS Lake Formation, Vertica and SAP BW4HANA. See our Amazon EMR vs. Amazon Redshift report.

    See our list of best Cloud Data Warehouse vendors.

    We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.