Amazon EC2 Auto Scaling vs Apache Spark comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
3,024 views|2,628 comparisons
100% willing to recommend
Apache Logo
2,793 views|2,165 comparisons
89% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EC2 Auto Scaling and Apache Spark based on real PeerSpot user reviews.

Find out in this report how the two Compute Service solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Amazon EC2 Auto Scaling vs. Apache Spark Report (Updated: May 2024).
772,679 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Amazon EC2 Auto Scaling operates at a different level, working in parallel to efficiently manage workload distribution. Primarily, it focuses on orchestration rather than directly managing EC2 instances for deployment and configuration. It uses automated processes to deploy and manage ports, leveraging Application Load Balancers to effectively handle data communication and management.""Can handle traffic spikes so the system doesn't overload.""The product’s most valuable feature is the seamless resizing of web connection.""The solution is scalable.""The product is flexible.""The solution is highly scalable.""The most useful feature is elasticity. You can scale up or down based on traffic.""The integration capabilities are good."

More Amazon EC2 Auto Scaling Pros →

"I feel the streaming is its best feature.""The most valuable feature of Apache Spark is its flexibility.""The product is useful for analytics.""The features we find most valuable are the machine learning, data learning, and Spark Analytics.""The most valuable feature of this solution is its capacity for processing large amounts of data.""DataFrame: Spark SQL gives the leverage to create applications more easily and with less coding effort.""The scalability has been the most valuable aspect of the solution.""The distribution of tasks, like the seamless map-reduce functionality, is quite impressive."

More Apache Spark Pros →

Cons
"It should work for the cloud, cloud monitoring features, and DevOps processes. It should automatically enable features for downscaling and upscaling.""The product should improve vertical scaling features.""The solution's configuration process could be better.""The spinning up in the solution can be much faster...The product should have a faster scalability option.""The primary area for improvement is the pricing model.""Scalability can be improved.""The launch configuration feature doesn't work properly. It needs to improve the load configuration feature along with launch templates. The tool needs to tag feature as well.""We have found that the sizing in Amazon EC2 Auto Scaling is far off. For example, we will see some at one terabyte and the other one is two terabytes. There is nothing between one and two terabytes. Sometimes it's a struggle if I need one and a half, I still am supposed to pay for two. There are five terabytes, six terabytes, and 12 terabytes, and if I need something at eight or nine, I'm still paying 30 to 40 percent more by taking the one which is 12 terabytes. Microsoft Azure does similar sizes but the gap can be more, such as six terabytes, and the next one is 12 terabytes."

More Amazon EC2 Auto Scaling Cons →

"When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources.""When using Spark, users may need to write their own parallelization logic, which requires additional effort and expertise.""When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data.""The product could improve the user interface and make it easier for new users.""Dynamic DataFrame options are not yet available.""The solution needs to optimize shuffling between workers.""The setup I worked on was really complex.""We are building our own queries on Spark, and it can be improved in terms of query handling."

More Apache Spark Cons →

Pricing and Cost Advice
  • "Pricing could be a little bit more competitive."
  • "The pricing is not fixed and it is based on usage."
  • "The price of this product could be a little bit lower."
  • "Licensing fees are paid on a yearly basis."
  • "I have not explored the price of the solution extensively, but from what I have seen the price is alright."
  • "When we want to use more services, we need to pay more. It's a monthly subscription, rather than licensed-based. Pricing or fees for Amazon EC2 Auto Scaling could be improved."
  • "The solution pricing varies by service region is mid-range."
  • "Amazon EC2 Auto Scaling uses a pay-as-you-go pricing model."
  • More Amazon EC2 Auto Scaling Pricing and Cost Advice →

  • "Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
  • "Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
  • "We are using the free version of the solution."
  • "Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
  • "Apache Spark is an expensive solution."
  • "Spark is an open-source solution, so there are no licensing costs."
  • "On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
  • "It is an open-source solution, it is free of charge."
  • More Apache Spark Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
    772,679 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The solution removes the need for hardware. We can easily create servers or machines. Just by clicking or specifying our requirements, like memory size or disk space, it's set up for us. The tool… more »
    Top Answer:The solution's licensing is based on a pay-as-you-go model. You only pay for the resources you use, whether it's RAM, processing power, or storage. So, it's calculated based on the time you use those… more »
    Top Answer:The solution's pricing is expensive. You pay based on how much you use it, like paying for the time or hours you use the service. There's no need to buy hardware separately.
    Top Answer:We use Spark to process data from different data sources.
    Top Answer:In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond, and do the transformation in a subsecond
    Ranking
    2nd
    out of 16 in Compute Service
    Views
    3,024
    Comparisons
    2,628
    Reviews
    33
    Average Words per Review
    357
    Rating
    8.9
    5th
    out of 16 in Compute Service
    Views
    2,793
    Comparisons
    2,165
    Reviews
    26
    Average Words per Review
    444
    Rating
    8.7
    Comparisons
    Also Known As
    AWS RAM
    Learn More
    Overview

    Amazon EC2 Auto Scaling helps you maintain application availability and allows you to automatically add or remove EC2 instances according to conditions you define. ... Dynamic scaling responds to changing demand and predictive scaling automatically schedules the right number of EC2 instances based on predicted demand.

    Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

    Sample Customers
    Expedia, Intuit, Royal Dutch Shell, Brooks Brothers
    NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
    Top Industries
    REVIEWERS
    Computer Software Company44%
    Financial Services Firm16%
    Comms Service Provider8%
    Media Company4%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company13%
    University9%
    Government7%
    REVIEWERS
    Computer Software Company33%
    Financial Services Firm12%
    University9%
    Marketing Services Firm6%
    VISITORS READING REVIEWS
    Financial Services Firm25%
    Computer Software Company13%
    Manufacturing Company7%
    Comms Service Provider5%
    Company Size
    REVIEWERS
    Small Business33%
    Midsize Enterprise15%
    Large Enterprise53%
    VISITORS READING REVIEWS
    Small Business25%
    Midsize Enterprise10%
    Large Enterprise66%
    REVIEWERS
    Small Business42%
    Midsize Enterprise16%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise12%
    Large Enterprise71%
    Buyer's Guide
    Amazon EC2 Auto Scaling vs. Apache Spark
    May 2024
    Find out what your peers are saying about Amazon EC2 Auto Scaling vs. Apache Spark and other solutions. Updated: May 2024.
    772,679 professionals have used our research since 2012.

    Amazon EC2 Auto Scaling is ranked 2nd in Compute Service with 39 reviews while Apache Spark is ranked 5th in Compute Service with 60 reviews. Amazon EC2 Auto Scaling is rated 8.8, while Apache Spark is rated 8.4. The top reviewer of Amazon EC2 Auto Scaling writes "Well-documented setup process and highly stable solution". On the other hand, the top reviewer of Apache Spark writes "Reliable, able to expand, and handle large amounts of data well". Amazon EC2 Auto Scaling is most compared with AWS Fargate, AWS Lambda, AWS Batch, Oracle Compute Cloud Service and Amazon Elastic Inference, whereas Apache Spark is most compared with Spring Boot, AWS Batch, Spark SQL, SAP HANA and Cloudera Distribution for Hadoop. See our Amazon EC2 Auto Scaling vs. Apache Spark report.

    See our list of best Compute Service vendors.

    We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.