Amazon EC2 Auto Scaling vs Apache Spark comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
3,089 views|2,693 comparisons
100% willing to recommend
Apache Logo
2,893 views|2,256 comparisons
89% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EC2 Auto Scaling and Apache Spark based on real PeerSpot user reviews.

Find out in this report how the two Compute Service solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Amazon EC2 Auto Scaling vs. Apache Spark Report (Updated: March 2024).
770,428 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The solution is highly scalable.""The feature I found most valuable was the vertical and horizontal scaling.""The monitoring tool is helpful.""The documentation is good.""The auto-scaling feature is particularly useful. Additionally, CloudWatch and CloudTrail are also important features for us.""We appreciate that this solution allows us to run all of our severs through it, meaning that our workloads are mainly on the EC2 instance only.""The product's most valuable features are high availability and persistence.""Service for launching or terminating Amazon EC2 instances, with good scalability and stability."

More Amazon EC2 Auto Scaling Pros →

"This solution provides a clear and convenient syntax for our analytical tasks.""Apache Spark can do large volume interactive data analysis.""Features include machine learning, real time streaming, and data processing.""With Hadoop-related technologies, we can distribute the workload with multiple commodity hardware.""The features we find most valuable are the machine learning, data learning, and Spark Analytics.""Provides a lot of good documentation compared to other solutions.""The most valuable feature is the Fault Tolerance and easy binding with other processes like Machine Learning, graph analytics.""The tool's most valuable feature is its speed and efficiency. It's much faster than other tools and excels in parallel data processing. Unlike tools like Python or JavaScript, which may struggle with parallel processing, it allows us to handle large volumes of data with more power easily."

More Apache Spark Pros →

Cons
"The solution's pricing is expensive. You pay based on how much you use it, like paying for the time or hours you use the service. There's no need to buy hardware separately.""Its stability and scalability need improvement.""The support to manage the processes could be better.""What could be improved in Amazon EC2 Auto Scaling is its fees.""Scalability can be improved.""Could integrate more with other platforms.""Amazon EC2 Auto Scaling can provide more discounts when using the machines the solution uses.""The solution's configuration process could be better."

More Amazon EC2 Auto Scaling Cons →

"I know there is always discussion about which language to write applications in and some people do love Scala. However, I don't like it.""It requires overcoming a significant learning curve due to its robust and feature-rich nature.""Apache Spark provides very good performance The tuning phase is still tricky.""Apache Spark could potentially improve in terms of user-friendliness, particularly for individuals with a SQL background. While it's suitable for those with programming knowledge, making it more accessible to those without extensive programming skills could be beneficial.""We are building our own queries on Spark, and it can be improved in terms of query handling.""We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time.""It should support more programming languages.""The solution’s integration with other platforms should be improved."

More Apache Spark Cons →

Pricing and Cost Advice
  • "Pricing could be a little bit more competitive."
  • "The pricing is not fixed and it is based on usage."
  • "The price of this product could be a little bit lower."
  • "Licensing fees are paid on a yearly basis."
  • "I have not explored the price of the solution extensively, but from what I have seen the price is alright."
  • "When we want to use more services, we need to pay more. It's a monthly subscription, rather than licensed-based. Pricing or fees for Amazon EC2 Auto Scaling could be improved."
  • "The solution pricing varies by service region is mid-range."
  • "Amazon EC2 Auto Scaling uses a pay-as-you-go pricing model."
  • More Amazon EC2 Auto Scaling Pricing and Cost Advice →

  • "Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
  • "Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
  • "We are using the free version of the solution."
  • "Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
  • "Apache Spark is an expensive solution."
  • "Spark is an open-source solution, so there are no licensing costs."
  • "On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
  • "It is an open-source solution, it is free of charge."
  • More Apache Spark Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
    770,428 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The solution removes the need for hardware. We can easily create servers or machines. Just by clicking or specifying our requirements, like memory size or disk space, it's set up for us. The tool… more »
    Top Answer:The solution's licensing is based on a pay-as-you-go model. You only pay for the resources you use, whether it's RAM, processing power, or storage. So, it's calculated based on the time you use those… more »
    Top Answer:The solution's pricing is expensive. You pay based on how much you use it, like paying for the time or hours you use the service. There's no need to buy hardware separately.
    Top Answer:We use Spark to process data from different data sources.
    Top Answer:In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond, and do the transformation in a subsecond
    Ranking
    2nd
    out of 16 in Compute Service
    Views
    3,089
    Comparisons
    2,693
    Reviews
    31
    Average Words per Review
    327
    Rating
    9.0
    5th
    out of 16 in Compute Service
    Views
    2,893
    Comparisons
    2,256
    Reviews
    26
    Average Words per Review
    444
    Rating
    8.7
    Comparisons
    Also Known As
    AWS RAM
    Learn More
    Overview

    Amazon EC2 Auto Scaling helps you maintain application availability and allows you to automatically add or remove EC2 instances according to conditions you define. ... Dynamic scaling responds to changing demand and predictive scaling automatically schedules the right number of EC2 instances based on predicted demand.

    Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

    Sample Customers
    Expedia, Intuit, Royal Dutch Shell, Brooks Brothers
    NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
    Top Industries
    REVIEWERS
    Computer Software Company44%
    Financial Services Firm16%
    Comms Service Provider8%
    Media Company4%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company14%
    University8%
    Government7%
    REVIEWERS
    Computer Software Company30%
    Financial Services Firm15%
    University9%
    Marketing Services Firm6%
    VISITORS READING REVIEWS
    Financial Services Firm25%
    Computer Software Company13%
    Manufacturing Company7%
    Comms Service Provider6%
    Company Size
    REVIEWERS
    Small Business33%
    Midsize Enterprise15%
    Large Enterprise53%
    VISITORS READING REVIEWS
    Small Business25%
    Midsize Enterprise10%
    Large Enterprise65%
    REVIEWERS
    Small Business40%
    Midsize Enterprise18%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise12%
    Large Enterprise71%
    Buyer's Guide
    Amazon EC2 Auto Scaling vs. Apache Spark
    March 2024
    Find out what your peers are saying about Amazon EC2 Auto Scaling vs. Apache Spark and other solutions. Updated: March 2024.
    770,428 professionals have used our research since 2012.

    Amazon EC2 Auto Scaling is ranked 2nd in Compute Service with 39 reviews while Apache Spark is ranked 5th in Compute Service with 60 reviews. Amazon EC2 Auto Scaling is rated 8.8, while Apache Spark is rated 8.4. The top reviewer of Amazon EC2 Auto Scaling writes "Well-documented setup process and highly stable solution". On the other hand, the top reviewer of Apache Spark writes "Reliable, able to expand, and handle large amounts of data well". Amazon EC2 Auto Scaling is most compared with AWS Fargate, AWS Lambda, AWS Batch, Oracle Compute Cloud Service and Amazon Elastic Inference, whereas Apache Spark is most compared with Spring Boot, AWS Batch, Spark SQL, SAP HANA and Cloudera Distribution for Hadoop. See our Amazon EC2 Auto Scaling vs. Apache Spark report.

    See our list of best Compute Service vendors.

    We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.