Amazon EC2 Auto Scaling vs Apache Spark Comparison 2024

Amazon EC2 Auto Scaling

Apache Spark

Amazon EC2 Auto Scaling

Read 39 Amazon EC2 Auto Scaling reviews

3,024 views|2,628 comparisons

Apache Spark

Read 60 Apache Spark reviews

2,793 views|2,165 comparisons

Comparison Buyer's Guide

Download the complete report

Buyer's Guide

Amazon EC2 Auto Scaling vs. Apache Spark

May 2024

Executive Summary

We performed a comparison between Amazon EC2 Auto Scaling and Apache Spark based on real PeerSpot user reviews.

Find out in this report how the two Compute Service solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.

To learn more, read our detailed Amazon EC2 Auto Scaling vs. Apache Spark Report (Updated: May 2024).

Download the complete report

772,679 professionals have used our research since 2012.

Featured Review

Kiran Rajanna

Senior Solutions Architect and Cloud Security Consultant at Cgi Deutschland KG

Integrates with load balancers and helps to reduce operational overheads

The solution incorporates ease of maintenance and reduction in operational overhead and costs. Patching is also easy.

Salvatore Campana

CEO & Founder at XAUTOMATA TECHNOLOGY GmbH

Reduces startup time and gives excellent ROI

Apache Spark helped us with horizontal scalability and cost optimizations.

Quotes From Members

We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:

Pros

"Amazon EC2 Auto Scaling operates at a different level, working in parallel to efficiently manage workload distribution. Primarily, it focuses on orchestration rather than directly managing EC2 instances for deployment and configuration. It uses automated processes to deploy and manage ports, leveraging Application Load Balancers to effectively handle data communication and management.""Can handle traffic spikes so the system doesn't overload.""The product’s most valuable feature is the seamless resizing of web connection.""The solution is scalable.""The product is flexible.""The solution is highly scalable.""The most useful feature is elasticity. You can scale up or down based on traffic.""The integration capabilities are good."

More Amazon EC2 Auto Scaling Pros →

"I feel the streaming is its best feature.""The most valuable feature of Apache Spark is its flexibility.""The product is useful for analytics.""The features we find most valuable are the machine learning, data learning, and Spark Analytics.""The most valuable feature of this solution is its capacity for processing large amounts of data.""DataFrame: Spark SQL gives the leverage to create applications more easily and with less coding effort.""The scalability has been the most valuable aspect of the solution.""The distribution of tasks, like the seamless map-reduce functionality, is quite impressive."

More Apache Spark Pros →

Cons

"It should work for the cloud, cloud monitoring features, and DevOps processes. It should automatically enable features for downscaling and upscaling.""The product should improve vertical scaling features.""The solution's configuration process could be better.""The spinning up in the solution can be much faster...The product should have a faster scalability option.""The primary area for improvement is the pricing model.""Scalability can be improved.""The launch configuration feature doesn't work properly. It needs to improve the load configuration feature along with launch templates. The tool needs to tag feature as well.""We have found that the sizing in Amazon EC2 Auto Scaling is far off. For example, we will see some at one terabyte and the other one is two terabytes. There is nothing between one and two terabytes. Sometimes it's a struggle if I need one and a half, I still am supposed to pay for two. There are five terabytes, six terabytes, and 12 terabytes, and if I need something at eight or nine, I'm still paying 30 to 40 percent more by taking the one which is 12 terabytes. Microsoft Azure does similar sizes but the gap can be more, such as six terabytes, and the next one is 12 terabytes."

More Amazon EC2 Auto Scaling Cons →

"When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources.""When using Spark, users may need to write their own parallelization logic, which requires additional effort and expertise.""When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data.""The product could improve the user interface and make it easier for new users.""Dynamic DataFrame options are not yet available.""The solution needs to optimize shuffling between workers.""The setup I worked on was really complex.""We are building our own queries on Spark, and it can be improved in terms of query handling."

More Apache Spark Cons →

Pricing and Cost Advice

"Pricing could be a little bit more competitive."

"The pricing is not fixed and it is based on usage."

"The price of this product could be a little bit lower."

"Licensing fees are paid on a yearly basis."

"I have not explored the price of the solution extensively, but from what I have seen the price is alright."

"When we want to use more services, we need to pay more. It's a monthly subscription, rather than licensed-based. Pricing or fees for Amazon EC2 Auto Scaling could be improved."

"The solution pricing varies by service region is mid-range."

"Amazon EC2 Auto Scaling uses a pay-as-you-go pricing model."

More Amazon EC2 Auto Scaling Pricing and Cost Advice →

"Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."

"Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."

"We are using the free version of the solution."

"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."

"Apache Spark is an expensive solution."

"Spark is an open-source solution, so there are no licensing costs."

"On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."

"It is an open-source solution, it is free of charge."

More Apache Spark Pricing and Cost Advice →

See Which Vendors Are Best For You

Use our free recommendation engine to learn which Compute Service solutions are best for your needs.

See Recommendations

772,679 professionals have used our research since 2012.

Questions from the Community

What do you like most about Amazon EC2 Auto Scaling?

Top Answer:The solution removes the need for hardware. We can easily create servers or machines. Just by clicking or specifying our requirements, like memory size or disk space, it's set up for us. The tool… more »

Read all 27 answers →

What is your experience regarding pricing and costs for Amazon EC2 Auto S...

Top Answer:The solution's licensing is based on a pay-as-you-go model. You only pay for the resources you use, whether it's RAM, processing power, or storage. So, it's calculated based on the time you use those… more »

Read all 19 answers →

What needs improvement with Amazon EC2 Auto Scaling?

Top Answer:The solution's pricing is expensive. You pay based on how much you use it, like paying for the time or hours you use the service. There's no need to buy hardware separately.

Read all 26 answers →

What do you like most about Apache Spark?

Top Answer:We use Spark to process data from different data sources.

Read all 30 answers →

What is your experience regarding pricing and costs for Apache Spark?

Top Answer:The solution is moderately priced.

Read all 19 answers →

What needs improvement with Apache Spark?

Top Answer:In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond, and do the transformation in a subsecond

Read all 32 answers →

Ranking

2nd

out of 16 in Compute Service

Views

3,024

Comparisons

2,628

Reviews

Average Words per Review

357

Rating

8.9

5th

out of 16 in Compute Service

Views

2,793

Comparisons

2,165

Reviews

Average Words per Review

444

Rating

8.7

Comparisons

AWS Fargate vs. Amazon EC2 Auto Scaling

Compared 58% of the time.

AWS Lambda vs. Amazon EC2 Auto Scaling

Compared 34% of the time.

AWS Batch vs. Amazon EC2 Auto Scaling

Compared 5% of the time.

Oracle Compute Cloud Service vs. Amazon EC2 Auto Scaling

Compared 2% of the time.

Amazon Elastic Inference vs. Amazon EC2 Auto Scaling

Compared 2% of the time.

More Amazon EC2 Auto Scaling Competitors →

Spring Boot vs. Apache Spark

Compared 31% of the time.

AWS Batch vs. Apache Spark

Compared 10% of the time.

Spark SQL vs. Apache Spark

Compared 9% of the time.

SAP HANA vs. Apache Spark

Compared 8% of the time.

Cloudera Distribution for Hadoop vs. Apache Spark

Compared 7% of the time.

More Apache Spark Competitors →

Also Known As

AWS RAM

Learn More

Amazon Web Services (AWS)

Apache

Overview

Amazon EC2 Auto Scaling helps you maintain application availability and allows you to automatically add or remove EC2 instances according to conditions you define. ... Dynamic scaling responds to changing demand and predictive scaling automatically schedules the right number of EC2 instances based on predicted demand.

Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

Sample Customers

Expedia, Intuit, Royal Dutch Shell, Brooks Brothers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions

Top Industries

REVIEWERS

Computer Software Company44%

Financial Services Firm16%

Comms Service Provider8%

Media Company4%

VISITORS READING REVIEWS

Financial Services Firm22%

Computer Software Company13%

University9%

Government7%

REVIEWERS

Computer Software Company33%

Financial Services Firm12%

University9%

Marketing Services Firm6%

VISITORS READING REVIEWS

Financial Services Firm25%

Computer Software Company13%

Manufacturing Company7%

Comms Service Provider5%

Company Size

REVIEWERS

Small Business33%

Midsize Enterprise15%

Large Enterprise53%

VISITORS READING REVIEWS

Small Business25%

Midsize Enterprise10%

Large Enterprise66%

REVIEWERS

Small Business42%

Midsize Enterprise16%

Large Enterprise42%

VISITORS READING REVIEWS

Small Business17%

Midsize Enterprise12%

Large Enterprise71%

Amazon EC2 Auto Scaling vs Apache Spark comparison