We performed a comparison between Amazon EC2 and Apache Spark based on real PeerSpot user reviews.
Find out in this report how the two Compute Service solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."The solution offers good access policies."
"The product is easy to set up."
"Configuration can be changed at any time and it's very scalable."
"We don't have to worry about scalability issues or maintenance or security. It's all taken care of."
"EC2 has the typical advantages of using the cloud. It's easy to provision and set up."
"An advantage of Amazon is that it offers a wide range of infrastructure services with an easy way to configure them."
"The most valuable feature is EC2 is scalable, so when you want to move to market, you don't need to wait until your provision is fast, you can just go and provision it and then easily install your application."
"The scalability of Amazon EC2 is good. However, the stability can depend on what service I am using."
"Now, when we're tackling sentiment analysis using NLP technologies, we deal with unstructured data—customer chats, feedback on promotions or demos, and even media like images, audio, and video files. For processing such data, we rely on PySpark. Beneath the surface, Spark functions as a compute engine with in-memory processing capabilities, enhancing performance through features like broadcasting and caching. It's become a crucial tool, widely adopted by 90% of companies for a decade or more."
"The processing time is very much improved over the data warehouse solution that we were using."
"It is useful for handling large amounts of data. It is very useful for scientific purposes."
"The solution has been very stable."
"It's easy to prepare parallelism in Spark, run the solution with specific parameters, and get good performance."
"The product’s most valuable feature is the SQL tool. It enables us to create a database and publish it."
"The deployment of the product is easy."
"The product is useful for analytics."
"They have to provide clarity on pricing. It's not transparent."
"Amazon EC2 could improve the console view. The ability to see the console view directly would be helpful, similar to what VMware has. Additionally, when the system is rebooting we are able to see a screenshot of the UI, but it would be a lot better if we could interact directly with the console level."
"Amazon EC2 could improve by reducing the price."
"Nothing is really missing in terms of features."
"I would like to see improvement in the information available up-front for users around tailoring the package to their actual requirements. At present it can take time to work with the on demand instance until you are used to what features are right for the user."
"We're expecting to have Graviton instances. Graviton means it's not internal, it's a low-cost instance. At present time, Graviton is not supported for a few packages."
"Its price can be reduced."
"In terms of improvement, they could build some client-side desktop tools that provide easier connectivity to Amazon."
"Its UI can be better. Maintaining the history server is a little cumbersome, and it should be improved. I had issues while looking at the historical tags, which sometimes created problems. You have to separately create a history server and run it. Such things can be made easier. Instead of separately installing the history server, it can be made a part of the whole setup so that whenever you set it up, it becomes available."
"Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing."
"When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data."
"If you have a Spark session in the background, sometimes it's very hard to kill these sessions because of D allocation."
"Technical expertise from an engineer is required to deploy and run high-tech tools, like Informatica, on Apache Spark, making it an area where improvements are required to make the process easier for users."
"We are building our own queries on Spark, and it can be improved in terms of query handling."
"The setup I worked on was really complex."
"When you are working with large, complex tasks, the garbage collection process is slow and affects performance."
Amazon EC2 is ranked 3rd in Compute Service with 56 reviews while Apache Spark is ranked 5th in Compute Service with 58 reviews. Amazon EC2 is rated 8.6, while Apache Spark is rated 8.4. The top reviewer of Amazon EC2 writes "Highly stable, is auto-scaling, and can be utilized in under five minutes". On the other hand, the top reviewer of Apache Spark writes "Reliable, able to expand, and handle large amounts of data well". Amazon EC2 is most compared with AWS Fargate, AWS Lambda, Apache NiFi, AWS Batch and Google App Engine, whereas Apache Spark is most compared with Spring Boot, AWS Batch, Spark SQL, SAP HANA and Jakarta EE. See our Amazon EC2 vs. Apache Spark report.
See our list of best Compute Service vendors.
We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.