Amazon EC2 vs Apache Spark vs Azure Stream Analytics comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
2,594 views|1,667 comparisons
98% willing to recommend
Apache Logo
3,093 views|2,345 comparisons
89% willing to recommend
Microsoft Logo
9,925 views|8,426 comparisons
95% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Amazon EC2, Apache Spark, and Azure Stream Analytics based on real PeerSpot user reviews.

Find out what your peers are saying about Amazon Web Services (AWS), Apache, Zadara and others in Compute Service.
To learn more, read our detailed Compute Service Report (Updated: April 2024).
767,847 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The product helps us with scalability. We also need not have data centers.""The most valuable feature of Amazon EC2 is its ability to spin a new virtual machine in a few seconds.""The flexibility of the security features is what is interesting.""The most valuable feature of this solution is the ability to have standard operating systems along with the Windows, Linux operating systems, and their maintenance-free structure, which we prefer.""The ethernet configuration is stable and the product is reliable.""The most valuable features of Amazon EC2 are content delivery and adaptability.""The initial setup is straightforward.""I use it for NextCloud and for developer purposes."

More Amazon EC2 Pros →

"The most valuable feature is the Fault Tolerance and easy binding with other processes like Machine Learning, graph analytics.""I appreciate everything about the solution, not just one or two specific features. The solution is highly stable. I rate it a perfect ten. The solution is highly scalable. I rate it a perfect ten. The initial setup was straightforward. I recommend using the solution. Overall, I rate the solution a perfect ten.""With Hadoop-related technologies, we can distribute the workload with multiple commodity hardware.""The most crucial feature for us is the streaming capability. It serves as a fundamental aspect that allows us to exert control over our operations.""The fault tolerant feature is provided.""ETL and streaming capabilities.""The solution has been very stable.""Now, when we're tackling sentiment analysis using NLP technologies, we deal with unstructured data—customer chats, feedback on promotions or demos, and even media like images, audio, and video files. For processing such data, we rely on PySpark. Beneath the surface, Spark functions as a compute engine with in-memory processing capabilities, enhancing performance through features like broadcasting and caching. It's become a crucial tool, widely adopted by 90% of companies for a decade or more."

More Apache Spark Pros →

"The solution's most valuable feature is its ability to create a query using SQ.""The integrations for this solution are easy to use and there is flexibility in integrating the tool with Azure Stream Analytics.""We find the query editor feature of this solution extremely valuable for our business.""I like all the connected ecosystems of Microsoft, it is really good with other BI tools that are easy to connect.""The solution has a lot of functionality that can be pushed out to companies.""The most valuable features are the IoT hub and the Blob storage.""Provides deep integration with other Azure resources.""It provides the capability to streamline multiple output components."

More Azure Stream Analytics Pros →

Cons
"EC2 could be improved with easier migration.""It is a little too expensive.""An area for improvement in Amazon EC2 is the cost because it's a bit higher than competitor pricing.""In terms of improvement, they could build some client-side desktop tools that provide easier connectivity to Amazon.""The initial setup could be easier because many keys are required for access.""The ease of migrating applications could be improved.""Amazon EC2 could improve by reducing the price.""Pricing-wise, it is a bit high."

More Amazon EC2 Cons →

"The setup I worked on was really complex.""We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time.""Apache Spark could potentially improve in terms of user-friendliness, particularly for individuals with a SQL background. While it's suitable for those with programming knowledge, making it more accessible to those without extensive programming skills could be beneficial.""I would like to see integration with data science platforms to optimize the processing capability for these tasks.""Stability in terms of API (things were difficult, when transitioning from RDD to DataFrames, then to DataSet).""When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources.""The initial setup was not easy.""The graphical user interface (UI) could be a bit more clear. It's very hard to figure out the execution logs and understand how long it takes to send everything. If an execution is lost, it's not so easy to understand why or where it went. I have to manually drill down on the data processes which takes a lot of time. Maybe there could be like a metrics monitor, or maybe the whole log analysis could be improved to make it easier to understand and navigate."

More Apache Spark Cons →

"Early in the process, we had some issues with stability.""Sometimes when we connect Power BI, there is a delay or it throws up some errors, so we're not sure.""If something goes wrong, it's very hard to investigate what caused it and why.""There may be some issues when connecting with Microsoft Power BI because we are providing the input and output commands, and there's a chance of it being delayed while connecting.""The solution’s customer support could be improved.""One area that could use improvement is the handling of data validation. Currently, there is a review process, but sometimes the validation fails even before the job is executed. This results in wasted time as we have to rerun the job to identify the failure.""Its features for event imports and architecture could be enhanced.""The solution doesn't handle large data packets very efficiently, which could be improved upon."

More Azure Stream Analytics Cons →

Pricing and Cost Advice
  • "Pricing appears to be cheap, however, it is extremely difficult in calculating what something will cost."
  • "It has helped to reduce costs with infrastructure."
  • "EC2 pricing is somewhat transparent, in that AWS provides pricing for all instance types. However, the number of pricing options can be confusing."
  • "For our usage, the cost is approximately $20,000 to $23,000 per month."
  • "There is a license required to use this solution and we pay on a monthly basis."
  • "The price is reasonable, but there is definitely an opportunity to lower it in instances which are of a higher configuration, because they have been typically used for the long term."
  • "Amazon EC2 has a pay-as-you-use cost model."
  • "The clients have found the billing of Amazon EC2 good, but the price could be less high. There is a monthly subscription to use the solution."
  • More Amazon EC2 Pricing and Cost Advice →

  • "Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
  • "Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
  • "We are using the free version of the solution."
  • "Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
  • "Apache Spark is an expensive solution."
  • "Spark is an open-source solution, so there are no licensing costs."
  • "On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
  • "It is an open-source solution, it is free of charge."
  • More Apache Spark Pricing and Cost Advice →

  • "The cost of this solution is less than competitors such as Amazon or Google Cloud."
  • "We pay approximately $500,000 a year. It's approximately $10,000 a year per license."
  • "I rate the price of Azure Stream Analytics a four out of five."
  • "The licensing for this product is payable on a 'pay as you go' basis. This means that the cost is only based on data volume, and the frequency that the solution is used."
  • "There are different tiers based on retention policies. There are four tiers. The pricing varies based on steaming units and tiers. The standard pricing is $10/hour."
  • "The current price is substantial."
  • "Azure Stream Analytics is a little bit expensive."
  • "The product's price is at par with the other solutions provided by the other cloud service providers in the market."
  • More Azure Stream Analytics Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
    767,847 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon EC2 is really reliable and provides great flexibility.
    Top Answer:The solution has different pricing models, and its cost differs when you purchase it for one year or three years.
    Top Answer:The solution’s pricing and downtimes could be improved. I would like to have a better pricing model for Amazon EC2… more »
    Top Answer:We use Spark to process data from different data sources.
    Top Answer:In data analysis, you need to take real-time data from different data sources. You need to process this in a subsecond… more »
    Top Answer:Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics… more »
    Top Answer:The product's price is at par with the other solutions provided by the other cloud service providers in the market.
    Top Answer:Azure Stream Analytics was not meeting our company's expectations because it was tedious to change the job, write… more »
    Ranking
    4th
    out of 16 in Compute Service
    Views
    2,594
    Comparisons
    1,667
    Reviews
    46
    Average Words per Review
    347
    Rating
    8.6
    5th
    out of 16 in Compute Service
    Views
    3,093
    Comparisons
    2,345
    Reviews
    25
    Average Words per Review
    432
    Rating
    8.7
    4th
    out of 38 in Streaming Analytics
    Views
    9,925
    Comparisons
    8,426
    Reviews
    13
    Average Words per Review
    405
    Rating
    8.2
    Comparisons
    Also Known As
    Amazon Elastic Compute Cloud, EC2
    ASA
    Learn More
    Overview

    Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.

    Amazon EC2’s simple web service interface allows you to obtain and configure capacity with minimal friction. It provides you with complete control of your computing resources and lets you run on Amazon’s proven computing environment. Amazon EC2 reduces the time required to obtain and boot new server instances to minutes, allowing you to quickly scale capacity, both up and down, as your computing requirements change. Amazon EC2 changes the economics of computing by allowing you to pay only for capacity that you actually use. Amazon EC2 provides developers the tools to build failure resilient applications and isolate them from common failure scenarios.

    Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

    Azure Stream Analytics is a robust real-time analytics service that has been designed for critical business workloads. Users are able to build an end-to-end serverless streaming pipeline in minutes. Utilizing SQL, users are able to go from zero to production with a few clicks, all easily extensible with unique code and automatic machine learning abilities for the most advanced scenarios.

    Azure Stream Analytics has the ability to analyze and accurately process exorbitant volumes of high-speed streaming data from numerous sources at the same time. Patterns and scenarios are quickly identified and information is gathered from various input sources, such as social media feeds, applications, clickstreams, sensors, and devices. These patterns can then be implemented to trigger actions and launch workflows, such as feeding data to a reporting tool, storing data for later use, or creating alerts. Azure Stream Analytics is also offered on Azure IoT Edge runtime, so the data can be processed on IoT devices.

    Top Benefits

    • User friendly: Azure Stream Analytics is very straightforward and easy to use. Out of the box and with a few clicks, users are able to connect to numerous sources and sinks, and easily develop an end-to-end pipeline. Stream Analytics can easily connect to Azure IoT Hub and Azure Event Hub for streaming ingestion, in addition to connecting with Azure Blob storage for historical data ingestion.

    • Flexible deployment: For low-latency analytics, Azure Stream Analytics can run on Azure Stack or IoT edge. For large-scale analytics, the solution can run in the cloud. Azure Stream Analytics uses the same query language and tools for both the cloud and the edge, facilitating an easier process for developers to design exceptional hybrid architectures for streaming processes.

    • Cost-effective: With Azure Stream Analytics, users only pay for the streaming units they consume; there are no upfront costs. Users can easily scale up or down as needed; there is no commitment or cluster provisioning.

    • Trustworthy: Azure Stream Analytics guarantees event processing to be 99.99% available with a minute level of granularity. Azure Stream Analytics has embedded recovery capabilities and checkpoints to keep things running smoothly at all times. Events are never lost with Azure Stream Analytics at-least once delivery of events and exactly one event processing.

    Reviews from Real Users

    “Azure Stream Analytics is something that you can use to test out streaming scenarios very quickly in the general sense and it is useful for IoT scenarios. If I was to do a project with IoT and I needed a streaming solution, Azure Stream Analytics would be a top choice. The most valuable features of Azure Stream Analytics are the ease of provisioning and the interface is not terribly complex.” - Olubisi A., Team Lead at a tech services company.

    “It's used primarily for data and mining - everything from the telemetry data side of things. It's great for streaming and makes everything easy to handle. The streaming from the IoT hub and the messaging are aspects I like a lot.” - Sudhendra U., Technical Architect at Infosys

    Sample Customers
    Netflix, Expedia, TimeInc., Novaris, airbnb, Lamborghini
    NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
    Rockwell Automation, Milliman, Honeywell Building Solutions, Arcoflex Automation Solutions, Real Madrid C.F., Aerocrine, Ziosk, Tacoma Public Schools, P97 Networks
    Top Industries
    REVIEWERS
    Computer Software Company32%
    Financial Services Firm16%
    Comms Service Provider12%
    Government8%
    VISITORS READING REVIEWS
    Financial Services Firm20%
    Computer Software Company18%
    Educational Organization7%
    Manufacturing Company6%
    REVIEWERS
    Computer Software Company30%
    Financial Services Firm15%
    University9%
    Marketing Services Firm6%
    VISITORS READING REVIEWS
    Financial Services Firm25%
    Computer Software Company13%
    Manufacturing Company7%
    Comms Service Provider6%
    REVIEWERS
    Computer Software Company27%
    Manufacturing Company18%
    Government9%
    Transportation Company9%
    VISITORS READING REVIEWS
    Computer Software Company15%
    Financial Services Firm12%
    Manufacturing Company8%
    Comms Service Provider5%
    Company Size
    REVIEWERS
    Small Business45%
    Midsize Enterprise16%
    Large Enterprise38%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise12%
    Large Enterprise68%
    REVIEWERS
    Small Business40%
    Midsize Enterprise19%
    Large Enterprise40%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise12%
    Large Enterprise71%
    REVIEWERS
    Small Business24%
    Midsize Enterprise10%
    Large Enterprise67%
    VISITORS READING REVIEWS
    Small Business20%
    Midsize Enterprise11%
    Large Enterprise69%
    Buyer's Guide
    Compute Service
    April 2024
    Find out what your peers are saying about Amazon Web Services (AWS), Apache, Zadara and others in Compute Service. Updated: April 2024.
    767,847 professionals have used our research since 2012.