We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"Apache Spark is open-source. You have to pay only when you use any bundled product, such as Cloudera."
Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines and maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflow structure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory.
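To make the contrast concrete, here is a minimal single-machine sketch in plain Python. It is not Spark or Hadoop code. The function names (`map_phase`, `reduce_phase`, `mapreduce_job`, `working_set_queries`) are hypothetical and chosen only for illustration. The first job follows the linear disk, map, reduce, disk flow described above; the second loads the data once and runs several computations against the same read-only in-memory collection, which is roughly the role an RDD plays as a working set.

```python
import json
import os
import tempfile
from functools import reduce


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every line."""
    return [(word.lower(), 1) for line in lines for word in line.split()]


def reduce_phase(pairs):
    """Reduce: sum the counts emitted for each word."""
    def merge(acc, pair):
        word, count = pair
        acc[word] = acc.get(word, 0) + count
        return acc
    return reduce(merge, pairs, {})


def mapreduce_job(input_path, output_path):
    """One linear pass: read from disk, map, reduce, write to disk."""
    with open(input_path) as src:
        lines = src.read().splitlines()
    counts = reduce_phase(map_phase(lines))
    with open(output_path, "w") as dst:
        json.dump(counts, dst)
    return counts


def working_set_queries(lines):
    """Load once, then run several computations on the same in-memory data."""
    working_set = tuple(lines)  # read-only, loosely analogous to an RDD's items
    word_counts = reduce_phase(map_phase(working_set))
    longest_line = max(working_set, key=len)
    return word_counts, longest_line


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src_path = os.path.join(tmp, "input.txt")
        dst_path = os.path.join(tmp, "counts.json")
        with open(src_path, "w") as f:
            f.write("spark reads data\nSpark reuses data\n")
        print(mapreduce_job(src_path, dst_path))
        print(working_set_queries(["spark reads data", "Spark reuses data"]))
```

In the MapReduce-style job, a second analysis would have to start again from the file on disk. In the working-set version, both the word count and the longest-line query reuse the data already held in memory. Real Spark adds distribution across machines and fault tolerance, which this sketch does not attempt.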
AWS Batch enables developers, scientists, and engineers to easily and efficiently run hundreds of thousands of batch computing jobs on AWS. AWS Batch dynamically provisions the optimal quantity and type of compute resources (e.g., CPU or memory optimized instances) based on the volume and specific resource requirements of the batch jobs submitted. With AWS Batch, there is no need to install and manage batch computing software or server clusters that you use to run your jobs, allowing you to focus on analyzing results and solving problems. AWS Batch plans, schedules, and executes your batch computing workloads across the full range of AWS compute services and features, such as Amazon EC2 and Spot Instances.
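As an illustration of how a job's resource requirements might be declared at submission time, the sketch below assembles the keyword arguments for the `submit_job` call of the boto3 Batch client. The helper name `build_submit_job_request` and the job, queue, and definition names in the usage note are hypothetical. The helper only builds a dictionary and does not contact AWS.

```python
def build_submit_job_request(job_name, job_queue, job_definition, vcpus, memory_mib):
    """Assemble keyword arguments for the boto3 Batch client's submit_job call.

    The vCPU and memory figures are passed as per-job overrides so the
    service can take them into account when placing the job.
    """
    if vcpus <= 0 or memory_mib <= 0:
        raise ValueError("vcpus and memory_mib must be positive")
    return {
        "jobName": job_name,
        "jobQueue": job_queue,
        "jobDefinition": job_definition,
        "containerOverrides": {
            # The Batch API expects resource values as strings.
            "resourceRequirements": [
                {"type": "VCPU", "value": str(vcpus)},
                {"type": "MEMORY", "value": str(memory_mib)},
            ]
        },
    }
```

Assuming valid AWS credentials and an existing job queue and job definition, the request could then be sent with something like `boto3.client("batch").submit_job(**build_submit_job_request("nightly-report", "example-queue", "example-definition", 2, 4096))`, where all three names are placeholders.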
Apache Spark is ranked 1st in Compute Service with 10 reviews, while AWS Batch is ranked 7th. Apache Spark is rated 8.6, while AWS Batch is rated 0.0. The top reviewer of Apache Spark writes "Good Streaming features enable to enter data and analysis within Spark Stream". Apache Spark is most compared with Spring Boot, Azure Stream Analytics, AWS Lambda, SAP HANA and Cloudera Distribution for Hadoop, whereas AWS Batch is most compared with AWS Lambda, AWS Fargate, Oracle Compute Cloud Service, Apache NiFi and Amazon Elastic Inference.
We monitor all Compute Service reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.