We performed a comparison between Amazon Kinesis and Apache Spark Streaming based on real PeerSpot user reviews.
Find out in this report how the two Streaming Analytics solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Setting Amazon Kinesis up is quick and easy; it only takes a few minutes to configure the necessary settings and start using it."
"Great auto-scaling, auto-sharing, and auto-correction features."
"The scalability is pretty good."
"Everything is hosted and simple."
"Kinesis is a fully managed program streaming application. You can manage any infrastructure. It is also scalable. Kinesis can handle any amount of data streaming and process data from hundreds, thousands of processes in every source with very low latency."
"From my experience, one of the most valuable features is the ability to track silent events on endpoints. Previously, these events might have gone unnoticed, but now we can access them within the product range. For example, if a customer reports that their calls are not reaching the portal files, we can use this feature to troubleshoot and optimize the system."
"The solution works well in rather sizable environments."
"The feature that I've found most valuable is the replay. That is one of the most valuable in our business. We are business-to-business so replay was an important feature - being able to replay for 24 hours. That's an important feature."
"Apache Spark Streaming was straightforward in terms of maintenance. It was actively developed, and migrating from an older to a newer version was quite simple."
"Apache Spark Streaming is versatile. You can use it for competitive intelligence, gathering data from competitors, or for internal tasks like monitoring workflows."
"Apache Spark Streaming's most valuable feature is near real-time analytics. The developers can build APIs easily for a code-steaming pipeline. The solutions have an ecosystem of integration with other stock services."
"The solution is better than average and some of the valuable features include efficiency and stability."
"The solution is very stable and reliable."
"Apache Spark Streaming has features like checkpointing and Streaming API that are useful."
"It's the fastest solution on the market with low latency data on data transformations."
"As an open-source solution, using it is basically free."
"In general, the pain point for us was that once the data gets into Kinesis there is no way for us to understand what's happening because Kinesis divides everything into shards. So if we wanted to understand what's happening with a particular shard, whether it is published or not, we could not. Even with the logs, if we want to have some kind of logging it is in the shard."
"I think the default settings are far too low."
"Could include features that make it easier to scale."
"There are certain shortcomings in the machine learning capacity offered by the product, making it an area where improvements are required."
"Lacks first in, first out queuing."
"Kinesis Data Analytics needs to be improved somewhat. It's SQL based data but it is not as user friendly as MySQL or Athena tools."
"One thing that would be nice would be a policy for increasing the number of Kinesis streams because that's the one thing that's constant. You can change it in real time, but somebody has to change it, or you have to set some kind of meter. So, auto-scaling of adding and removing streams would be nice."
"I suggest integrating additional features, such as incorporating Amazon Pinpoint or Amazon Connect as bundled offerings, rather than deploying them as separate services."
"It was resource-intensive, even for small-scale applications."
"The cost and load-related optimizations are areas where the tool lacks and needs improvement."
"The initial setup is quite complex."
"In terms of improvement, the UI could be better."
"The solution itself could be easier to use."
"We would like to have the ability to do arbitrary stateful functions in Python."
"The service structure of Apache Spark Streaming can improve. There are a lot of issues with memory management and latency. There is no real-time analytics. We recommend it for the use cases where there is a five-second latency, but not for a millisecond, an IOT-based, or the detection anomaly-based. Flink as a service is much better."
"There could be an improvement in the area of the user configuration section, it should be less developer-focused and more business user-focused."
Amazon Kinesis is ranked 1st in Streaming Analytics with 24 reviews while Apache Spark Streaming is ranked 8th in Streaming Analytics with 8 reviews. Amazon Kinesis is rated 8.0, while Apache Spark Streaming is rated 8.0. The top reviewer of Amazon Kinesis writes "Used for media streaming and live-streaming data". On the other hand, the top reviewer of Apache Spark Streaming writes "Easy integration, beneficial auto-scaling, and good open-sourced support community". Amazon Kinesis is most compared with Azure Stream Analytics, Amazon MSK, Confluent, Apache Flink and Databricks, whereas Apache Spark Streaming is most compared with Spring Cloud Data Flow, Azure Stream Analytics, Apache Pulsar, Confluent and Starburst Enterprise. See our Amazon Kinesis vs. Apache Spark Streaming report.
See our list of best Streaming Analytics vendors.
We monitor all Streaming Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.