We just raised a $30M Series A: Read our story
2018-06-27T19:19:00Z

What do you like most about Apache Spark?

4

Hi Everyone,

What do you like most about Apache Spark?

Thanks for sharing your thoughts with the community!

ITCS user
Guest
1515 Answers

author avatar
Top 5LeaderboardReal User

The solution has been very stable.

2021-08-18T14:51:07Z
author avatar
Top 20LeaderboardReal User

I like that it can handle multiple tasks parallelly. I also like the automation feature. JavaScript also helps with the parallel streaming of the library.

2021-03-27T15:39:24Z
author avatar
Top 5LeaderboardReal User

Its scalability and speed are very valuable. You can scale it a lot. It is a great technology for big data. It is definitely better than a lot of earlier warehouse or pipeline solutions, such as Informatica.

Spark SQL is very compliant with normal SQL that we have been using over the years. This makes it easy to code in Spark. It is just like using normal SQL. You can use the APIs of Spark or you can directly write SQL code and run it. This is something that I feel is useful in Spark.

2021-02-01T12:04:16Z
author avatar
Top 10LeaderboardReal User

AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI.

2020-10-28T02:27:29Z
author avatar
Top 20LeaderboardReal User

The memory processing engine is the solution's most valuable aspect. It processes everything extremely fast, and it's in the cluster itself. It acts as a memory engine and is very effective in processing data correctly.

2020-07-23T07:58:35Z
author avatar
Real User

I love every core functionality of Apache Spark Initially they have only provided RDD basic interface to process the data across distributed cluster. Then it evolved to dataframe and dataset interface with optimised execution engine and more flexibility for developers to perform querying on the data.

2020-06-10T05:27:31Z
author avatar
Top 20LeaderboardConsultant

The processing time is very much improved over the data warehouse solution that we were using.

2020-02-02T10:42:14Z
author avatar
Consultant

The main feature that we find valuable is that it is very fast.

2020-01-29T11:22:00Z
author avatar
Real User

The features we find most valuable are the machine learning, data learning, and Spark Analytics.

2020-01-29T11:22:00Z
author avatar
Consultant

I feel the streaming is its best feature.

2019-12-23T07:05:00Z
author avatar
Top 20Real User

The solution is very stable.

2019-12-09T10:58:00Z
author avatar
Consultant

The most valuable feature of this solution is its capacity for processing large amounts of data.

2019-10-13T05:48:00Z
author avatar
Real User

The scalability has been the most valuable aspect of the solution.

2019-07-14T10:21:00Z
author avatar
Real User

I found the solution stable. We haven't had any problems with it.

2019-07-10T12:01:00Z
author avatar
User

Features include machine learning, real time streaming, and data processing.

2018-06-27T19:19:00Z
Learn what your peers think about Apache Spark. Get advice and tips from experienced pros sharing their opinions. Updated: October 2021.
540,884 professionals have used our research since 2012.