Manager | Data Science Enthusiast | Management Consultant at a consultancy with 5,001-10,000 employees
Dec 10 2017

What do you think of Apache Spark?

Improvements to My Organization
Organisations can now harness richer data sets and benefit from use cases that add value to their business functions.

Valuable Features
Distributed in-memory processing. Some of the algorithms are resource heavy, and executing them requires a lot of RAM and CPU. With Hadoop-related technologies, we can distribute the workload across multiple commodity machines.

Room for Improvement
Include more machine learning algorithms, and add the ability to handle streaming data rather than only micro-batch processing.

Use of Solution
Three to five years.

Stability Issues
At times, when users do not know how to use Spark and request a lot of resources, the underlying JVMs can crash, which is a major source of worry.

Scalability Issues
No...

