Apache Spark Room for Improvement

Big Data Consultant at a tech services company with 501-1,000 employees
Apache Spark provides very good performance The tuning phase is still tricky. View full review »
Sr. Software Engineer at a tech vendor with 1-10 employees
This product is already improving as the community is developing it rapidly. More ML based algorithms should be added to it, to make it algorithmic-rich for developers. View full review »
Sumit Pal
Architect at a healthcare company with 51-200 employees
Stability in terms of API (things were difficult, when transitioning from RDD to DataFrames, then to DataSet). View full review »
Abhijit Nayak
Manager | Data Science Enthusiast | Management Consultant at a consultancy with 5,001-10,000 employees
Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing. View full review »
I would suggest for it to support more programming languages, and also provide an internal scheduler to schedule spark jobs with monitoring capability. View full review »
Subhasish Guha
Big Data and Cloud Solution Consultant at a financial services firm with 10,001+ employees
Dynamic DataFrame options are not yet available. View full review »
Sumanth Punyamurthula
Director - Data Management, Governance and Quality with 10,001+ employees
It is like going back to the '80s for the complicated coding that is required to write efficient programs. View full review »
Rosemary Walsh
Portfolio Manager, Enterprise Solutions Architect with 10,001+ employees
Better data lineage support. View full review »

Sign Up with Email