- RDDs
- DataFrames
- Machine learning libraries
Faster time to parse and compute data. It makes web-based queries for plotting data easier.
It needs to be simpler to use the machine learning algorithms supported by Octave (example polynomial regressions, polynomial interpolation).
I've been using it for one year.
There have been no issues with the deployment.
There have been no issues with the stability.
There have been no issues with the scalability.
We still rely on user forums for my answers. We do not use a commercial product yet.
The initial set-up was easy. I have not explored using this on AWS clusters.
We did an in-house implementation and development for our regression tool.
The ROI will be an in-house product to do machine learning analytics on data obtained from customer.
We did not evaluate any other products.
It's easy to use and has a learning curve.