If you were talking to someone whose organization is considering H2O.ai, what would you say?
How would you rate it and why? Any other tips or advice?
H2O is a good product, and I suggest that people use it. My advice to anybody who is considering this type of solution is to consider whether they want to procure such products and use them versus building something custom. It depends on time availability. It really just exposes Spark as the next layer. I would rate this solution a seven out of ten.
It deals well with its core functionality. The product is definitely worth looking at, as it is one of the upcoming products where you can build large models for use cases. I am using the on-premise version.
Do your due diligence, making sure with your use cases, this is the right product for you. Directionally, they are headed in the right place. They're also putting a lot of muscle behind it, but they're very focused in one area. Supervised on supervised learning is the market that they're going after. If that's their strategy, then they'll get some part of the market, but they'll leave the other part of the market behind. We use just the AWS version of the product. It integrates well with our notebooks. It also integrates well with our homegrown tool sets.
H2O.ai works directly with a lot of our cloud data, big data environment, and Amazon RedShift environment. The big data integration was easier from a performance perspective than Amazon RedShift. That is because our big data environment is still on-premise vs RedShift, which is on the cloud, so we had to go through some struggles to get it operating with RedShift. We also use the on-premise version.