Anonymous User, Lead Data Architect at a government
We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"We are completely satisfied with the ease of connecting to different sources of data or pocket files in the search"
"Automation with Databricks is very easy when using the API."
"Databricks is based on a Spark cluster and it is fast. Performance-wise, it is great."
"The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient."
"I work in the data science field and I found Databricks to be very useful."
"The time travel feature is the solution's most valuable aspect."
"I haven't heard about any major stability issues. At this time I feel like it's stable."
"Imageflow is a visual tool that helps make it easier for business people to understand complex workflows."
"All of the features of this product are quite good."
"The integration features could be more interesting, more involved."
"Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems."
"It should have more compatible and more advanced visualization and machine learning libraries."
"The solution could be improved by integrating it with data packets. Right now, the load tables provide a function, like team collaboration. Still, it's unclear as to if there's a function to create different branches and/or more branches. Our team had used data packets before, however, I feel it's difficult to integrate the current with the previous data packets."
"It would be very helpful if Databricks could integrate with platforms in addition to Azure."
"Databricks is an analytics platform. It should offer more data science. It should have more features for data scientists to work with."
"Pricing is one of the things that could be improved."
"The product needs samples and templates to help invite users to see results and understand what the product can do."
"The interface should be more user-friendly."
"Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
"I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
"Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
"We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
"The pricing depends on the usage itself."
"I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
"The price is okay. It's competitive."
"Databricks uses a price-per-use model, where you can use as much compute as you need."
Databricks creates a Unified Analytics Platform that accelerates innovation by unifying data science, engineering, and business. It utilizes Apache Spark to help clients with cloud-based big data processing. It puts Spark on “autopilot” to significantly reduce operational complexity and management cost. The Databricks I/O module (DBIO) improves the read and write performance of Apache Spark in the cloud. Databricks’ collaborative workspace is designed to increase productivity.
Cloud Datalab is a powerful interactive tool created to explore, analyze, transform and visualize data and build machine learning models on Google Cloud Platform. It runs on Google Compute Engine and connects to multiple cloud services easily so you can focus on your data science tasks.
Databricks is ranked 2nd in Data Science Platforms with 22 reviews, while Google Cloud Datalab is ranked 19th in Data Science Platforms with 1 review. Both Databricks and Google Cloud Datalab are rated 8.0. The top reviewer of Databricks writes "Has a good feature set but it needs samples and templates to help invite users to see results". The top reviewer of Google Cloud Datalab, on the other hand, writes "Stable, feature-rich, and easy to set up". Databricks is most compared with Microsoft Azure Machine Learning Studio, Amazon SageMaker, Azure Stream Analytics, Alteryx and Domino Data Science Platform, whereas Google Cloud Datalab is most compared with IBM Watson Studio, Microsoft Azure Machine Learning Studio, Cloudera Data Science Workbench, MathWorks Matlab and KNIME.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.