Comparison Buyer's Guide

Executive SummaryUpdated on Jan 25, 2024
 

Categories and Ranking

Databricks
Average Rating
8.2
Number of Reviews
78
Ranking in other categories
Data Science Platforms (1st), Streaming Analytics (2nd)
VAST Data
Average Rating
10.0
Number of Reviews
2
Ranking in other categories
All-Flash Storage (17th), File and Object Storage (11th), NVMe All-Flash Storage Arrays (8th)
 

Market share comparison

As of June 2024, in the Data Science Platforms category, the market share of Databricks is 20.3% and it increased by 4.6% compared to the previous year. The market share of VAST Data is 0.8% and it increased by Infinity% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
Unique Categories:
Streaming Analytics
17.8%
All-Flash Storage
2.3%
File and Object Storage
3.7%
 

Featured Reviews

Axel Richier - PeerSpot reviewer
May 14, 2024
Simple to set up, fast to deploy, and with regular product updates
The shared experience of collaborative notebooks is probably the most useful aspect since, as an expert, it allows me to help my juniors debug their books and their code live. I can do some live coding with them or help them find the errors very efficiently. It has become very simple to set up thanks to its official Terraform provider and the open-source modules made available on GitHub. I love Databricks due to the fact that we can now deploy it in 15 minutes and it's ready to use. That's very nice since we often help our clients in deploying their first Data Platform with Databricks. The solution is stable, with LTS Runtimes that have proven to remain stable over the years.
Alan Powers - PeerSpot reviewer
May 3, 2023
Stability-wise, a device that has been up and running for years
The solution is useful for machine learning and scientific applications, including computer simulations The failover capability and resiliency are some of the solution's valuable features. The big thing is resilience because it has richer coding in it, so multiple devices can't fail. Also, one…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The setup is quite easy."
"Databricks' most valuable feature is the data transformation through PySpark."
"The solution is easy to use and has a quick start-up time due to being on the cloud."
"The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient."
"The time travel feature is the solution's most valuable aspect."
"A very valuable feature is the data processing, and the solution is specifically good at using the Spark ecosystem."
"I like cloud scalability and data access for any type of user."
"The initial setup is pretty easy."
"The solution is useful for machine learning and scientific applications, including computer simulations."
"This has been one of the most reliable storage systems that I have ever used."
 

Cons

"In the future, I would like to see Data Lake support. That is something that I'm looking forward to."
"I believe that this product could be improved by becoming more user-friendly."
"It should have more compatible and more advanced visualization and machine learning libraries."
"The integration of data could be a bit better."
"The integration and query capabilities can be improved."
"I would like to see more documentation in terms of how an end-user could use it, and users like me can easily try it and implement use cases."
"The Databricks cluster can be improved."
"There are no direct connectors — they are very limited."
"The read/write ratio is an area in the solution with some flaws and needs improvement."
"The write performance could be improved because it is less than half of the read performance."
 

Pricing and Cost Advice

"The solution is affordable."
"We're charged on what the data throughput is and also what the compute time is."
"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"Price-wise, I would rate Databricks a three out of five."
"We have only incurred the cost of our AWS cloud services. This is because during this period, Databricks provided us with an extended evaluation period, and we have not spent much money yet. We are just starting to incur costs this month, I will know more later on the full cost perspective."
"The licensing costs of Databricks depend on how many licenses we need, depending on which Databricks provides a lot of discounts."
"The solution is a good value for batch processing and huge workloads."
"Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
"Price-wise, VAST Data is not the cheapest, not the most expensive one."
"We acquired VAST Data as a one-time, capital purchase."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
787,061 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
15%
Computer Software Company
12%
Manufacturing Company
9%
Healthcare Company
6%
Computer Software Company
19%
Manufacturing Company
14%
Financial Services Firm
11%
University
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
What do you like most about VAST Data?
The solution is useful for machine learning and scientific applications, including computer simulations.
What is your experience regarding pricing and costs for VAST Data?
Price-wise, VAST Data is not the cheapest, not the most expensive one.
What needs improvement with VAST Data?
The read/write ratio is an area in the solution with some flaws and needs improvement.
 

Comparisons

 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
No data available
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Norwest Venture Partners, General Dynamics Information Technology, Ginkgo Bioworks
Find out what your peers are saying about Databricks, Microsoft, Alteryx and others in Data Science Platforms. Updated: May 2024.
787,061 professionals have used our research since 2012.