Databricks Reviews

Filter by:
Industry
Loading...
Filter Unavailable
Company Size
Loading...
Filter Unavailable
Job Level
Loading...
Filter Unavailable
Rating
Loading...
Filter Unavailable
Considered
Loading...
Filter Unavailable
Order by:
Loading...
  • Date
  • Highest Rating
  • Lowest Rating
  • Review Length
Search:
Showingreviews based on the current filters. Reset all filters
Real User
Data Scientist at a energy/utilities company with 10,001+ employees
Feb 14 2020

What is most valuable?

Of the available feature set, I like the Imageflow feature a lot. It is very interesting. It gives me clarity on the execution of a process. I can draw the complete flow from start to finish in the exact way that I want it to execute. It is more visual and it is also easier for the people in… more»

What needs improvement?

I think the automatic categorization of variables needs to be improved. The current functionality is not always efficiently identifying the features of the data that is collected. Probably that is the only thing I can think of. Apart from that, I have not explored the product enough yet to go into… more»

What other advice do I have?

On a scale from one to ten where one is the worst and ten is the best, I would rate Databricks overall as around a 7 or 7.5. If we had more experience with it and could be sure we had a solid understanding of what it could do and the reliability, I might recommend it with a better score. I do not… more»

Which other solutions did I evaluate?

I did have some experience with SageMaker before looking at Databricks, but apart from we have not been looking into any of the other solutions that are available. We were just exploring a few of the different solutions that the members of the team already have experience with. Most of the team came… more»
RameshCh
Real User
Sr. BigData Architect at ITC Infotech
Jul 01 2020

What is most valuable?

The elasticity of the solution is excellent. The storage, etc., can be scaled up quite easily when we need it to. It's easy to increase performance as required. The solution runs on Spark very well.

What needs improvement?

Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively. They're currently coming out with a new… more»

What's my experience with pricing, setup cost, and licensing?

The solution is expensive. It's not like a lot of competitors, which are open-source.

Which solution did I use previously and why did I switch?

We work with multiple clients and this solution is just one of the examples of products we work with. We use several others as well, depending on the client. It's all wrappers between the same underlying systems. For example, Spark. It's… more»

What other advice do I have?

There isn't really a version, per se. It's a popular service. I'd recommend the solution. The solution is cloud-agnostic right now, so it really can go into any cloud. It's the users who will be leveraging installed environments that can… more»
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: April 2020.
431,790 professionals have used our research since 2012.
Tristan Bergh
Real User
Data Scientist at iOCO
Mar 16 2020

What is most valuable?

Immense ease in running very large scale analytics, with a convenient and slick UI. This saved us from having to tweak, tune, dive into deeper abstractions, get involved… more»

How has it helped my organization?

Databricks took care of all the underlying cluster management seamlessly. We could configure our clusters to run and deliver results without any delays due to hardware… more»

What needs improvement?

The product could be improved by offering an expansion of their visualization capabilities, which currently assists in development in their notebook environment. Perhaps a… more»

What's my experience with pricing, setup cost, and licensing?

Set up a cluster in your cloud of choice, but Databricks' service might also be very competitive as their pricing units will be built in. Licensing on site I would counsel… more»

Which solution did I use previously and why did I switch?

Previously I used Hive and Livy in Zeppelin on an in-house Hadoop installation. The queries constantly threw exceptions and timeouts and the necessary configuration… more»

What other advice do I have?

By investing in people skilled in data querying, Python coding, and even basic Data Science, a Databricks setup will reward the business. Once the Databricks data flows… more»

Which other solutions did I evaluate?

I evaluated Hortonworks, Livy, and Zeppelin. These were unsuitable due to the unavailability of sufficiently skilled personnel.
Abhijith Dattatreya
Consultant
Business Intelligence and Analytics Consultant at a tech services company with 201-500 employees
Dec 12 2019

What is most valuable?

The most valuable feature is the ability to switch loads between multiple clusters. Automation with Databricks is very easy when using the API. The ability to write code and SQL in the same interface is useful. It is easy to connect notebooks to a cluster. There are a large number of inbuilt functions that help to make things easier.

What needs improvement?

Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems. As it is now, we have to go into the driver logs to identify the error messages properly. There is not much information about Databricks available online, such as cost. Whenever we want to find the actual costing, we have… more»

What other advice do I have?

My advice for developers who are interested in working with this solution is to first go through the Spark architecture. I would rate this solution a nine out of ten.
Alexandre Akrour
Real User
CEO at Inosense
Dec 05 2019

What is most valuable?

Valuable features would have to include the Notebook for piping some models and the future of executing the notebooks in parallel, in batches, which is also something that we use. And we use the Notebook on Spark with Python.

What needs improvement?

Improvements could include the pricing, the product is a little expensive, although I think comparable to other similar options. The integration features could be more interesting, more involved. For example, we use the Database Notebook, which is not as great as Jupyter Notebook, for providing a… more»

Which solution did I use previously and why did I switch?

We previously used the earlier version of Azure Machine Learning services and we decided to move over because over time it became more difficult to deploy. That was two years ago, but now with the new version, it's much easier to deploy Machine Learning.

What other advice do I have?

The product has improved and I'm sure this will continue in the next versions. We are completely satisfied with it, the ease of connecting to different sources of data or pocket files in the search. I think it could be very interesting for users looking for a framework to use Databricks. I would… more»
Yuval Klein
Real User
Pre-sale Leader, Big Data Enterprise Solutions at Ness Technologies
Apr 14 2020

What is most valuable?

The most valuable feature is the ability to use SQL directly with Databricks. That is the most relevant thing for my current project. After deployment, it is easy to load files and query data.

What needs improvement?

I have seen better user interfaces, so that is something that can be improved. It was quite hard to deploy.

What's my experience with pricing, setup cost, and licensing?

We find Databricks to be very expensive, although this improved when we found out how to shut it down at night.

Which solution did I use previously and why did I switch?

I have used Snowflake and one of the differences is that Snowflake is much easier to deploy.

What other advice do I have?

Our client is a bank and some of the information can be shared outside of the organization, whereas some of the data is confidential and private. Using a purely on-premises solution would have made it more difficult to share information… more»
ShrikanthHebbar
Consultant
Data Science Consultant at Syniti
Jan 08 2020

What is most valuable?

I found that PySpark is the most useful tool. It uses in-memory calculation and when you want to run a model it does it very quickly. We used to use Python and when we migrated to PySpark the performance was much better.

What needs improvement?

It would be very helpful if Databricks could integrate with platforms in addition to Azure. Having an open-source version or having the option to get a trial version of Databricks would be very helpful. It would be very useful for beginners if there were tutorials and examples on how to write code… more»

Which solution did I use previously and why did I switch?

Before using Databricks, we were running our own cluster with a web server that executed our Python queries.

What other advice do I have?

I work in the data science field and I found Databricks to be very useful. If I want to run any models then I can code them in PySpark. If you are coming from a Python background then you can write code in PySpark and it runs quickly. This is a good solution in terms of performance. I would rate… more»
Real User
Engineer at a tech services company with 10,001+ employees
Feb 04 2020

What is most valuable?

The time travel feature is the solution's most valuable aspect.

What needs improvement?

The management of the solution needs to be modernized. Managing the radius data is hard. The solution requires modern scoring. There's not a good way of knowing how the models are performing from a data science perspective. The solution… more»

What's my experience with pricing, setup cost, and licensing?

I'm not sure what the licensing costs are on the solution.

What other advice do I have?

We're partners with Databricks. We're using the latest version of the solution, but I can't recall what version number we are on. I'd advise others considering the solution to look at usage. They shouldn't adopt the solution blindly. How… more»

Which other solutions did I evaluate?

We did evaluate Amazon PageMaker before ultimately choosing Databricks. It's the only other solution we evaluated at the time.
See 4 More Databricks Reviews

Articles

User Assessments By Topic About Databricks

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: April 2020.
431,790 professionals have used our research since 2012.

Databricks Questions

What is Databricks?

Databricks creates a Unified Analytics Platform that accelerates innovation by unifying data science, engineering, and business. It utilizes Apache Spark to help clients with cloud-based big data processing. It puts Spark on “autopilot” to significantly reduce operational complexity and management cost. The Databricks I/O module (DBIO) improves the read and write performance of Apache Spark in the cloud. An increase in productivity is ensured through Databricks’ collaborative workplace.

Also known as
Databricks Unified Analytics, Databricks Unified Analytics Platform
Databricks customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware

BUYER'S GUIDE
Download our free Databricks Report and get advice and tips from experienced pros sharing their opinions.