Compare Databricks vs. Dataiku Data Science Studio

Databricks is ranked 5th in Data Science Platforms with 6 reviews while Dataiku Data Science Studio is ranked 12th in Data Science Platforms with 4 reviews. Databricks is rated 8.6, while Dataiku Data Science Studio is rated 7.6. The top reviewer of Databricks writes "Good build-in optimization, easy to use with a good user interface". On the other hand, the top reviewer of Dataiku Data Science Studio writes "User interface is colorful, beautiful, and well-designed but sometimes the solution can be slow". Databricks is most compared with Amazon SageMaker, Microsoft Azure Machine Learning Studio and Cloudera Data Science Workbench, whereas Dataiku Data Science Studio is most compared with Alteryx, KNIME and Databricks. See our Databricks vs. Dataiku Data Science Studio report.
Cancel
You must select at least 2 products to compare!
Most Helpful Review
Find out what your peers are saying about Databricks vs. Dataiku Data Science Studio and other solutions. Updated: January 2020.
391,045 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
I work in the data science field and I found Databricks to be very useful.The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient.Databricks is based on a Spark cluster and it is fast. Performance-wise, it is great.Automation with Databricks is very easy when using the API.We are completely satisfied with the ease of connecting to different sources of data or pocket files in the searchThe built-in optimization recommendations halved the speed of queries and allowed us to reach decision points and deliver insights very quickly.

Read more »

I like the interface, which is probably my favorite part of the solution. It is really user-friendly for an IT person.The most valuable feature is the set of visual data preparation tools.The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction.Extremely easy to use with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors.Cloud-based process run helps in not keeping the systems on while processes are running.

Read more »

Cons
It would be very helpful if Databricks could integrate with platforms in addition to Azure.The solution could be improved by integrating it with data packets. Right now, the load tables provide a function, like team collaboration. Still, it's unclear as to if there's a function to create different branches and/or more branches. Our team had used data packets before, however, I feel it's difficult to integrate the current with the previous data packets.It should have more compatible and more advanced visualization and machine learning libraries.Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems.The integration features could be more interesting, more involved.The product could be improved by offering an expansion of their visualization capabilities, which currently assists in development in their notebook environment.

Read more »

I find that it is a little slow during use. It takes more time than I would expect for operations to complete.In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin.The ability to have charts right from the explorer would be an improvement.Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable.Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days).There were stability issues: 1) SQL operations, such as partitioning, had bugs and showed wrong results. 2) Due to server downtime, scheduled processes used to fail. 3) Access to project folders was compromised (privacy issue) with wrong people getting access to confidential project folders.

Read more »

Pricing and Cost Advice
I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly.Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful.Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery.

Read more »

The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything.

Read more »

report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
391,045 professionals have used our research since 2012.
Ranking
5th
Views
8,640
Comparisons
7,868
Reviews
5
Average Words per Review
567
Avg. Rating
8.6
12th
Views
7,866
Comparisons
5,723
Reviews
4
Average Words per Review
493
Avg. Rating
7.5
Top Comparisons
Compared 19% of the time.
Compared 14% of the time.
Also Known As
Databricks Unified Analytics, Databricks Unified Analytics PlatformDataiku DSS
Learn
Databricks
Dataiku
Overview

Databricks creates a Unified Analytics Platform that accelerates innovation by unifying data science, engineering, and business. It utilizes Apache Spark to help clients with cloud-based big data processing. It puts Spark on “autopilot” to significantly reduce operational complexity and management cost. The Databricks I/O module (DBIO) improves the read and write performance of Apache Spark in the cloud. An increase in productivity is ensured through Databricks’ collaborative workplace.

Dataiku DSS is the collaborative data science software platform for teams of data scientists, data analysts, and engineers to explore, prototype, build, and deliver their own data products more efficiently.

Offer
Learn more about Databricks
Learn more about Dataiku Data Science Studio
Sample Customers
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, YeswareBGL BNP Paribas, Dentsu Aegis, Link Mobility Group, AramisAuto
Top Industries
VISITORS READING REVIEWS
Software R&D Company40%
Media Company9%
Comms Service Provider7%
Government7%
VISITORS READING REVIEWS
Software R&D Company30%
Financial Services Firm16%
Comms Service Provider7%
Manufacturing Company5%
Find out what your peers are saying about Databricks vs. Dataiku Data Science Studio and other solutions. Updated: January 2020.
391,045 professionals have used our research since 2012.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.