Databricks Initial Setup
JH
JurajHrapko
Solution Architect at a insurance company with 10,001+ employees
The setup is not easy but also is not too complicated. An infrastructure needs to be set up first. We use Azure storage or SQL S3 and create private end points.
This is maybe a little more complex or a bit different than other databases in the cloud. For a traditional setup, you need to also think about file systems and disks. Here, you just transform it into the storage and private end point.
The first setup might be a bit of a struggle until you learn and understand what is necessary.
View full review »SS
Solleti Sudheer Kumar
Business Architect at YASH Technologies
The setup is of average difficulty but tougher than Snowflake.
Deployment is easy and run time is quick.
View full review »The initial setup is easy for me because I access the solution on a web browser.
View full review »Buyer's Guide
Databricks
April 2024
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
767,995 professionals have used our research since 2012.
Setup was complex. There were some issues with setting up a database and installing the third party component on top of services. I would rate the setup 3 out of 5.
View full review »When we have administration experience, the solution is not difficult to deploy. Technically, however, it's difficult because governance is more complex. For example, I have two warehouses on Databricks, which are clusters in this workspace, and we have to switch from workspace to workspace to have all this information. There is a system table that has all this, but I don't know if everyone can use these tables.
View full review »The initial setup phase of Databricks was good. You can spin up clusters and integrate those with DevOps as well. Databricks it's quite nice owing to its user-friendly UI, DPP, and workspaces.
The solution is deployed on the cloud.
The time taken for the deployment depends on the workload.
The implementation is not challenging because the solution integrates well with the platforms on which they are established, whether it's Azure, AWS, or GCP. The solution is not difficult to set up, but you'd probably need a technical user to operate it.
It's the same story with maintenance, where you'd need a technically proficient person with programming knowledge to maintain it.
View full review »The implementation is very simple to set up. That's why we choose it over many other tools.
Usually, we have two to five data engineers handling the maintenance and running of our solutions.
View full review »The initial setup was okay.
View full review »The initial setup was easy because the third-party team made the clusters for us.
View full review »AO
Alphonse Okossi
Lead Data Scientist at a manufacturing company with 10,001+ employees
The initial setup was straightforward.
View full review »The cloud-based deployment is simple.
If you use an on-premises deployment then there is more to do.
View full review »The initial setup was pretty complex and required three people.
View full review »The initial setup was relatively straightforward. I would rate it nine, on a scale from one to 10, with one being the easiest and 10 being the hardest.
There is no need to worry about the deployment as it can be done quickly. It is relatively automated. We used Terraform for auto-deployment, which happens in Azure. With Terraform, there are two options. As option one, you can deploy manually by creating services. For option two, you use Terraform and automate. Terraform is like infrastructure as a code where you can code the deployment part using it.
There were two or three persons involved in the deployment of this solution.
The initial setup for this solution is very simple.
View full review »RC
RameshCh
Sr. BigData Architect at ITC Infotech
The situation may have been a bit different for me than for many users or organizations. I've been in this industry for more than 15 or 17 years. I have a lot of experience. I also took the time to do some research and preparation for the setup. It was straightforward for me.
The deployment with Microsoft usually can be done in 20 minutes. However, it can take 40 to 45 minutes to complete. An organization only requires one person to upload the data and have complete access to the account.
View full review »SA
reviewer2041779
Principal at a computer software company with 5,001-10,000 employees
I don't have experience setting up Databricks because that's generally taken care of by the IT, data, or software engineering team before the data science team comes in and starts leveraging the platform. I have yet to experience setting up the Databricks environment personally. However, I have had experience setting up clusters, which was pretty straightforward. Still, in the overall environment of an enterprise-wide system, I have yet to gain experience setting Databricks up.
View full review »The initial setup is not that difficult. I rate the ease of setup a seven out of ten. The solution is cloud-based. We use native services like Data Factory for orchestration. Sometimes, the customers require us to use Amazon as the cloud provider instead of Azure.
View full review »AB
Alexis Bustamante
STI Data Leader at grupo gtd
The implementation is quite easy. It's not complex or difficult. The first time, I did it using a tutorial which was quite helpful. Later, I took a course. I know it quite well.
The deployment only takes a few days.
You only need to deploy or maintain the solution.
View full review »Setting up Databricks is easy. I set it up at my previous company. That was on Azure as well, but they utilized a third-party team with expertise in Databricks to ensure everything was optimized.
View full review »The transition to Databricks was smooth.
View full review »RM
RobertoMessora
Head of Business Integration and Architecture at Jakala
The initial setup is very easy. It is a managed solution inside Azure so you just need to search for Databricks. There are a couple of pages to follow in the setup wizard and Databricks is up and running.
View full review »The initial setup is easy.
View full review »GR
reviewer1510053
Head of Referential and Big Data at a financial services firm with 5,001-10,000 employees
The initial setup was fairly okay. It takes about two minutes to deploy this solution. It's all code, so we click a button, and then it's done.
On a scale from one to five, I would give the initial setup a four.
View full review »JH
Joao Henriques
Head of Credit Risk and Data at Cegid Invoice and Financing
Setting up Databricks is a bit complex, and the initial deployment took a few days—closer to a week. Of course, not everyone is working full-time on this. There are intervals when people are doing other stuff.
View full review »RC
reviewer2058633
Data Engineering Manager at a pharma/biotech company with 10,001+ employees
The initial setup was easy to complete and not complex. It may initially be challenging for a new user, but it improves over time. The CICD pipeline works well with the Microsoft Azure platform because the continuous integration, development and deployment come with the Git integration. It makes it easier for Databricks and the CICD. The deployment should be improved from the perspective of auto ML functionality, so it doesn't have intensive automation learning capability.
We don't use Databricks directly because we work on a data science project. It requires an auto ML and inbuilt machine learning capability. We found capabilities like the large language model using NLP and other deep learning models that are not that intensive. It is meant for data engineering purposes rather than data science purposes. It'll be great if Databricks could be intensive for data science.
We used a third-party, Dataiku platform for the deployment, where we connected to Databricks and completed the ML ops. We required about three people for deployment, and it is easy to maintain the solution.
RX
reviewer1702092
Machine Learning Engineer at a mining and metals company with 10,001+ employees
The initial setup is easy. However, I do not know much about the implementation because the company does it.
View full review »The setup is quite easy, and Databricks has also partnered with Microsoft, so we get this service on Microsoft Azure.
Deploying Databricks on the cloud is straightforward. It's not like an on-premise solution, where you must create a cluster and all those other prerequisites for big data.
I don't think it's challenging to maintain, but you need an expert programmer because Databricks isn't GUI-based. With GUI-based tools, building ETLs is drag-and-drop. Databricks entirely relies on coding, so you need skilled programmers to building your code, ETLs, etc.
View full review »The solution is on the cloud and therefore there isn't really an installation process that you need to go through. You only really need to configure the clusters.
Within the clusters, you configure according to how many platforms you need, or if you want to, you can build a cluster for artificial intelligence. You just configure it as required.
View full review »MA
Mahesh Alam
Senior Data Engineer at TCS
The initial setup of Databricks is not straightforward. You need to create VLANs, VPNs, and networks. We are two ways of deployment, we are having the legacy PowerShell for the deployment and the template method to deploy the Databricks code to higher levels.
We have not integrated Databricks directly into the DevOps architecture. We are downloading the notebooks manually and we are uploading them.
View full review »The initial setup is not very easy, but it's medium in complexity.
View full review »The initial setup of Databricks is simple. I did not experience any challenges. The time it takes for the deployment is approximately four hours.
I rate the initial setup of Databricks.
View full review »The initial setup of Databricks is more complex. I would rate it a four out of five on the complexity of the setup. It took two days to deploy the solution.
View full review »The initial setup of the solution is straightforward, once you understand the UI it is easy to implement. I would rate Databricks a four out of five for ease of setup.
One migration project took two to three months, including writing all the code and implementing end-to-end pipelines.
We are planning to deploy the solution in stages over the next 15 months to completely implement MLOps for our organization.
View full review »JK
Jayaprakash Kaippada
Lead Architect at Birlasoft IndiaLtd.
The initial setup is pretty simple and requires minimal configuration compared to other technology.
View full review »I am not a data engineer because I just started data science at the company, but it was straightforward and clear for the architect to set up. He provided me with that idea because he realized it would take time if we had use cases. You can select and change the data or add some modules or products. You have all the technology to do so.
View full review »AK
Allan Kirszberg
Coordenador Financeiro at Icatu
The initial setup is difficult.
While I don't know exactly how long the deployment took, I do know that it lasted longer than the one day needed for Alteryx.
Setup and Support are single-click.
View full review »It was relatively simple, we didn't face any challenges. Deployment takes around two days.
PD
Premasish Dan
Enterprise Data Architect at a financial services firm with 51-200 employees
The initial setup was not very complex. We deploy the solution manually and the time required depends on the complexity of the business logic. I rate it an eight out of ten.
View full review »KG
reviewer1488372
Associate Manager at a consultancy with 501-1,000 employees
There is no installation required. It is easy to use, for example, in Azure it is available, you subscribe, and use it.
View full review »HA
reviewer1901577
Cloud Administrator at a retailer with 5,001-10,000 employees
The solution is very easy to setup. I would rate its setup a ten out of ten.
View full review »AM
Ariful Mondal
Global Data Architecture and Data Science Director at FH
There is no installation required.
View full review »MM
reviewer1558740
Lead Data Architect at a government with 1,001-5,000 employees
It was pretty easy to set up. At least, that is my understanding. I'm not the data engineer though. I don't actually do installs and configurations. I explore features and build them in my architecture designs.
YK
Yuval Klein
Pre-sale Leader, Big Data Enterprise Solutions at Ness Technologies
The first deployment is difficult. It is not straightforward and you have to think about a lot of stuff. It is not really like a SaaS deployment and there are a lot of steps that you have to take.
OB
reviewer1438992
Cloud & Infra Security, Group Manager at a tech vendor with 10,001+ employees
The initial setup depends on the readiness of the team working with Databricks. There is no one template saying that it's easy, and it isn't easy. It can be complex to set up if you don't have a really good plan.
You can get in this environment at least for a test. You can do it in the lab, follow it step by step, and that'll take about an hour. The difficulty depends on the business requirements.
If it's for training purposes, you can do it in about half an hour, and you're good to go. If you need it to support a business, it will be much more rigorous because multiple divisions would be interested in running their own environment, working with their data.
View full review »The setup is straightforward, I did it myself.
View full review »VP
reviewer1276782
Data Scientist at a energy/utilities company with 10,001+ employees
The initial setup was not complex at all. The documentation is good. It is clear and not very difficult to understand. Because the documentation is good, the installation is fine.
We did the implementation by ourselves — within our team and with the help of the documentation. But I would not say that we have already deployed the model yet. This is an ongoing process, as there are certain inputs that changed over time.
So we have not implemented the product completely, but we have gotten to advance with the product and our understanding of it. It is good, but our company is still trying to get much better data from it. At this point, it is like the data is just junk and more junk. So we are now working toward that goal of improving the result. Whenever the data result gets better, we'll try to implement the workflow to see how it performs. I would say it will probably take two to three months more before we actually get good data.
ZH
reviewer2144922
Data engineer
The setup was straightforward. It also depends on the projects.
View full review »The initial setup of Databricks was not straightforward. We had to do trial and error and we learned as we went along.
I rate the initial setup of Databricks a four out of five.
View full review »Setup isn't difficult. We used about 15 people for deployment and maintenance. We have data scientists and statisticians using this solution and doing different analyses.
RB
RaphaelBilecki Freitas
Business Intelligence Coordinator Latam at a construction company with 5,001-10,000 employees
The initial setup of Databricks is straightforward and simple. It is not complex because they provide a lot of documentation. The deployment was fast, it took less than three days with five people assigned to the task.
View full review »AP
reviewer1393860
Chief Research Officer at a consumer goods company with 1,001-5,000 employees
The initial set was very straightforward because it's also in our Azure cloud so it was quite easy to set up and configure. Very intuitive.
View full review »PG
PankajGaikwad
Data Science Developer at a tech services company with 501-1,000 employees
It is not difficult to deploy this solution because it is well documented. We followed the normal steps that included all of the APIs.
View full review »LV
reviewer1526169
Advanced Analytics Lead at a pharma/biotech company with 1,001-5,000 employees
The installation is straightforward, and it took approximately one hour.
View full review »AD
Abhijith Dattatreya
Business Intelligence and Analytics Consultant at a tech services company with 201-500 employees
I found the initial setup easy because I had previously worked on Spark.
If somebody goes through the training, which is available on the website, then it should be straightforward. I don't think that it is very hard.
When it comes to developing things based on use cases, it can take between three days and two weeks, plus two to three days for testing and deploying it. I would say that for an entire use case, it will take a maximum of three weeks.
View full review »BG
reviewer1269582
Data Architect at a tech services company with 201-500 employees
The initial setup was not very complex. We had it up and running in no time; it's a quick process.
View full review »NH
reviewer2058678
Director of Data (Engineering & Science) at a tech services company with 11-50 employees
The initial setup for the solution is a bit complex.
View full review »RP
reviewer1888527
Big Data and Cloud Architect at a computer software company with 201-500 employees
The initial setup was easy.
View full review »SN
Sandesh Nagaraj
Head of Data & Analytics at a tech services company with 11-50 employees
The initial setup is easy. It's not difficult when you are used to Azure.
View full review »SV
reviewer1276107
Engineer at a tech services company with 10,001+ employees
The initial setup is straightforward. I wouldn't say that it's complex in any way.
Deployment times vary and really depend on multiple factors. It can take anywhere from a few weeks to a few months to deploy the solution. In our case, it took us about three months to fully deploy it.
It takes two to three people to deploy the solution.
View full review »SH
ShrikanthHebbar
Data Science Consultant at Syniti
The initial setup is straightforward. With respect to deployment, the development can be done within half an hour and we can use code and deploy from there.
View full review »DW
reviewer1235523
Machine Learning Engineer at a tech vendor with 51-200 employees
The initial setup is very straightforward. We just use their job functions. To deploy as a spark job is quite straightforward.
In our use case, we also had some external databases to handle the deployment. For example, we only generated some prediction results. We saved the results into an external database. The solution takes time to deploy to the external database, but the spark job is quite easy.
View full review »SC
ShitanshuChandra
Chief Data Scientist at a tech services company with 11-50 employees
The installation was straightforward because it is on the cloud. The full deployment took approximately one week.
View full review »AA
reviewer1708788
Technical Architect at a tech services company with 10,001+ employees
The initial setup was a little complex because it was a new architecture for the customer, so there was nothing to compare it to in order to accelerate the project. This meant the deployment of the first project using Databricks took almost nine months and the second took almost a year.
View full review »PC
reviewer1270416
Vice President, Business Intelligence and Analytics at a tech services company with 10,001+ employees
The initial setup was straightforward.
View full review »Buyer's Guide
Databricks
April 2024
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
767,995 professionals have used our research since 2012.