Amazon SageMaker vs Databricks comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
10,980 views|8,670 comparisons
84% willing to recommend
Databricks Logo
27,412 views|17,316 comparisons
96% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Mar 6, 2024

We compared Databricks and Amazon SageMaker based on our user's reviews in several parameters.

Databricks offers seamless integration with various data sources, advanced analytics capabilities, and efficient customer service. Users appreciate the collaborative features and positive ROI. On the other hand, Amazon SageMaker is praised for its ease of use, comprehensive ML capabilities, and robust monitoring tools. Users find the pricing transparent and support team responsive.

Features: Databricks is known for its seamless integration with various data sources and platforms, collaborative capabilities, advanced analytics, and machine learning capabilities. On the other hand, Amazon SageMaker offers ease of use, comprehensive machine learning capabilities, seamless integration with other AWS services, customizable workflows, efficient model training and deployment, automated data labeling, and robust monitoring and troubleshooting tools.

Pricing and ROI: Databricks users have reported positive feedback on pricing, setup cost, and licensing. The setup cost is straightforward and hassle-free, while the license terms offer flexibility. Similarly, Amazon SageMaker users find the pricing reasonable, setup cost hassle-free, and licensing process clear and transparent., Users have reported positive outcomes and returns on investment with Databricks, appreciating its impact on efficiency, productivity, and data analysis capabilities. Similarly, Amazon SageMaker delivers positive ROI, providing value and benefits for businesses.

Room for Improvement: Databricks has room for improvement in aspects such as data visualization, monitoring and debugging tools, integration with external data sources and services, documentation and tutorials, and pricing flexibility. In comparison, users have identified areas for enhancement in Amazon SageMaker.

Deployment and customer support: Based on user reviews, there are varying durations required for deploying, setting up, and implementing a new tech solution on both Databricks and Amazon SageMaker. While some users mentioned spending three months on deployment and a week on setup for both products, it is important to evaluate the context to determine if these terms refer to the same period or should be considered separately., Customers have reported positive experiences with both Databricks and Amazon SageMaker customer service. Databricks is praised for its efficiency and proactive approach, while SageMaker is commended for its attentiveness and commitment to customer needs.

The summary above is based on 56 interviews we conducted recently with Databricks and Amazon SageMaker users. To access the review's full transcripts, download our report.

To learn more, read our detailed Amazon SageMaker vs. Databricks Report (Updated: May 2024).
772,679 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"We've had no problems with SageMaker's stability.""The solution's ability to improve work at my organization stems from the ensemble model and a combination of various models it provides.""We were able to use the product to automate processes.""The tool has made client management easier where patients need to upload their health records and we can use the tool to understand details on treatment date, amount, etc.""The Autopilot feature is really good because it's helpful for people who don't have much experience with coding or data pipelines. When we suggest SageMaker to clients, they don't have to go through all the steps manually. They can leverage Autopilot to choose variables, run experiments, and monitor costs. The results are also pretty accurate.""The deployment is very good, where you only need to press a few buttons.""The most valuable feature of Amazon SageMaker is its integration. For example, AWS Lambda. Additionally, we can write Python code.""The superb thing that SageMaker brings is that it wraps everything well. It's got the deployment, the whole framework."

More Amazon SageMaker Pros →

"Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily.""Its lightweight and fast processing are valuable.""The fast data loading process and data storage capabilities are great.""Databricks has helped us have a good presence in data.""Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user.""One of the features provides nice interactive clusters, or compute instances that you don't really need to manage often.""Easy to use and requires minimal coding and customizations.""The integration with Python and the notebooks really helps."

More Databricks Pros →

Cons
"In my opinion, one improvement for Amazon SageMaker would be to offer serverless GPUs. Currently, we incur costs on an hourly basis. It would be beneficial if the tool could provide pay-as-you-go pricing based on endpoints.""The payment and monitoring metrics are a bit confusing not only for Amazon SageMaker but also for the range of other products that fall under AWS, especially for a new user of the product.""The pricing of the solution is an issue...In SageMaker, monitoring could be improved by supporting more data types other than JSON and CSV.""Creating notebook instances for multiple users is pretty expensive in Amazon SageMaker.""Lacking in some machine learning pipelines.""SageMaker would be improved with the addition of reporting services.""The training modules could be enhanced. We had to take in-person training to fully understand SageMaker, and while the trainers were great, I think more comprehensive online modules would be helpful.""Amazon SageMaker could improve in the area of hyperparameter tuning by offering more automated suggestions and tips during the tuning process."

More Amazon SageMaker Cons →

"Databricks can improve by making the documentation better.""Doesn't provide a lot of credits or trial options.""The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well.""Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity.""I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement.""The product could be improved by offering an expansion of their visualization capabilities, which currently assists in development in their notebook environment.""The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes.""The tool should improve its integration with other products."

More Databricks Cons →

Pricing and Cost Advice
  • "The pricing is complicated as it is based on what kind of machines you are using, the type of storage, and the kind of computation."
  • "The support costs are 10% of the Amazon fees and it comes by default."
  • "SageMaker is worth the money for our use case."
  • "Databricks solution is less costly than Amazon SageMaker."
  • "I would rate the solution's price a ten out of ten since it is very high."
  • "There is no license required for the solution since you can use it on demand."
  • "I rate the pricing a five on a scale of one to ten, where one is the lowest price, and ten is the highest price. The solution is priced reasonably. There is no additional cost to be paid in excess of the standard licensing fees."
  • "You don't pay for Sagemaker. You only pay for the compute instances in your storage."
  • More Amazon SageMaker Pricing and Cost Advice →

  • "Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
  • "I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
  • "Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
  • "We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
  • "The pricing depends on the usage itself."
  • "I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
  • "The price is okay. It's competitive."
  • "Databricks uses a price-per-use model, where you can use as much compute as you need."
  • More Databricks Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
    772,679 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:We researched AWS SageMaker, but in the end, we chose Databricks Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It… more »
    Top Answer:We've had experience with unique ML projects using SageMaker. For example, we're developing a platform similar to ChatGPT that requires models. We utilize Amazon SageMaker to create endpoints for… more »
    Top Answer:Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with… more »
    Top Answer:Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their… more »
    Top Answer:Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It is production-ready code, and scheduling the jobs is easy.
    Ranking
    5th
    Views
    10,980
    Comparisons
    8,670
    Reviews
    12
    Average Words per Review
    538
    Rating
    7.3
    1st
    Views
    27,412
    Comparisons
    17,316
    Reviews
    45
    Average Words per Review
    441
    Rating
    8.2
    Comparisons
    Also Known As
    AWS SageMaker, SageMaker
    Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
    Learn More
    Overview

    Amazon SageMaker is a fully-managed platform that enables developers and data scientists to quickly and easily build, train, and deploy machine learning models at any scale. Amazon SageMaker removes all the barriers that typically slow down developers who want to use machine learning.

    Databricks is an industry-leading data analytics platform which is a one-stop product for all data requirements. Databricks is made by the creators of Apache Spark, Delta Lake, ML Flow, and Koalas. It builds on these technologies to deliver a true lakehouse data architecture, making it a robust platform that is reliable, scalable, and fast. Databricks speeds up innovations by synthesizing storage, engineering, business operations, security, and data science.

    Databricks is integrated with Microsoft Azure, Amazon Web Services, and Google Cloud Platform. This enables users to easily manage a colossal amount of data and to continuously train and deploy machine learning models for AI applications. The platform handles all analytic deployments, ranging from ETL to models training and deployment.

    Databricks deciphers the complexities of processing data to empower data scientists, engineers, and analysts with a simple collaborative environment to run interactive and scheduled data analysis workloads. The program takes advantage of AI’s cost-effectivity, flexibility, and cloud storage.

    Databricks Key Features

    Some of Databricks key features include:

    • Cloud-native: Works well on any prominent cloud provider.
    • Data storage: Stores a broad range of data, including structured, unstructured, and streaming.
    • Self-governance: Built-in governance and security controls.
    • Flexibility: Flexible for small-scale jobs as well as running large-scale jobs like Big Data processing because it’s built from Spark and is specifically optimized for Cloud environments.
    • Data science tools: Production-ready data tooling, from engineering to BI, AI, and ML.
    • Familiar languages: While Databricks is Spark-based, it allows commonly used programming languages like R, SQL, Scala, and Python to be used.
    • Team sharing workspaces: Creates an environment that provides interactive workspaces for collaboration, which allow multiple members to collaborate for data model creation, machine learning, and data extraction.
    • Data source: Performs limitless Big Data analytics by connecting to Cloud providers AWS, Azure, and Google, as well as on-premises SQL servers, JSON and CSV.

    Reviews from Real Users

    Databricks stands out from its competitors for several reasons. Two striking features are its collaborative ability and its ability to streamline multiple programming languages.

    PeerSpot users take note of the advantages of these features. A Chief Research Officer in consumer goods writes, “We work with multiple people on notebooks and it enables us to work collaboratively in an easy way without having to worry about the infrastructure. I think the solution is very intuitive, very easy to use. And that's what you pay for.”

    A business intelligence coordinator in construction notes, “The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes.”

    An Associate Manager who works in consultancy mentions, “The technology that allows us to write scripts within the solution is extremely beneficial. If I was, for example, able to script in SQL, R, Scala, Apache Spark, or Python, I would be able to use my knowledge to make a script in this solution. It is very user-friendly and you can also process the records and validation point of view. The ability to migrate from one environment to another is useful.”

    Sample Customers
    DigitalGlobe, Thomson Reuters Center for AI and Cognitive Computing, Hotels.com, GE Healthcare, Tinder, Intuit
    Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
    Top Industries
    REVIEWERS
    Computer Software Company22%
    Manufacturing Company11%
    Logistics Company11%
    Transportation Company11%
    VISITORS READING REVIEWS
    Financial Services Firm17%
    Educational Organization13%
    Computer Software Company11%
    Manufacturing Company8%
    REVIEWERS
    Computer Software Company25%
    Financial Services Firm16%
    Retailer9%
    Manufacturing Company9%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    Computer Software Company12%
    Manufacturing Company9%
    Healthcare Company6%
    Company Size
    REVIEWERS
    Small Business15%
    Midsize Enterprise40%
    Large Enterprise45%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise18%
    Large Enterprise68%
    REVIEWERS
    Small Business27%
    Midsize Enterprise14%
    Large Enterprise59%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise11%
    Large Enterprise71%
    Buyer's Guide
    Amazon SageMaker vs. Databricks
    May 2024
    Find out what your peers are saying about Amazon SageMaker vs. Databricks and other solutions. Updated: May 2024.
    772,679 professionals have used our research since 2012.

    Amazon SageMaker is ranked 5th in Data Science Platforms with 19 reviews while Databricks is ranked 1st in Data Science Platforms with 78 reviews. Amazon SageMaker is rated 7.4, while Databricks is rated 8.2. The top reviewer of Amazon SageMaker writes "Easy to use and manage, but the documentation does not have a lot of information". On the other hand, the top reviewer of Databricks writes "A nice interface with good features for turning off clusters to save on computing". Amazon SageMaker is most compared with Azure OpenAI, Google Vertex AI, Domino Data Science Platform, Dataiku and DataRobot, whereas Databricks is most compared with Informatica PowerCenter, Dataiku, Dremio, Microsoft Azure Machine Learning Studio and Azure Stream Analytics. See our Amazon SageMaker vs. Databricks report.

    See our list of best Data Science Platforms vendors.

    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.