Cloudera Data Science Workbench vs KNIME comparison

Cancel
You must select at least 2 products to compare!
Cloudera Logo
2,070 views|1,837 comparisons
66% willing to recommend
Knime Logo
10,966 views|7,554 comparisons
93% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cloudera Data Science Workbench and KNIME based on real PeerSpot user reviews.

Find out what your peers are saying about Databricks, Microsoft, Alteryx and others in Data Science Platforms.
To learn more, read our detailed Data Science Platforms Report (Updated: April 2024).
770,292 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast.""The Cloudera Data Science Workbench is customizable and easy to use."

More Cloudera Data Science Workbench Pros →

"Key features include: very easy-to-use visual interface; Help functions and clear explanations of the functionalities and the used algorithms; Data Wrangling and data manipulation functionalities are certainly sufficient, as well as the looping possibilities which help you to automate parts of the analysis.""The product is open-source and therefore free to use.""It can handle an unlimited amount of data, which is the advantage of using Knime.""The most valuable feature is the data wrangling, which is what I mainly use it for.""Automation is most valuable. It allows me to automatically download information from different sources, and once I create a workflow, I can apply it anytime I want. So, there is efficiency at the same time.""It's a coding-less opportunity to use AI. This is the major value for me.""KNIME is easy to learn.""Stability is excellent. I would give it a nine out of ten."

More KNIME Pros →

Cons
"The tool's MLOps is not good. It's pricing also needs to improve.""Running this solution requires a minimum of 12GB to 16GB of RAM."

More Cloudera Data Science Workbench Cons →

"It needs more examples, use cases, and MOOC to learn, especially with respect to the algorithms and how to practically create a flow from end-to-end.""The predefined workflows could use a bit of improvement.""The ability to handle large amounts of data and performance in processing need to be improved.""The program is not fit for handling very large files or databases (greater than 1GB); it gets too slow and has a tendency to crash easily.""The overall user experience feels unpolished. In particular: Data field type conversion is a real hassle, and date fields are a hassle; documentation is pretty poor; user community is average at best.""The license is quite expensive for us.""The main issue with KNIME is that it sometimes uses too much CPU and RAM when working with large amounts of data.""KNIME needs to provide more documentation and training materials, including webinars or online seminars."

More KNIME Cons →

Pricing and Cost Advice
  • "It is free of cost. It is GNU licensed."
  • "KNIME desktop is free, which is great for analytics teams. Server is well priced, depending on how much support is required."
  • "KNIME is free as a stand-alone desktop-based platform but if you want to get a KNIME server then you can find the cost on their website."
  • "The price of KNIME is quite reasonable and the designer tool can be used free of charge."
  • "It's an open-source solution."
  • "The price for Knime is okay."
  • "At this time, I am using the free version of Knime."
  • "This is an open-source solution that is free to use."
  • More KNIME Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
    770,292 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to… more »
    Top Answer:The tool's MLOps is not good. It's pricing also needs to improve.
    Top Answer:We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in our… more »
    Top Answer:Since KNIME is a no-code platform, it is easy to work with.
    Top Answer:We're using the free academic license just locally. I went for KNIME because they have a free academic license. And to be honest, I never bothered to check the prices.
    Top Answer:KNIME is not good at visualization. I would like to see NLQ (Natural language query) and automated visualizations added to KNIME.
    Ranking
    18th
    Views
    2,070
    Comparisons
    1,837
    Reviews
    1
    Average Words per Review
    353
    Rating
    6.0
    4th
    Views
    10,966
    Comparisons
    7,554
    Reviews
    21
    Average Words per Review
    501
    Rating
    7.9
    Comparisons
    Also Known As
    CDSW
    KNIME Analytics Platform
    Learn More
    Overview

    Cloudera Data Science Workbench (CDSW) makes secure, collaborative data science at scale a reality for the enterprise and accelerates the delivery of new data products. With CDSW, organizations can research and experiment faster, deploy models easily and with confidence, as well as rely on the wider Cloudera platform to reduce the risks and costs of data science projects. Access any data anywhere – from cloud object storage to data warehouses, CDSW provides connectivity not only to CDH but the systems your data science teams rely on for analysis.

    KNIME is an open-source analytics software used for creating data science that is built on a GUI based workflow, eliminating the need to know code. The solution has an inherent modular workflow approach that documents and stores the analysis process in the same order it was conceived and implemented, while ensuring that intermediate results are always available. 

    KNIME supports Windows, Linux, and Mac operating systems and is suitable for enterprises of all different sizes. With KNIME, you can perform functions ranging from basic I/O to data manipulations, transformations and data mining. It consolidates all the functions of the entire process into a single workflow. The solution covers all main data wrangling and machine learning techniques, and is based on visual programming.

    KNIME Features

    KNIME has many valuable key features. Some of the most useful ones include:

    • Scalability through data handling (intelligent automatic caching of data in the background while maximizing throughput performance)
    • High extensibility via a well-defined API for plugin extensions
    • Intuitive user interface
    • Import/export of workflows
    • Parallel execution on multi-core systems
    • Command line version for "headless" batch executions
    • Activity dashboard
    • Reporting & statistics
    • Third-party integrations
    • Workflow management
    • Local automation
    • Metanode linking
    • Tool blending
    • Big Data extensions

    KNIME Benefits

    There are many benefits to implementing KNIME. Some of the biggest advantages the solution offers include:

    • Integrated Deployment: KNIME’s integrated deployment moves both the selected model, and the entire data model preparation process into production simply and automatically, allowing for continuous optimization in production and also saving time because it eliminates error.
    • Elastic and Hybrid Execution: KNIME’s elastic and hybrid executions helps you reduce costs while covering periods of high demand, dynamically.
    • Metadata Mapping: KNIME enables complete metadata mapping of all aspects of your workflow. In addition, KNIME offers blueprint workflows for documenting the nodes, data sources, and libraries used, as well as runtime information.
    • Guided Analytics: KNIME’s guided analytics applications can be customized based on reusable components.
    • Powerful analytics, local automation, and workflow difference: KNIME uses advanced predictive and machine learning algorithms to provide you with the analytics you need. In combination with powerful analytics, KNIME’s automation capabilities and workflow difference prepare your organization with the tools you need to make better business decisions.
    • Supports enterprise-wide data science practices: The deployment and management functionalities of KNIME make it easy to productionize data science applications and services, and deliver usable, reliable, and reproducible insights for the business.
    • Helps you leverage insights gained from your data: Using KNIME ensures the data science process immediately reflects changing requirements or new insights.

    Reviews from Real Users

    Below are some reviews and helpful feedback written by PeerSpot users currently using the KNIME solution.

    An Emeritus Professor at a university says, “It can read many different file formats. It can very easily tidy up your data, deleting blank rows, and deleting rows where certain columns are missing. It allows you to make lots of changes internally, which you do using JavaScript to put in the conditional. It also has very good fundamental machine learning. It has decision trees, linear regression, and neural nets. It has a lot of text mining facilities as well. It's fairly fully-featured.”

    Benedikt S., CEO at SMH - Schwaiger Management Holding GmbH, explains, “All of the features related to the ETL are fantastic. That includes the connectors to other programs, databases, and the meta node function. Technical support has been extremely responsive so far. The solution has a very strong and supportive community that shares information and helps each other troubleshoot. The solution is very stable. The initial setup is pretty simple and straightforward.”

    Piotr Ś., Test Engineer at ProData Consult, says, “What I like the most is that it works almost out of the box with Random Forest and other Forest nodes.”

    Sample Customers
    IQVIA, Rush University Medical Center, Western Union
    Infocom Corporation, Dymatrix Consulting Group, Soluzione Informatiche, MMI Agency, Estanislao Training and Solutions, Vialis AG
    Top Industries
    VISITORS READING REVIEWS
    Financial Services Firm31%
    Healthcare Company10%
    Computer Software Company8%
    Manufacturing Company7%
    REVIEWERS
    University25%
    Comms Service Provider17%
    Retailer14%
    Government8%
    VISITORS READING REVIEWS
    Manufacturing Company12%
    Financial Services Firm11%
    Computer Software Company9%
    Educational Organization8%
    Company Size
    VISITORS READING REVIEWS
    Small Business9%
    Midsize Enterprise12%
    Large Enterprise80%
    REVIEWERS
    Small Business28%
    Midsize Enterprise26%
    Large Enterprise46%
    VISITORS READING REVIEWS
    Small Business19%
    Midsize Enterprise14%
    Large Enterprise67%
    Buyer's Guide
    Data Science Platforms
    April 2024
    Find out what your peers are saying about Databricks, Microsoft, Alteryx and others in Data Science Platforms. Updated: April 2024.
    770,292 professionals have used our research since 2012.

    Cloudera Data Science Workbench is ranked 18th in Data Science Platforms with 2 reviews while KNIME is ranked 4th in Data Science Platforms with 50 reviews. Cloudera Data Science Workbench is rated 7.0, while KNIME is rated 8.2. The top reviewer of Cloudera Data Science Workbench writes "Useful for data science modeling but improvement is needed in MLOps and pricing ". On the other hand, the top reviewer of KNIME writes "A low-code platform that reduces data mining time by linking script". Cloudera Data Science Workbench is most compared with Databricks, Amazon SageMaker, Microsoft Azure Machine Learning Studio, Dataiku Data Science Studio and SAS Enterprise Miner, whereas KNIME is most compared with RapidMiner, Microsoft Power BI, Alteryx, Dataiku Data Science Studio and Weka.

    See our list of best Data Science Platforms vendors.

    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.