We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"It's helpful if you want to make informed decisions using data. We can take the information, tease out the attributes, and label everything. It's suitable for profiling and forecasting in any industry."
"The most valuable features are the Binary classification and Auto Model."
"The most valuable feature is what the product sets out to do, which is extracting information and data."
"The data science, collaboration, and IDN are very, very strong."
"The best part of RapidMiner is efficiency."
"The GUI capabilities of the solution are excellent. Their Auto ML model provides for even non-coder data scientists to deploy a model."
"Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
"RapidMiner is very easy to use."
"The features that I find to be the most valuable are the extensibility, the integration, and the ease of integration with multiple platforms."
"It is saving a lot of time. Today, we can mask around a hundred million records in 10 minutes. Masking is one of the key pieces that is used heavily by the business and IT folks. Normally in the software development life cycle, before you project anything into the production environment, you have to test it in the test environment to make sure that when the data goes into production, it works, but these are all production files. For example, we acquired a new company or a new state for which we're going to do the entire back office, which is related to claims processing, payments, and member enrollment every year. If you get the production data and process it again, it becomes a compliance issue. Therefore, for any migrations that are happening, we have developed a new capability called pattern masking. This feature looks at those files, masks that information, and processes it through the system. With this, there is no PHI and PII element, and there is data integrity across different systems. It has seamless integration with different databases. It has components using which you can easily integrate with different databases on the cloud or on-premise. It is a drag and drop kind of tool. Instead of writing a lot of Java code or SQL queries, you can just drag and drop things. It is all very pictorial. It easily tells you where the job is failing. So, you can just go quickly and figure out why it is happening and then fix it."
"RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models."
"Many things in the interface look nice, but they aren't of much use to the operator. It already has lots of variables in there."
"In the Mexican or Latin American market, it's kind of pricey."
"The visual interface could use something like the drag-and-drop features which other products already support. Some additional features can make RapidMiner a better tool and maybe more competitive."
"A great product but confusing in some way with regard to the user interface and integration with other tools."
"I would like to see all users have access to all of the deep learning models, and that they can be used easily."
"It would be helpful to have some tutorials on communicating with Python."
"The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team. If you made decisions within this machine learning model, you need to explain why you did so. It would be better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator."
"They don't have any AI capabilities. Talend DQ is specifically for data quality, which only has data profiling. With Talend DQ, I cannot generate any reports today, so I need an ETL tool. It provides general Excel files, or I have to create some views. If instead of buying a new tool, Talend provides a reporting capability or solution, it would be great. It will reduce the development effort for creating these kinds of reports. We also manage the infrastructure for Talend. From the licensing perspective, for cloud, they only have seat licenses where one person is tied to one license, but for on-premise, they have concurrent licenses. It would be really awesome if they can provide concurrent licenses for the cloud so that if one person is not there, somebody else can use that license. Currently, it is not possible unless a person deactivates his or her license and moves the same seat license to someone else. We are one of the biggest customers in the central zone of the US for Talend, and this is the feedback that we have provided them again and again, but they come back and say that they aren't able to provide concurrent licenses on the cloud. In version 7.3, there is a feature for tokenization and de-tokenization of data. This is the feature that we are looking for. It is useful if somebody wants to see what we have masked and how do we demask it. This feature is not there in version 7.1. There are also a few other capabilities on the cloud, but we don't yet have a big footprint in the cloud."
"I would say that some of the support elements need improvement."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
"It's a subscription-based platform; we renew it every year."
"It is cheaper than Informatica. Talend Data Quality costs somewhere between $10,000 and $12,000 per year for a seat license. It would cost around $20,000 per year for a concurrent license. It is the same for the whole big data solution, which comes with Talend DI, Talend DQ, and TDM."
RapidMiner's unified data science platform accelerates the building of complete analytical workflows - from data prep to machine learning to model validation to deployment - in a single environment, improving efficiency and shortening the time to value for data science projects.
RapidMiner is ranked 3rd in Predictive Analytics with 9 reviews while Talend Data Quality is ranked 4th in Data Quality with 3 reviews. RapidMiner is rated 8.6, while Talend Data Quality is rated 9.0. The top reviewer of RapidMiner writes "Offers good tutorials that make it easy to learn and use, with a powerful feature to compare machine learning algorithms". On the other hand, the top reviewer of Talend Data Quality writes "Saves a lot of time, good ROI, seamless integration with different databases, and stable". RapidMiner is most compared with KNIME, Alteryx, Microsoft Azure Machine Learning Studio, Dataiku Data Science Studio and TIBCO Statistica, whereas Talend Data Quality is most compared with Alteryx, Ataccama DQ Analyzer, Informatica Data Quality, Microsoft Data Quality Services and Melissa Data Quality.
We monitor all Predictive Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.