We performed a comparison between RapidMiner and Talend Data Quality based on real PeerSpot user reviews.
Find out what your peers are saying about Alteryx, RapidMiner, SAP and others in Predictive Analytics."What I like about RapidMiner is its all-in-one nature, which allows me to prepare, extract, transform, and load data within the same tool."
"The GUI capabilities of the solution are excellent. Their Auto ML model provides for even non-coder data scientists to deploy a model."
"We value the collaboration and governance features because it's a comprehensive platform that covers everything from data extraction to modeling operations in the ML language. RapidMiner is competitive in the ML space."
"Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
"It is easy to use and has a huge community that I can rely on for help. Moreover, it is interactive."
"I like not having to write all solutions from code. Being able to drag and drop controls, enables me to focus on building the best model, without needing to search for syntax errors or extra libraries."
"The most valuable features are the Binary classification and Auto Model."
"Using the GUI, I can have models and algorithms drag and drop nodes."
"I really like the fact that there are no out-of-the-box solutions regarding the development of jobs. Other vendors may have modules which cleanse your addresses. In Talend, you have the freedom to completely develop the process yourself. This can be tricky, but it also makes it fun."
"The numerous components provided by Talend mean you’re able to create jobs quickly and efficiently."
"The solution is customizable."
"It has definitely streamlined certain processes."
"It reduces the QA effort immensely by handling most of the test scenarios in a reusable way."
"The most valuable feature lies in the capability to assign data quality issues to different stakeholders, facilitating the tracking and resolution of defective work."
"We have used value frequency and patterns. We have been it impressed with these functions as they have helped us in making decisions in transformation work."
"tLogRows are also great for finding bad data."
"Many things in the interface look nice, but they aren't of much use to the operator. It already has lots of variables in there."
"Improve the online data services."
"In the Mexican or Latin American market, it's kind of pricey."
"The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team. If you made decisions within this machine learning model, you need to explain why you did so. It would better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator."
"A great product but confusing in some way with regard to the user interface and integration with other tools."
"In terms of the UI and SaaS, the user interface with KNIME is more appealing than RapidMiner."
"I think that they should make deep learning models easier."
"It would be helpful to have some tutorials on communicating with Python."
"The ability to change the code when debugging the JavaScript could be improved."
"Heap space issues plague us consistently. We maxed it out and it runs fine, then it doesn’t, then it does."
"SQL for displaying underlying data in non-match results does not work."
"It would be more helpful if it offered dynamic dashboards that could be directly used by clients for better analysis."
"They don't have any AI capabilities. Talend DQ is specifically for data quality, which only has data profiling. With Talend DQ, I cannot generate any reports today, so I need an ETL tool. It provides general Excel files, or I have to create some views. If instead of buying a new tool, Talend provides a reporting capability or solution, it would be great. It will reduce the development effort for creating these kinds of reports. We also manage the infrastructure for Talend. From the licensing perspective, for cloud, they only have seat licenses where one person is tied to one license, but for on-premise, they have concurrent licenses. It would be really awesome if they can provide concurrent licenses for the cloud so that if one person is not there, somebody else can use that license. Currently, it is not possible unless a person deactivates his or her license and moves the same seat license to someone else. We are one of the biggest customers in the central zone of the US for Talend, and this is the feedback that we have provided them again and again, but they come back and say that they aren't able to provide concurrent licenses on the cloud. In version 7.3, there is a feature for tokenization and de-tokenization of data. This is the feature that we are looking for. It is useful if somebody wants to see what we have masked and how do we demask it. This feature is not there in version 7.1. There are also a few other capabilities on the cloud, but we don't yet have a big footprint in the cloud."
"In redundancy analysis, the query is failing to bring non-matched records. This query is an internal script. There is no way (that I know of) to fix this syntax error for future runs."
"There are too many functions which could be streamlined."
"In terms of the solution's technical support, the interactions were satisfactory, but there is room for improvement, especially in managing expectations."
RapidMiner is ranked 2nd in Predictive Analytics with 19 reviews while Talend Data Quality is ranked 4th in Data Quality with 20 reviews. RapidMiner is rated 8.6, while Talend Data Quality is rated 8.0. The top reviewer of RapidMiner writes "Offers good tutorials that make it easy to learn and use, with a powerful feature to compare machine learning algorithms". On the other hand, the top reviewer of Talend Data Quality writes "Saves a lot of time, good ROI, seamless integration with different databases, and stable". RapidMiner is most compared with KNIME, Alteryx, Dataiku Data Science Studio, Tableau and Microsoft Azure Machine Learning Studio, whereas Talend Data Quality is most compared with Ataccama DQ Analyzer, Informatica Data Quality, Alteryx, Precisely Trillium and Informatica Cloud Data Quality.
We monitor all Predictive Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.