We performed a comparison between StreamSets and Talend Data Management Platform based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"It's very easy to integrate. It integrates with Snowflake, AWS, Google Cloud, and Azure. It's very helpful for DevOps, DataOps, and data engineering because it provides a comprehensive solution, and it's not complicated."
"The ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"The features that I like the most are the simplicity of the interface, and the ability to quickly develop with a predefined component."
"The basic tools are easy to pick up and understand."
"I like everything about this product, but the biggest thing is the ease of use."
"They're very competitive in terms of performance, which is a good selling point. It has very rich features. It provides a very rich feature set in the application."
"Talend Studio has the ability to connect to almost anything to integrate data from files, databases, web services, etc."
"The solution can run on any machine and that is a big advantage."
"The most valuable feature is the data loading and scripting language."
"I like the way that you can use the context variables, and how you can work those context variables to give you values and settings for every development environment, such as PROD, TEST, and DEV."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The documentation from version to version could be more accurate."
"The sales and market department could improve the Talend Data Management Platform."
"We'd like to see more connectors in the future."
"I think they should drive toward AI and machine learning. They could include a machine-learning algorithm for the deduplication."
"The stability is good, but the performance is slower when I work on a huge amount of data."
"Performance and speed could be improved."
"I would like to sync a project and do an upload from that current version, and then from GitLab, be able to download the latest one."
"I'd be interested in seeing the running of Python programs and transformations from within the studio itself."
StreamSets is ranked 8th in Data Integration with 24 reviews while Talend Data Management Platform is ranked 21st in Data Integration with 18 reviews. StreamSets is rated 8.4, while Talend Data Management Platform is rated 8.2. The top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". On the other hand, the top reviewer of Talend Data Management Platform writes "Built for everything and packed with features but there are some monitoring limitations". StreamSets is most compared with Fivetran, Informatica PowerCenter, Azure Data Factory, SSIS and IBM InfoSphere DataStage, whereas Talend Data Management Platform is most compared with Talend Open Studio, Talend Data Fabric, SAP Data Services, Collibra Catalog and SSIS. See our StreamSets vs. Talend Data Management Platform report.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.