We performed a comparison between Informatica Cloud Data Integration and StreamSets based on real PeerSpot user reviews.
Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Replication allows us to fully replicate all objects from Shop Floor Data Collection (SFDC) to in-house/on-premises database in one job."
"It has all the advantages of the Cloud in that you can use it without worrying about infrastructure, upkeep, or upgrades."
"The program is stable and scalable."
"I do a quite a lot of data transformations, and the fact that I can do them without changing any of my SQL queries from the code, using the inbuilt tools, is very helpful."
"It has a more UI-based tool, and the scripting is good."
"The solution provides increased efficiency while still being user-friendly and easy to operate."
"The product improves data quality."
"The most valuable feature is the building of mockups and tasks."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It is really easy to set up and the interface is easy to use."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"The best feature that I really like is the integration."
"It could be improved by including a buffer that saves data when there is a connectivity issue."
"It needs to be a little more intuitive but it’s really not bad."
"I would like to see more functionality added so that it is a bit closer to how much you can do with Informatica PowerCenter."
"It would be helpful if there was a GenAI feature integrated into the system, especially regarding the data quality."
"The main issue preventing Brazilian companies from migrating to Informatica Cloud Data Integration from on-prem is the price."
"The error information provided is not informative, as compared to Power Center."
"With the solution, we had some issues, and we have every day, and we used to open a ticket. Sometimes, there are data issues and transformation issues."
"The current features are a bit complicated, and we need to write big scripts and test."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"I would like to see it integrate with other kinds of platforms, other than Java. We're going to have a lot of applications using .NET and other languages or frameworks. StreamSets is very helpful for the old Java platform but it's hard to integrate with the other platforms and frameworks."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
More Informatica Cloud Data Integration Pricing and Cost Advice →
Informatica Cloud Data Integration is ranked 5th in Cloud Data Integration with 40 reviews while StreamSets is ranked 8th in Data Integration with 24 reviews. Informatica Cloud Data Integration is rated 7.8, while StreamSets is rated 8.4. The top reviewer of Informatica Cloud Data Integration writes "A stable, scalable, and user-friendly solution". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, AWS Glue, Fivetran and Mule Anypoint Platform, whereas StreamSets is most compared with Fivetran, Azure Data Factory, Informatica PowerCenter, SSIS and IBM InfoSphere DataStage. See our Informatica Cloud Data Integration vs. StreamSets report.
See our list of best Cloud Data Integration vendors.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.