We performed a comparison between Informatica PowerCenter and StreamSets based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."It reduces a lot of legacy coding."
"Deployment was simple and straightforward."
"Informatica PowerCenter is a very good ETL tool."
"The greatest feature is that it is very easy to have someone come in and jump right in. It is one of the nicest tools in terms of getting a person acquainted quickly."
"We can scale the product."
"I would recommend that others considering the solution go ahead and use it for any batch and high volume loads with complex transactions."
"It's a very powerful tool you can use to load data, get data, do the drawing between the tables, and put into the packet in a very fast way."
"Has a good visual tool for data mapping."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The best feature that I really like is the integration."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"The pricing could be improved."
"The price of the product is an area of concern where improvements are required, considering the fact that the present licensing charges of the tool are expensive."
"Include more instruments for LOGs analysis, interpretation, and job corrections."
"We had stability issues, mostly with JVM size."
"The performance of Informatica PowerCenter could improve."
"If we could have the option of performance improvement within Informatica, and if it could have more features, that would be ideal."
"As a connector to big data, it is not well developed. We've had problems connecting Informatica with Hadoop. The functionality to connect Informatica with Hadoop, for me it's not good."
"This solution needs the functionality to do batch processing of data. It also lacks connectivity to NoSQL, unstructured data sources."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions."
Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews while StreamSets is ranked 8th in Data Integration with 24 reviews. Informatica PowerCenter is rated 8.0, while StreamSets is rated 8.4. The top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS and Databricks, whereas StreamSets is most compared with Fivetran, Azure Data Factory, SSIS, Oracle GoldenGate and IBM InfoSphere DataStage. See our Informatica PowerCenter vs. StreamSets report.
See our list of best Data Integration vendors.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.