We performed a comparison between Informatica PowerCenter and StreamSets based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."I like the completeness of the way I can build ETL workflows."
"We use Informatica PowerCenter to transfer the transitional database to and from the data warehouse. This is very efficient as it enables us to quickly find our data reports and the data, so we can build AI models."
"Has a good visual tool for data mapping."
"The reliability of the product and the way of orchestration of different services is valuable to us."
"What I like most about Informatica PowerCenter is that it's the best tool in the market for data integration. Currently, I work in L'Oréal, where a new system from SAP is used. Informatica PowerCenter integration with SAP is very, very fast and very, very simple, so you have the server flow from SAP, and through Informatica PowerCenter, you can ingest the data and make that data available for the business more quickly."
"It is easy to use, and it is quick for developing things. It is fairly powerful, and it can integrate with a lot of different platforms without much hassle."
"The most valuable features are the metadata repository and the data warehouse application console."
"It's a very powerful tool you can use to load data, get data, do the drawing between the tables, and put into the packet in a very fast way."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"The best feature that I really like is the integration."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"There is some room for improvement in terms of pricing."
"The solution's commercial cost is very high. Other open-source tools can do the tool's functions for free. The world is moving to the cloud, but the solution hasn't updated its drivers. I presume that its downfall will start soon. The tool is trying to cross-sell or upsell without helping customers derive benefits from the existing products. They have multiple tools and licenses. It is better to bring the smaller tools in one umbrella."
"They should release new versions for the solution's on-premises setup."
"This solution needs the functionality to do batch processing of data. It also lacks connectivity to NoSQL, unstructured data sources."
"While Informatica is great for data-integration, it does not have any analytics features. Thus, organizations have to always look for another product for their BI needs."
"The reputation of Informatica is that it is expensive."
"Areas for improvement in Informatica PowerCenter include scalability and high availability or the clustering configuration because that's still very basic. The elasticity or scaling of the platform needs a lot of improvement. For example, when it comes to DR handling or building an active-active or active-passive cluster, Informatica PowerCenter is still not that powerful. Automation also needs improvement in the solution. Improving automation leads to some improvement in the stability of Informatica PowerCenter and other aspects related to it. What I'd like to see in the next release of Informatica PowerCenter is real-time capability because the solution is mainly for patches, and to have real-time integration, you need to count on some additional components from Informatica. I would expect more integration and a complete platform in terms of real-time capability or patching with minimal interventions or minimal components to be aligned together."
"The pricing could be improved."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"Visualization and monitoring need to be improved and refined."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews while StreamSets is ranked 8th in Data Integration with 24 reviews. Informatica PowerCenter is rated 8.0, while StreamSets is rated 8.4. The top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS and Databricks, whereas StreamSets is most compared with Fivetran, Azure Data Factory, SSIS, IBM InfoSphere DataStage and Oracle GoldenGate. See our Informatica PowerCenter vs. StreamSets report.
See our list of best Data Integration vendors.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.