Informatica Cloud Data Integration vs StreamSets comparison

Cancel
You must select at least 2 products to compare!
Informatica Logo
3,500 views|2,815 comparisons
88% willing to recommend
StreamSets Logo
4,200 views|2,349 comparisons
100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Informatica Cloud Data Integration and StreamSets based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Informatica Cloud Data Integration vs. StreamSets Report (Updated: March 2024).
770,394 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"It has a more UI-based tool, and the scripting is good.""It is quite easy to use and flexible.""Their new licensing is very flexible. With Informatica Cloud, you have plenty of items under the same umbrella, such as services, offerings, data quality, and data masking. You have also got master data management and API management. What I really like about them is that you don't need to go to Informatica and say that you need a data integration module. You would say that you need iPaaS or Informatica Cloud. They'll then try to understand your needs and give you IPUs, which are the processing units. If I purchased a hundred IPUs from Informatica as a customer, I can use 70 IPUs for data integration. I would also need data quality, so I can use 10 IPUs for data quality. I can use the remaining 20 IPUs for API management. Down the line, if I see that my initial data integration needs for the development phase are met, then out of the 70 IPUs assigned for data integration, I can use 30 IPUs for data masking. I can shuffle these numbers in any way within the Informatica Cloud umbrella for the tenure for which I have subscribed to these IPUs. I can use all services the way I want. This flexibility is what I really love about Informatica. It also has got good connectors.""Whether we need data cleansing or data mastering, we get it all in one platform.""The most valuable features of Informatica Cloud Data Integration for our clients are the AI capabilities within Informatica Intelligent Cloud Services.""The solution provides increased efficiency while still being user-friendly and easy to operate.""The mass ingestion functionality and the elasticity of the solution are great.""The Mapping Designer allows for declarative ETL development (visual scripting) that leverages a wide array of different transformations."

More Informatica Cloud Data Integration Pros →

"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them.""Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now.""StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.""What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes.""The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily.""In StreamSets, everything is in one place.""The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up.""StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."

More StreamSets Pros →

Cons
"It could be improved by including a buffer that saves data when there is a connectivity issue.""There may be some types of limitations with the performance.""There is room for improvement at the highest level in terms of useability and connectors for various types of new applications. The row processing performance could be better because you experience some latency dealing with high volumes of data. Most organizations will be dealing with multiple cloud applications, so you could see performance issues moving from one system to another.""Error reporting and debugging need improvement.""I have received feedback from certain teams and there is a steep learning curve to use this solution.""The cloud version of the Informatica, it's a very substandard product. They might say it's enterprise-ready but it's not at all ready. They need to add more features, such as improved data replication features. If you look at other tools, such as Matillion they are now cloud-native and flexible. Additionally, Informatica Cloud Data Integration should have a good migration strategy from Informatica PowerCenter to Informatica Cloud Data Integration.""I would like to see more functionality added so that it is a bit closer to how much you can do with Informatica PowerCenter.""The error information provided is not informative, as compared to Power Center."

More Informatica Cloud Data Integration Cons →

"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there.""I would like to see it integrate with other kinds of platforms, other than Java. We're going to have a lot of applications using .NET and other languages or frameworks. StreamSets is very helpful for the old Java platform but it's hard to integrate with the other platforms and frameworks.""I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions.""Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using.""If you use JDBC Lookup, for example, it generally takes a long time to process data.""Using ETL pipelines is a bit complicated and requires some technical aid.""Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful.""Visualization and monitoring need to be improved and refined."

More StreamSets Cons →

Pricing and Cost Advice
  • "It is cost effective and an easily accessible tool."
  • "The pricing structure is good, but having to pay for extra drivers to be used in an ICS environment makes me a little nervous."
  • "Licensing is difficult to understand, but the team is always available to explain anything. They are very helpful."
  • "My understanding is that Informatica is quite expensive compare to other tools that are available in the market."
  • "Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
  • "Its pricing model can be improved."
  • "I'm not sure about the most recent pricing trends, but I don't believe it's significantly different from PowerCenter. I believe it is nearly the same."
  • "The price of Informatica Cloud Data Integration could be reduced."
  • More Informatica Cloud Data Integration Pricing and Cost Advice →

  • "We are running the community version right now, which can be used free of charge."
  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • "The pricing is good, but not the best. They have some customized plans you can opt for."
  • "We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
  • "The overall cost for small and mid-size organizations needs to be better."
  • "There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
  • More StreamSets Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    770,394 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge… more »
    Top Answer:When it comes to cloud data integration, this solution can provide you with multiple benefits, including Overhead reduction by integrating data on any cloud in various ways Effective integration of… more »
    Top Answer:The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize… more »
    Top Answer:We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was… more »
    Top Answer:StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data is… more »
    Ranking
    5th
    Views
    3,500
    Comparisons
    2,815
    Reviews
    18
    Average Words per Review
    459
    Rating
    7.7
    8th
    out of 101 in Data Integration
    Views
    4,200
    Comparisons
    2,349
    Reviews
    22
    Average Words per Review
    1,306
    Rating
    8.4
    Comparisons
    Learn More
    StreamSets
    Video Not Available
    Overview

    Informatica Cloud Data Integration is a cloud-native cloud data integration solution that enables users to connect a large number of applications and data sources across on-premises and integrate the data sources at scale on the cloud. The product is built on microservices-driven management and integration platform as a service (iPaaS) and assists organizations to govern costs, increase productivity and collaboration, and simplify their experience. Informatica Cloud Data Integration allows companies to deliver data and analytics to lines of business in a timely manner, build data warehouses on Amazon Redshift, Google Cloud BigQuery, Snowflake, and Microsoft Azure Synapse Analytics, and utilize the required data integration patterns, including elastic processing, extract, load, and transform (ELT), and extract, transform, and load (ETL).

    The solution allows users to to build enterprise-scale integration workloads within hours while it improves the productivity of development teams by providing them a codeless, drag-and-drop user interface. Companies can benefit from integration features built for data warehousing and optimized connectors for bulk loads of billions of records. Informatica Cloud Data Integration offers organizations the option of going serverless at scale by allowing them to process data integration jobs from cloud-hosted as well as managed environments. The Spark-based engine allows the solution to handle high-volume data demands and complex data integration tasks.

    Informatica Cloud Data Integration Features

    Informatica Cloud Data Integration provides its users with various features and tools. Among the key capacities of the product are:

    • Advanced Pushdown Optimization: Informatica Cloud Data Integration offers a feature that provides users with the benefits of ELT while maintaining their data flow definitions at a logical or abstract level. This feature allows users to choose a runtime option that complies with the workload as well as send their data processing work to cloud ecosystem pushdown, cloud data warehouse pushdown, Spark serverless processing, or traditional ETL.

    • Connectors for all major data sources: This feature provides out-of-the-box connectivity to a large number of cloud and on-premise systems, data stores, analytics and BI tools, and enterprise and middleware applications.

    • Data transformation capabilities: This feature allows users to process data transformation in real time or batch by using a variety of transformation types, such as cleansing, masking, aggregation, fileting, parsing, and ranking.

    • Spark-based complex data integration: Informatica Cloud Data Integration Elastic allows specialists to use elastic clusters to process their data transformation.

    • Codeless integration: This feature facilitates the creation of simple-to-sophisticated data integration projects with a visual mapping designer that speeds up pre-build transformations for development through a variety of endpoints across cloud and on-premises.

    • Serverless data integration: Users can achieve cloud data integration in a mode called Advanced Serverless, where they can benefit from a fully managed environment with no software, no cloud administration, and no servers or clusters to manage.

    • Taskflow orchestration: This feature allows users to combine batch and real-time integration through a taskflow designer in order to create simple-to-sophisticated orchestrations.

    • Intelligent structure discovery: This feature uses the CLAIRE engine to automatically understand the parsing model for complicated files based on their structure.

    • Change data capture: Utilizing the prebuilt task wizards and Change Data Capture tool, users can automatically pull only the updated or incremental data from source systems to the targets on a frequent basis.

    • Security: The product offers various features which ensure the highest level of data and workload security and comply with various policies.

    Informatica Cloud Data Integration Benefits

    Informatica Cloud Data Integration brings multiple benefits to its users. These include:

    • The product offers optimized connectivity to various systems through custom build-connectors.

    • Users can benefit from improved elasticity and performance by utilizing Spark clusters and auto-tuning.

    • The tool allows developers to focus on business logic by facilitating infrastructure management through serverless deployment features.

    • Informatica Data Cloud Integration provides user flexibility by connecting to any database, cloud data lake, on-premise apps, and data warehouses.

    • Through a zero-coding environment and role-appropriate user experience, the solution is suitable for all types of users.

    • The solution offers consistent experience and unified metadata across all cloud services.

    • Users can leverage enterprise-level performance for integration design with no coding required.

    • Informatica Data Cloud Integration scales as a business grows, providing a high level of adaptability.

    Reviews from Real Users

    Divya R., a senior consultant at Deloitte, rates Informatica Cloud Data Integration highly because it is a UI-based tool with great scripting.

    A data architect at a retailer likes Informatica Cloud Data Integration because of its flexible licensing, good connectors, and timely upgrades and patches.

    StreamSets is a data integration platform that enables organizations to efficiently move and process data across various systems. It offers a user-friendly interface for designing, deploying, and managing data pipelines, allowing users to easily connect to various data sources and destinations. StreamSets also provides real-time monitoring and alerting capabilities, ensuring that data is flowing smoothly and any issues are quickly addressed.

    Sample Customers
    Chicago Cubs, Telegraph Media Group
    Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Top Industries
    REVIEWERS
    Computer Software Company37%
    Pharma/Biotech Company21%
    Manufacturing Company11%
    Financial Services Firm5%
    VISITORS READING REVIEWS
    Financial Services Firm16%
    Computer Software Company14%
    Manufacturing Company9%
    Insurance Company8%
    REVIEWERS
    Financial Services Firm20%
    Energy/Utilities Company20%
    Comms Service Provider13%
    Computer Software Company13%
    VISITORS READING REVIEWS
    Financial Services Firm17%
    Computer Software Company13%
    Manufacturing Company8%
    Government7%
    Company Size
    REVIEWERS
    Small Business21%
    Midsize Enterprise21%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business40%
    Midsize Enterprise12%
    Large Enterprise48%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise11%
    Large Enterprise73%
    Buyer's Guide
    Informatica Cloud Data Integration vs. StreamSets
    March 2024
    Find out what your peers are saying about Informatica Cloud Data Integration vs. StreamSets and other solutions. Updated: March 2024.
    770,394 professionals have used our research since 2012.

    Informatica Cloud Data Integration is ranked 5th in Cloud Data Integration with 40 reviews while StreamSets is ranked 8th in Data Integration with 24 reviews. Informatica Cloud Data Integration is rated 7.8, while StreamSets is rated 8.4. The top reviewer of Informatica Cloud Data Integration writes "A stable, scalable, and user-friendly solution". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, AWS Glue, Fivetran and Mule Anypoint Platform, whereas StreamSets is most compared with Fivetran, Azure Data Factory, Informatica PowerCenter, SSIS and IBM InfoSphere DataStage. See our Informatica Cloud Data Integration vs. StreamSets report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.