Informatica Enterprise Data Lake vs StreamSets Comparison 2024

Informatica Enterprise Data Lake: 1 review, 434 views, 413 comparisons

StreamSets: 24 reviews, 4,226 views, 2,398 comparisons

Comparison Buyer's Guide

Data Integration

April 2024

Executive Summary

We performed a comparison between Informatica Enterprise Data Lake and StreamSets based on real PeerSpot user reviews.

Find out what your peers are saying about Microsoft, Informatica, Oracle and others in Data Integration.

To learn more, read our detailed Data Integration Report (Updated: April 2024).

767,847 professionals have used our research since 2012.

Featured Review

Anonymous User

Data Architect at a tech services company

A scalable tool that needs a lot of maintenance due to its unstable nature

Namanya Brian

CEO-founder at Tubayo

Data streams and pipelines help our team identify areas for improvement in our solution

It enables us to create data streams and pipelines that our team can use to identify areas for improvement. Our marketing team can read the data...

Quotes From Members

We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:

Pros

"The process of using the tool's scalability option is well documented."

"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."

"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"

"It is really easy to set up and the interface is easy to use."

"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."

"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."

"The ability to have a good bifurcation rate and fewer mistakes is valuable."

"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."

"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."

Cons

"Informatica Enterprise Data Lake's setup process was complex since it doesn't support a lot of real-time systems."

"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."

"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."

"I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions."

"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."

"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."

"If you use JDBC Lookup, for example, it generally takes a long time to process data."

"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."

"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the 'freeware version.' The user community, as well as the documentation they provide for the standard user, are difficult, at best."

Pricing and Cost Advice

"The licenses attached to the solution are highly priced."

"We are running the community version right now, which can be used free of charge."

"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."

"It has a CPU core-based licensing, which works for us and is quite good."

"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."

"The pricing is good, but not the best. They have some customized plans you can opt for."

"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."

"The overall cost for small and mid-size organizations needs to be better."

"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."

Questions from the Community

What do you like most about Informatica Enterprise Data Lake?

Top Answer: The process of using the tool's scalability option is well documented.

What is your experience regarding pricing and costs for Informatica Enter...

Top Answer: The licenses attached to the solution are highly priced. Informatica has licensing models for every product and for every feature, like the web service feature, which is something my company doesn't…

What needs improvement with Informatica Enterprise Data Lake?

Top Answer: Governance, data dictionary, and data cataloging are not available in Informatica Enterprise Data Lake. A lot of businesses are facing issues related to understanding the area revolving around…

What do you like most about StreamSets?

Top Answer: I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines…

What needs improvement with StreamSets?

Top Answer: StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target. So the ability to validate the data against various data…

What is your primary use case for StreamSets?

Top Answer: We are using StreamSets to migrate our on-premise data to the cloud.

Ranking

Informatica Enterprise Data Lake
Rank: 41st out of 100 in Data Integration
Views: 434
Comparisons: 413
Reviews: 1
Average Words per Review: 832
Rating: 7.0

StreamSets
Rank: 8th out of 100 in Data Integration
Views: 4,226
Comparisons: 2,398
Reviews: 24
Average Words per Review: 1,337
Rating: 8.4

Comparisons

Palantir Foundry vs. Informatica Enterprise Data Lake: compared 100% of the time.

Fivetran vs. StreamSets: compared 12% of the time.

Azure Data Factory vs. StreamSets: compared 11% of the time.

Informatica PowerCenter vs. StreamSets: compared 10% of the time.

SSIS vs. StreamSets: compared 7% of the time.

IBM InfoSphere DataStage vs. StreamSets: compared 5% of the time.

Also Known As

Informatica Intelligent Data Lake, Intelligent Data Lake

Overview

The Intelligent Data Lake enables raw big data to be systematically transformed into fit-for-purpose data sets for a variety of data consumers. Data scientists and analysts can quickly find the data they’re looking for using semantic and faceted search. They can see data profiles, lineage, and other relationships to know whether they can trust the data and whether it’s fit-for-use in their analytic projects.

StreamSets is a data integration platform that enables organizations to efficiently move and process data across various systems. It offers a user-friendly interface for designing, deploying, and managing data pipelines, allowing users to easily connect to various data sources and destinations. StreamSets also provides real-time monitoring and alerting capabilities, ensuring that data is flowing smoothly and any issues are quickly addressed.

Sample Customers

Informatica Enterprise Data Lake: Information not available

StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge

Top Industries

Informatica Enterprise Data Lake, visitors reading reviews:
Financial Services Firm: 15%
Manufacturing Company: 11%
Computer Software Company: 11%
Healthcare Company: 9%

StreamSets, reviewers:
Financial Services Firm: 20%
Energy/Utilities Company: 20%
Comms Service Provider: 13%
Computer Software Company: 13%

StreamSets, visitors reading reviews:
Financial Services Firm: 17%
Computer Software Company: 13%
Manufacturing Company: 8%
Government: 7%

Company Size

Informatica Enterprise Data Lake, visitors reading reviews:
Small Business: 11%
Midsize Enterprise: 11%
Large Enterprise: 78%

StreamSets, reviewers:
Small Business: 40%
Midsize Enterprise: 12%
Large Enterprise: 48%

StreamSets, visitors reading reviews:
Small Business: 16%
Midsize Enterprise: 11%
Large Enterprise: 73%

Informatica Enterprise Data Lake is ranked 41st in Data Integration with 1 review while StreamSets is ranked 8th in Data Integration with 24 reviews. Informatica Enterprise Data Lake is rated 7.0, while StreamSets is rated 8.4. The top reviewer of Informatica Enterprise Data Lake writes "A scalable tool that needs a lot of maintenance due to its unstable nature". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Informatica Enterprise Data Lake is most compared with Palantir Foundry, whereas StreamSets is most compared with Fivetran, Azure Data Factory, Informatica PowerCenter, SSIS and IBM InfoSphere DataStage.

We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
