StreamSets Primary Use Case

Reyansh Kumar - PeerSpot reviewer
Technical Specialist at Accenture

Our company builds products mainly for healthcare divisions and we use StreamSets for all our data engineering tasks.

Prateek Agarwal - PeerSpot reviewer
Manager at Indian Institute of Management Visakhapatnam

We are working on a very large data analytics project in which we are integrating large data sets from multiple sources into a platform. We need to create data pipelines. We are using StreamSets for all the data integration activities: creating the pipelines, monitoring them, and running all the data processes smoothly.

Nantabo Jackie - PeerSpot reviewer
Sales Manager at Soft Hostings Limited

I use StreamSets to develop data feeds for different balance streams. I also use it to control options for scheduling my data plane and for internal version control.

Buyer's Guide
StreamSets
March 2024
Learn what your peers think about StreamSets. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
767,995 professionals have used our research since 2012.
Karthik Rajamani - PeerSpot reviewer
Principal Engineer at Tata Consultancy Services

I worked mostly on data ingestion use cases when I was using Data Collector. Later on, I got involved with some Spark-based transformations using Transformer.

Currently, we are not using CI/CD. We are not using automated deployments. We are manually deploying in prod, but going forward, we are planning to use CI/CD to have automated deployments.

I worked on on-prem and cloud deployments. The current implementation is on-prem, but in my previous project, we worked on an AWS-based implementation. We did a small PoC with GCP as well.

Namanya Brian - PeerSpot reviewer
CEO-founder at Tubayo

We use it for building a data lake in our content. We have sales multiple times during the day, and a sale is the trigger. Sales use the lake as a landing zone. We also use it for various types of data transformation.

Saket Pandey - PeerSpot reviewer
Product Manager at a hospitality company with 51-200 employees

We were receiving data from hospitals and other healthcare service providers in the country. We were predominantly operating in the US. When we received that data, we had to classify it into different repositories or datasets. This data was sent to different vendors, and for that, it needed to be processed in different ways. We needed to bifurcate the data at many steps with different kinds of filters. For that, we used StreamSets.
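As a rough illustration of the filter-based classification described above, here is a minimal Python sketch. It is not StreamSets code and not the reviewer's pipeline; the function `route_records`, the rule names, and the sample records are all hypothetical, assumed only for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

Record = Dict[str, str]
# A rule pairs a destination name with a predicate deciding whether a record belongs there.
Rule = Tuple[str, Callable[[Record], bool]]


def route_records(
    records: Iterable[Record],
    rules: List[Rule],
    default: str = "unclassified",
) -> Dict[str, List[Record]]:
    """Send each record to the first destination whose filter matches."""
    routed: Dict[str, List[Record]] = defaultdict(list)
    for record in records:
        for destination, predicate in rules:
            if predicate(record):
                routed[destination].append(record)
                break
        else:
            # No filter matched, so the record goes to the default destination.
            routed[default].append(record)
    return dict(routed)


# Hypothetical rules and sample data, assumed for this sketch only.
RULES: List[Rule] = [
    ("claims", lambda r: r.get("type") == "claim"),
    ("lab_results", lambda r: r.get("type") == "lab"),
]
SAMPLE: List[Record] = [
    {"id": "1", "type": "claim"},
    {"id": "2", "type": "lab"},
    {"id": "3", "type": "note"},
]
ROUTED = route_records(SAMPLE, RULES)
```

Chaining several such routing steps, each with its own rules, would give the "bifurcate at many steps" behavior the review mentions.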

JA
IT Project Manager at Orange España

It is being used by the data engineering team in projects we are working on in banking and healthcare services to design data pipelines for extracting and injecting data from on-premises and cloud services sources.

MI
Software Engineer at Soft Hostings Limited

StreamSets is being used in the IT department to make sure that we have a stable solution and that our configuration is secure and running smoothly. We are using it for our data analytics tool as well as for real-time prediction for various real-life business use cases. It's helping us generate new business ideas. It's a tool that allows us to share data between platforms, which also removes the dependency on other ETL tools, such as SSIS.

AbhishekKatara - PeerSpot reviewer
Technical Lead at Sopra Steria

StreamSets is a wonderful data engineering and DataOps tool in which we can design and create data pipelines that load on-prem data to the cloud. One of our major projects was to move data from on-premises to Azure and GCP Cloud. Once the data is loaded, the data scientist and data analyst teams use it to generate patterns and insights.

For a US healthcare service provider company, we designed a StreamSets pipeline to connect to relational database sources. We generated the schema from the source data and loaded the data into Azure Data Lake Storage (ADLS) or any other cloud, like S3 or GCP. This was one of our batch use cases.

With StreamSets, we have also tried to solve our real-time streaming use cases, where we were streaming data from a source Kafka topic to Azure Event Hubs. This was a trigger-based streaming pipeline, which moved data whenever it appeared in the Kafka topic. Since it was a streaming pipeline, it was continuously streaming data from Kafka to Azure for further analysis.
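The trigger-based behavior described above, where data moves as soon as it appears at the source, can be sketched in plain Python. This is only an illustration under stated assumptions: an in-memory `queue.Queue` stands in for the Kafka topic, a list stands in for Azure Event Hubs, and `forward_stream` and the `None` stop sentinel are hypothetical names used for this sketch, not part of StreamSets, Kafka, or Azure.

```python
import queue
from typing import Callable, List, Optional


def forward_stream(
    source: "queue.Queue[Optional[str]]",
    send: Callable[[str], None],
) -> int:
    """Forward each message to the destination as soon as it arrives."""
    forwarded = 0
    while True:
        message = source.get()  # blocks until a message is available
        if message is None:
            # Sentinel used only so this sketch terminates; a real
            # streaming pipeline would keep running indefinitely.
            break
        send(message)
        forwarded += 1
    return forwarded


# In-memory stand-ins for the source topic and the destination.
source_topic: "queue.Queue[Optional[str]]" = queue.Queue()
for event in ["event-1", "event-2", "event-3"]:
    source_topic.put(event)
source_topic.put(None)

delivered: List[str] = []
count = forward_stream(source_topic, delivered.append)
```

The blocking `get` call is what makes the loop trigger-based: nothing runs until a message shows up, and each message is forwarded immediately rather than in scheduled batches.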

Avinash Mukesh - PeerSpot reviewer
IT Specialists at Soft Hostings

We are sharing data between platforms. It's helping me to be independent of the ETL tools as well as have the data format without using any programming language.

JM
Software Engineer at ZIDIYO

We use StreamSets to create data pipelines and to make sure that we know the exact analytics of our data usage within our company.

Kevin Kathiem Mutunga - PeerSpot reviewer
Chief software engineer at Appnomu Business Services

In our department, we use StreamSets to design data pipelines that load all data from various RDBMS sources to the cloud, such as Azure. Our data analysts also use the data set to generate panels for our organization, and we use it for real-time use cases such as monitoring and consuming other streaming data. Additionally, we are able to customize StreamSets to suit our needs and budget.

Sumesh Gansar - PeerSpot reviewer
Product Marketing Manager at a tech vendor with 10,001+ employees

My primary use case with StreamSets is to integrate large data sets from multiple sources into a destination. We also use it as a platform to ingest data and deliver data for database analytics.

SS
Senior Data Engineer at an energy/utilities company with 1,001-5,000 employees

We are using the StreamSets DataOps platform to ingest data to a data lake.

MB
Director Data Engineering, Governance, Operation and Analytics Platform at a financial services firm with 10,001+ employees

We are using StreamSets to migrate our on-premises data to the cloud.

Al Mercado - PeerSpot reviewer
AI Engineer at Techvanguard

I was working on an integration project where I was using the StreamSets platform. I was looking at both their Data Collector and their Transformer. The idea was to integrate it with AWS SageMaker Canvas. Both of them are what they call no-code options. StreamSets is for data pipelining, managing your data flow, and transforming your data. SageMaker is from AWS, and Canvas is basically their no-code option for machine learning.

I was trying to connect it to a data object repository. For AWS, that's a specific managed service called S3. I wasn't trying to run it with a data warehouse.

Ramesh Kuppuswamy - PeerSpot reviewer
Senior Software Developer at a tech vendor with 10,001+ employees

The main use case of StreamSets is to work on data integration and ingesting data for DataOps and modern analytics. We also use it for integrating data files from multiple sources. We use it to build, monitor, and manage smart, continuous data pipelines.

Ved Prakash Yadav - PeerSpot reviewer
Senior Data Platform Manager at a manufacturing company with 10,001+ employees

StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data is loaded into Amazon Redshift or other data warehousing solutions.

BahatiAsher Faith - PeerSpot reviewer
Software Developer at Appnomu Business Services

It is primarily being used by our IT department to configure things and see what is missing and what the issues are. 

BR
Data Engineer at a consultancy with 11-50 employees

The project which I work on is developed in StreamSets and I lead the team. I'm the team leader and the Solution Architect. I also train my juniors and my team.

I've been using this tool for the last year and a half, and it is very effective for processing data from source to destination. I have developed many integrations in it.

SR
Product Marketer at a media company with 1,001-5,000 employees

Our major use case with StreamSets is to build data pipelines from multiple sources to multiple destinations. We mainly use the StreamSets Data Collector Engine for seamless streaming from any source to any destination.

We also use it to deliver continuous data for database operations and modern analytics.

TH
Senior Network Administrator at an energy/utilities company with 201-500 employees

We use the whole Data Collector application.

AC
Senior Technical Manager at a financial services firm with 501-1,000 employees

It performs very well. The main use is to extract information from some of our Kafka topics and put it in our internal systems and flat files, as well as to integrate with Java.

MP
Data Engineer at an energy/utilities company with 10,001+ employees

We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
