We performed a comparison between SSIS, StreamSets, and WhereScape RED based on real PeerSpot user reviews.
Find out what your peers are saying about Microsoft, Informatica, Oracle and others in Data Integration."The most valuable feature of SSIS is its ease of use. It is easier to use than other applications."
"The most valuable feature of SSIS is that you can take data from other servers which are not MS SQL Server or Oracle."
"It is easy to set up the product."
"The technical support is very good."
"The debugging capabilities are great, particularly during data flow execution. You can look into the data and see what's going on in the pipeline."
"The UI is very user-friendly."
"The scalability of SSIS is good."
"SSIS provides you with lookup and transformation functions, and you have the flexibility to write your own custom code."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"This is a fantastically robust DW tool that will make you at least 10 times faster in producing a DW."
"RED has provided us the ability to integrate, stage, and transform data from diverse sources into an enterprise-grade data warehouse which meets the needs of my organization, but it also enables us to easily and quickly make ETL or DW changes."
"The tool supports multiple target update methods."
"RED generates comprehensive documentation and regenerates it as quickly as things changes, but it also provides impact documentation."
"Naturally produces a way to easily debug your DW data solutions."
"I found the initial setup very easy."
"Data transformations and rollups are easy to accomplish."
"It has a built-in automatic scheduling environment."
"Tuning using this solution requires extensive expertise to improve performance."
"I have a tool called ZappySys. I need that tool to cut down on the complexity of SSIS. That tool really helps with a quick turnaround. I can do things quickly, and I can do things accurately. I can get better reporting on errors."
"I come from a coding background and this tool is graphically based. Sometimes I think it's cumbersome to do mapping graphically. If there was a way to provide a simple script, it would be helpful and make it easier to use."
"The high prices attached to the product can be an area of concern where improvements are required."
"Sometimes, there are compatibility issues with some features. From time to time, I also face issues when trying to migrate. If I misconfigure things when I use Snapshot, the migration will fail.It can take a long time to migrate huge amounts of data, so it would be nice if that could be faster."
"SSIS doesn't have a very good user interface, but if you can work with it, it'll provide you with almost all of the functionality."
"Sometimes we need to connect to AWS to get additional data sources, so we have to install some external LAN and not a regular RDBMS. We need external tools to connect. It would be great if SSIS included these tools. I'd also like some additional features for row indexing and data conversion."
"The creation of the measure in the DAC's model could be improved."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"Visualization and monitoring need to be improved and refined."
"StreamSet works great for batch processing but we are looking for something that is more real-time. We need latency in numbers below milliseconds."
"They need a more robust support center. It has been a bit difficult to find solutions to problems that are out-of-the-box."
"Project-based searching of data objects in the data warehouse browser needs to be improved."
"Improve the object renaming ability (it works, but it could be more automated)."
"No support for change data capture or delta detection - that must be custom coded ."
"Jobs cannot be deleted via the deployment package. When deploying from dev to QA or production, a job has to be retired. The job has to be manually removed from the target environment."
"The solution can be a little more user-friendly on enterprise-level where people use it."
"The ability to execute SSIS projects within WhereScape would be nice because we have a lot of packages that are too cumbersome to recreate."
"Customization could be better."
Earn 20 points