We performed a comparison between Oracle GoldenGate and StreamSets based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."When we are replicating data between similar databases then it is straightforward."
"Data migration is the most valuable feature of Oracle GoldenGate."
"It is quite scalable."
"Since the world is going towards open source, the solution's heterogeneous data migration feature is very useful."
"It scales well. You can either have it working in with the databases, or you can move it outside. It's got the ability to parallelize up, so you can certainly extract and replay the data from your databases in parallel. So it does have horizontal scalability. That is probably one of its stronger features, the ability just to automatically parallelize the replay and playing of data to allow to scale to large volumes."
"It's very simple to configure, it's very simple to implement. In addition, the ability it has to capture data and transmit it with incredible speed is better than any of the product out there. It's extremely powerful."
"We can replicate data across different data centers and different technologies."
"It is a scalable solution."
"The best feature that I really like is the integration."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
"Some issues regarding stability need to be resolved."
"Technical support for Oracle products needs to be more efficient (at least locally in Egypt)."
"Lacks the ability to become a full-scale ETL tool."
"You cannot put everything in Golden Gate for the license and large models."
"There are a lot of bugs in the solution."
"Microservices Architecture is not user-friendly."
"The product lacks some features and it's expensive."
"We struggle with memory. It's limited. However, it may be because of our unique business case and how we use it that it's limiting for us."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"StreamSet works great for batch processing but we are looking for something that is more real-time. We need latency in numbers below milliseconds."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
Oracle GoldenGate is ranked 6th in Data Integration with 48 reviews while StreamSets is ranked 8th in Data Integration with 24 reviews. Oracle GoldenGate is rated 8.2, while StreamSets is rated 8.4. The top reviewer of Oracle GoldenGate writes "Performs real-time replication without data loss, but we cannot do much automation". On the other hand, the top reviewer of StreamSets writes "We no longer need to hire highly skilled data engineers to create and monitor data pipelines". Oracle GoldenGate is most compared with AWS Database Migration Service, Qlik Replicate, Quest SharePlex and Azure Data Factory, whereas StreamSets is most compared with Fivetran, Informatica PowerCenter, Azure Data Factory, SSIS and AWS Glue. See our Oracle GoldenGate vs. StreamSets report.
See our list of best Data Integration vendors.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.