We compared IBM InfoSphere DataStage and IBM Cloud Pak for Data based on our user's reviews in several parameters.
IBM InfoSphere DataStage is praised for its strong data integration, connectors, workflow management, ETL functionalities, and data quality controls. In contrast, IBM Cloud Pak for Data is commended for its analytics capabilities, user interface, data management tools, integration, scalability, governance, security, collaboration, and AI-driven features. Feedback on customer service, setup duration, pricing, and ROI varies between the two products.
Features: IBM InfoSphere DataStage is praised for its strong data integration capabilities, comprehensive set of connectors, efficient workflow management, and robust ETL functionalities. On the other hand, IBM Cloud Pak for Data is valued for its robust analytics capabilities, ease of use, comprehensive data management tools, seamless integration, and advanced data governance and security features. It also offers AI-driven capabilities like machine learning and predictive analytics.
Pricing and ROI: The available data does not provide any information about the setup cost for IBM InfoSphere DataStage. Similarly, the pricing and licensing information for IBM Cloud Pak for Data is not provided in the available data source., IBM InfoSphere DataStage has no available data to determine its ROI, while there is also no information or insights about the ROI of IBM Cloud Pak for Data.
Room for Improvement: IBM InfoSphere DataStage does not have specific areas for improvement identified in the available responses. Similarly, there is no specific feedback or review available for IBM Cloud Pak for Data on what needs improvement.
Deployment and customer support: Based on the available summaries, it is not possible to compare the user reviews regarding the duration to establish IBM InfoSphere DataStage and IBM Cloud Pak for Data as the feedback related to these aspects is not provided for both products., Based on the available data, there is not enough information to provide a summary of the customer service and support of IBM InfoSphere DataStage. The customer service and support of IBM Cloud Pak for Data received a lack of feedback from the reviews provided.
The summary above is based on 24 interviews we conducted recently with IBM InfoSphere DataStage and IBM Cloud Pak for Data users. To access the review's full transcripts, download our report.
"What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds. That's an advantage of IBM Cloud Pak for Data."
"One of Cloud Pak's best features is the Watson Knowledge Catalog, which helps you implement data governance."
"Its data preparation capabilities are highly valuable."
"You can model the data there, connect the data models with the business processes and create data lineage processes."
"DataStage allows me to connect to different data sources."
"It is a scalable solution, and we have had no issues with its scalability in our company. I rate the solution's scalability a nine out of ten."
"The most valuable features are data virtualization and reporting."
"Cloud Pak's most valuable features are IBM MQ, IBM App Connect, IBM API Connect, and ISPF."
"ETL is the most valuable feature."
"The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities."
"Once you have Infosphere up and running properly, it is stable."
"Compared to other ETL tools, DataStage has excellent debugging and development capabilities. And the availability of connectors, even though we sometimes have to opt for specific ones. Also, the availability of patches is good."
"When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
"Finding logs is very easy on the solution."
"The most valuable feature is the ability to transfer information via notes."
"The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms"
"One thing that bugs me is how much infrastructure Cloud Pak requires for the initial deployment. It doesn't allow you to start small. The smallest permitted deployment is too big. It's a huge problem that prevents us from implementing the solution in many scenarios."
"Cloud Pak would be improved with integration with cloud service providers like Cloudera."
"The product is trying to be more maturity in terms of connectors. That, I believe, is an area where Cloud Pak can improve."
"The technical support could be a little better."
"The product must improve its performance."
"The solution could have more connectors."
"The tool depends on the control plane, an OpenShift container platform utilized as an orchestration layer...So, we have communicated this issue to IBM and asked if it is feasible to adapt the solution to work on a Kubernetes platform that we support."
"The solution's user experience is an area that has room for improvement."
"The solution can be a bit more user-friendly, similar to Informatica."
"Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions."
"The initial setup can be complex."
"I want the tool to continue with the on-prem version, not the cloud one."
"It would be great if they can include some basic version of data quality checking features."
"I'd like to be able to do more with the data and metadata, including copy and pasting, et cetera."
"Currently lacking virtualization ability."
"Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly. We tried to get hold of API documentation, but the documentation is not very well thought out. It should be more structured and elaborate. In terms of additional features, I would like to see good reporting on performance and performance-tuning recommendations that can be based on AI. I would also like to see better data profiling information being reported on InfoSphere."
IBM Cloud Pak for Data is ranked 16th in Data Integration with 11 reviews while IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews. IBM Cloud Pak for Data is rated 8.0, while IBM InfoSphere DataStage is rated 7.8. The top reviewer of IBM Cloud Pak for Data writes "A scalable data analytics and digital transformation tool that provides useful features and integrations". On the other hand, the top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". IBM Cloud Pak for Data is most compared with Azure Data Factory, Informatica Cloud Data Integration, Palantir Foundry, Denodo and IBM InfoSphere Information Server, whereas IBM InfoSphere DataStage is most compared with SSIS, Azure Data Factory, Talend Open Studio, Informatica PowerCenter and IBM InfoSphere Information Server. See our IBM Cloud Pak for Data vs. IBM InfoSphere DataStage report.
See our list of best Data Integration vendors.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.