Compare IBM InfoSphere DataStage vs. Pentaho Data Integration

IBM InfoSphere DataStage is ranked 6th in Data Integration Tools with 9 reviews, while Pentaho Data Integration is ranked 12th with 1 review. Both products are rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "Powerful, reliable and the ability to run it in parallel mode makes it very fast". The top reviewer of Pentaho Data Integration writes "Free to use, easy to set up, and has a great metadata injection feature". IBM InfoSphere DataStage is most compared with SSIS, Informatica PowerCenter, Talend Open Studio, Azure Data Factory and SAP Data Services, whereas Pentaho Data Integration is most compared with SSIS, Informatica PowerCenter, Talend Open Studio, Azure Data Factory and Oracle Data Integrator (ODI).
Find out what your peers are saying about Microsoft, Informatica, Talend and others in Data Integration Tools. Updated: July 2020.
430,988 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

ETL is the most valuable feature.
Finding logs is very easy on the solution.
The most valuable feature is the product's versatility to inject data.
The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable.
The data lineage report can be filtered for reporting. The reports are user-friendly and take less time to find what you need.
DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too.
The most valuable feature is the ability to transfer information via notes.
The product is a stable and powerful data management solution that can run in parallel mode for enhanced speed.

More IBM InfoSphere DataStage Pros »

The solution has a free to use community version.

More Pentaho Data Integration Pros »

There are three things that could improve - the cloud, monitoring and cloud integration. It's a solid product but not a modern one and of course it depends what you're looking for.
The template mapping could be easier.
The interface needs improvement. It is really too technical. That is the main problem.
Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions.
We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great.
I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers.
The documentation and in-application help for this solution need to be improved, especially for new features.
The interface needs work to be more user-friendly.

More IBM InfoSphere DataStage Cons »

It's not very stable, at least not in the case of the community edition. I'm working with the community edition right now, and I think that may be why it is not very stable; it sometimes causes the system to hang. I'm not sure if this is the case for the paid tiers.

More Pentaho Data Integration Cons »

Pricing and Cost Advice
Small and medium-sized companies cannot afford to pay for this solution.
Pricing varies based on use, and it is not as costly as some competing enterprise solutions.

More IBM InfoSphere DataStage Pricing and Cost Advice »

Information Not Available
IBM InfoSphere DataStage integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, "analytics ready" data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.

Sample Customers
Dubai Statistics Center, Etisalat Egypt
66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Top Industries
Computer Software Company 44%
Comms Service Provider 10%
Insurance Company 8%
Comms Service Provider 18%
Financial Services Firm 18%
Healthcare Company 18%
Computer Software Company 37%
Media Company 10%
Comms Service Provider 10%


We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.