Compare IBM InfoSphere DataStage vs. Pentaho Data Integration

Cancel
You must select at least 2 products to compare!
Most Helpful Review
Find out what your peers are saying about IBM InfoSphere DataStage vs. Pentaho Data Integration and other solutions. Updated: July 2021.
523,975 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
"DataStage works better with Linux operating systems when the application services are hosted on Linux system equipment, but it's powerful on Windows too.""The data lineage report can be filtered for reporting. The reports are user-friendly and take less time to find what you need.""The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable.""The most valuable feature is the product's versatility to inject data.""Finding logs is very easy on the solution.""ETL is the most valuable feature.""Once you have Infosphere up and running properly, it is stable.""The most valuable feature is the data integration for data warehousing."

More IBM InfoSphere DataStage Pros »

"The solution has a free to use community version.""The amount of data that it loads and processes is good.""Pentaho Data Integration is quite simple to learn, and there is a lot of information available online.""It's my understanding that the product can scale."

More Pentaho Data Integration Pros »

Cons
"I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers.""We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great.""Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions.""The interface needs improvement. It is really too technical. That is the main problem.""The template mapping could be easier.""There are three things that could improve - the cloud, monitoring and cloud integration. It's a solid product but not a modern one and of course it depends what you're looking for.""The setup is extremely difficult.""The pricing should be lower."

More IBM InfoSphere DataStage Cons »

"It's not very stable, at least not in the case of the community edition. I'm working with the community edition right now and I think perhaps it is because of that it is not very stable, it causes the system to sometimes hang. I'm not sure if this is the case for pair tiers.""I would like to see improvements made for real-time data processing.""I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking.""The product needs more plugins."

More Pentaho Data Integration Cons »

Pricing and Cost Advice
"The cost is too high.""It's very expensive.""Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction.""The price is expensive but there are no licensing fees.""It is quite expensive."

More IBM InfoSphere DataStage Pricing and Cost Advice »

"The price of the regular version is not reasonable and it should be lower.""Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."

More Pentaho Data Integration Pricing and Cost Advice »

report
Use our free recommendation engine to learn which Data Integration Tools solutions are best for your needs.
523,975 professionals have used our research since 2012.
Questions from the Community
Top Answer: The price is expensive but there are no licensing fees.
Top Answer: The initial setup could be more straightforward.
Top Answer: Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition :… more »
Top Answer: Depends upon the technologies being used. If you're using Oracle for both OLTP and OLAP then you'll get a lot of value from an Oracle solution. The other question is how up to date do you want your… more »
Top Answer: It's my understanding that the product can scale.
Ranking
5th
Views
15,856
Comparisons
12,897
Reviews
14
Average Words per Review
467
Rating
7.6
16th
Views
8,693
Comparisons
7,154
Reviews
4
Average Words per Review
624
Rating
7.8
Popular Comparisons
Also Known As
Kettle
Learn More
Overview
IBM InfoSphere DataStage integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.

Pentaho data integration prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, "analytics ready" data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.

Offer
Learn more about IBM InfoSphere DataStage
Learn more about Pentaho Data Integration
Sample Customers
Dubai Statistics Center, Etisalat Egypt
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Top Industries
REVIEWERS
Computer Software Company63%
Aerospace/Defense Firm13%
Healthcare Company13%
Financial Services Firm13%
VISITORS READING REVIEWS
Computer Software Company29%
Comms Service Provider14%
Financial Services Firm11%
Insurance Company6%
REVIEWERS
Government15%
Comms Service Provider15%
Healthcare Company15%
Financial Services Firm15%
VISITORS READING REVIEWS
Computer Software Company28%
Comms Service Provider19%
Financial Services Firm8%
Government7%
Company Size
REVIEWERS
Small Business40%
Midsize Enterprise4%
Large Enterprise56%
REVIEWERS
Small Business28%
Midsize Enterprise24%
Large Enterprise48%
Find out what your peers are saying about IBM InfoSphere DataStage vs. Pentaho Data Integration and other solutions. Updated: July 2021.
523,975 professionals have used our research since 2012.

IBM InfoSphere DataStage is ranked 5th in Data Integration Tools with 13 reviews while Pentaho Data Integration is ranked 16th in Data Integration Tools with 4 reviews. IBM InfoSphere DataStage is rated 7.6, while Pentaho Data Integration is rated 7.8. The top reviewer of IBM InfoSphere DataStage writes "A powerful tool with parallel data streams". On the other hand, the top reviewer of Pentaho Data Integration writes "Free to use, easy to set up, and has a great metadata injection feature". IBM InfoSphere DataStage is most compared with SSIS, Talend Open Studio, Azure Data Factory, IBM InfoSphere Information Server and Oracle GoldenGate, whereas Pentaho Data Integration is most compared with Talend Open Studio, SSIS, Informatica PowerCenter, Oracle Data Integrator (ODI) and Azure Data Factory. See our IBM InfoSphere DataStage vs. Pentaho Data Integration report.

See our list of best Data Integration Tools vendors and best Cloud Data Integration vendors.

We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.