We performed a comparison between IBM InfoSphere DataStage and Pentaho Data Integration and Analytics based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."ETL is the most valuable feature."
"Once you have Infosphere up and running properly, it is stable."
"Offers great flexibility."
"I am impressed with the tool's ETL tracing."
"The solution is stable."
"Finding logs is very easy on the solution."
"The Hierarchical Data Stage is good."
"The most valuable feature for our data processing needs is IBM InfoSphere DataStage's capability to handle ETL tasks with large record volumes."
"Provides a good open source option."
"The area where Lumada has helped us is in the commercial area. There are many extractions to compose reports about our sales team performance and production steps. Since we are using Lumada to gather data from each industry in each country. We can get data from Argentina, Chile, Brazil, and Colombia at the same time. We can then concentrate and consolidate it in only one place, like our data warehouse. This improves our production performance and need for information about the industry, production data, and commercial data."
"Data transformation within Pentaho is a nice feature that they have and that I value."
"It has improved our data integration capabilities."
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule."
"One of the most valuable features is the ability to create many API integrations. I'm always working with advertising agents and using Facebook and Instagram to do campaigns. We use Pentaho to get the results from these campaigns and to create dashboards to analyze the results."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great."
"The solution can be a bit more user-friendly, similar to Informatica."
"The interface needs improvement. It is really too technical. That is the main problem."
"I'd like to be able to do more with the data and metadata, including copy and pasting, et cetera."
"The error messaging needs to be improved."
"It takes a lot of time to actually trigger your job and then go into the logs and other stuff. So all of this is really time-consuming."
"Working with some of the big data components is good, but I can see improvements are needed."
"Currently lacking virtualization ability."
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"The performance could be improved. If they could have analytics perform well on large volumes, that would be a big deal for our products."
"The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"The support for the Enterprise Edition is okay, but what they have done in the last three or four years is move more and more things to that edition. The result is that they are breaking the Community Edition. That's what our impression is."
"If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
More Pentaho Data Integration and Analytics Pricing and Cost Advice →
IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews while Pentaho Data Integration and Analytics is ranked 15th in Data Integration with 48 reviews. IBM InfoSphere DataStage is rated 7.8, while Pentaho Data Integration and Analytics is rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". On the other hand, the top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Talend Open Studio and Informatica PowerCenter, whereas Pentaho Data Integration and Analytics is most compared with SSIS, Azure Data Factory, Talend Open Studio, Oracle Data Integrator (ODI) and AWS Database Migration Service. See our IBM InfoSphere DataStage vs. Pentaho Data Integration and Analytics report.
See our list of best Data Integration vendors.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.