We performed a comparison between Azure Data Factory and Pentaho Data Integration and Analytics based on real PeerSpot user reviews.
Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"Its integrability with the rest of the activities on Azure is most valuable."
"ADF is another ETL tool similar to Informatica that can transform data or copy it from on-prem to the cloud or vice versa. Once we have the data, we can apply various transformations to it and schedule our pipeline according to our business needs. ADF integrates with Databricks. We can call our Databricks notebooks and schedule them via ADF."
"It makes it easy to collect data from different sources."
"It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory."
"Data Factory's best features are connectivity with different tools and focusing data ingestion using pipeline copy data."
"The most valuable feature of Azure Data Factory is that it has a good combination of flexibility, fine-tuning, automation, and good monitoring."
"The most valuable features of the solution are its ease of use and the readily available adapters for connecting with various sources."
"The flexibility that Azure Data Factory offers is great."
"It's my understanding that the product can scale."
"Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."
"The area where Lumada has helped us is in the commercial area. There are many extractions to compose reports about our sales team performance and production steps. Since we are using Lumada to gather data from each industry in each country. We can get data from Argentina, Chile, Brazil, and Colombia at the same time. We can then concentrate and consolidate it in only one place, like our data warehouse. This improves our production performance and need for information about the industry, production data, and commercial data."
"Data transformation within Pentaho is a nice feature that they have and that I value."
"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
"It has improved our data integration capabilities."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"Azure Data Factory should be cheaper to move data to a data center abroad for calamities in case of disasters."
"When working with AWS, we have noticed that the difference between ADF and AWS is that AWS is more customer-focused. They're more responsive compared to any other company. ADF is not as good as AWS, but it should be. If AWS is ten out of ten, ADF is around eight out of ten. I think AWS is easier to understand from the GUI perspective compared to ADF."
"Lacks a decent UI that would give us a view of the kinds of requests that come in."
"I rate Azure Data Factory six out of 10 for stability. ADF is stable now, but we had problems recently with indexing on an SQL database. It's slow when dealing with a huge volume of data. It depends on whether the database is configured as general purpose or hyperscale."
"There is no built-in function for automatically adding notifications concerning the progress or outline of a pipeline run."
"In the next release, it's important that some sort of scheduler for running tasks is added."
"The pricing model should be more transparent and available online."
"For some of the data, there were some issues with data mapping. Some of the error messages were a little bit foggy. There could be more of a quick start guide or some inline examples. The documentation could be better."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"I would like to see more improvements with AS400 DB2."
"A big problem after deploying something that we do in Lumada is with Git. You get a binary file to do a code review. So, if you need to do a review, you have to take pictures of the screen to show each step. That is the biggest bug if you are using Git."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"Its basic functionality doesn't need a whole lot of change. There could be some improvement in the consistency of the behavior of different transformation steps. The software did start as open-source and a lot of the fundamental, everyday transformation steps that you use when building ETL jobs were developed by different people. It is not a seamless paradigm. A table input step has a different way of thinking than a data merge step."
"One thing that I don't like, just a little, is the backward compatibility."
"If you develop it on MacBook, it'll be quite a hassle."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
Azure Data Factory is ranked 1st in Data Integration with 81 reviews, while Pentaho Data Integration and Analytics is ranked 16th in Data Integration with 48 reviews. Both Azure Data Factory and Pentaho Data Integration and Analytics are rated 8.0. The top reviewer of Azure Data Factory writes "The data factory agent is quite good but pricing needs to be more transparent". On the other hand, the top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Alteryx Designer, Snowflake and Microsoft Azure Synapse Analytics, whereas Pentaho Data Integration and Analytics is most compared with SSIS, Talend Open Studio, Oracle Data Integrator (ODI), AWS Glue and SAP Data Services. See our Azure Data Factory vs. Pentaho Data Integration and Analytics report.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.