Collibra Catalog vs Pentaho Data Integration and Analytics comparison

Collibra Catalog: 1,049 views | 874 comparisons | 100% willing to recommend
Pentaho Data Integration and Analytics (Hitachi Vantara): 3,247 views | 1,075 comparisons | 94% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Collibra Catalog and Pentaho Data Integration and Analytics based on real PeerSpot user reviews.

Find out what your peers are saying about Informatica, Alation, Collibra and others in Metadata Management.
To learn more, read our detailed Metadata Management Report (Updated: May 2024).
771,212 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros

Collibra Catalog:
  • "Collibra Catalog is simple to use and user-friendly for those who are not technically inclined since it is easy to find while also easy to see data lineage diagrams."
  • "Collibra Catalog has significantly enhanced data governance and compliance for our team, primarily through its valuable feature of endpoint lineage enabling visual representation of the data."
  • "Collibra Catalog's best feature is the data quality checker."
  • "The data lineage capability is valuable as it shows how different sources are connected and how data flows, which is crucial for projects like migrations. Moreover, data lineage visualization in Collibra Catalog aids our data governance initiatives."
  • "We have had no complaints about the stability."

Pentaho Data Integration and Analytics:
  • "The abstraction is quite good."
  • "We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
  • "Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
  • "The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
  • "Provides a good open source option."
  • "The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to troll through lines and lines of code and try and work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it."
  • "I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created."
  • "Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."

Cons

Collibra Catalog:
  • "The tool's overall functionalities need to improve since, nowadays, many tools, from a business perspective, are easy to use."
  • "Collibra Catalog could improve its automation to increase the efficiency of the software."
  • "I'd like to see more integration with other reporting sources."
  • "A key area for improvement in Collibra Catalog lies in its integration capabilities, particularly with a broader range of sources."

Pentaho Data Integration and Analytics:
  • "I would like to see improvements made for real-time data processing."
  • "The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet."
  • "Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools."
  • "In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version."
  • "I was not happy with the Pentaho Report Designer because of the way it was set up. There was a zone and, under it, another zone, and under that another one, and under that another one. There were a lot of levels and places inside the report, and it was a little bit complicated. You have to search all these different places using a mouse, clicking everywhere... each report is coded in a binary file... You cannot search with a text search tool..."
  • "I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
  • "The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi."
  • "Its basic functionality doesn't need a whole lot of change. There could be some improvement in the consistency of the behavior of different transformation steps. The software did start as open-source and a lot of the fundamental, everyday transformation steps that you use when building ETL jobs were developed by different people. It is not a seamless paradigm. A table input step has a different way of thinking than a data merge step."

Pricing and Cost Advice

Collibra Catalog:
  • "I think they can bring a few more features and align better with other quality products."
  • "Collibra Catalog is fairly priced - I would rate their pricing seven out of ten."
  • "The product is highly priced compared to other vendors."
  • "Collibra offers a per-user licensing model."

Pentaho Data Integration and Analytics:
  • "There is a good open source option (Community Edition)."
  • "The price of the regular version is not reasonable and it should be lower."
  • "Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
  • "It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
  • "I think Lumada's price is fair compared to some of the others, like BusinessObjects, which was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
  • "When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
  • "The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
  • "The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."

    Questions from the Community
    Top Answer: The data lineage capability is valuable as it shows how different sources are connected and how data flows, which is crucial for projects like migrations. Moreover, data lineage visualization in…
    Top Answer: I'd like to see more integration with other reporting sources like Qlik Sense, beyond the currently supported ones like Tableau and Power BI.
    Top Answer: Hi Rajneesh, yes, here is the feature comparison between the community and enterprise edition:…
    Top Answer: In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, it…
    Top Answer: My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could use…
    Ranking
    Collibra Catalog: 3rd out of 27 in Metadata Management
    Views: 1,049 | Comparisons: 874 | Reviews: 5 | Average Words per Review: 405 | Rating: 7.8
    Pentaho Data Integration and Analytics: 15th out of 101 in Data Integration
    Views: 3,247 | Comparisons: 1,075 | Reviews: 10 | Average Words per Review: 1,105 | Rating: 7.5
    Also Known As
    Pentaho Data Integration and Analytics: Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
    Overview

    Collibra Catalog is both a report and a data catalog that helps analysts and information users spend less time looking for essential clinical, financial, and operational data and more time solving critical business challenges.

    Pentaho Data Integration is a data integration and analytics platform for organizations of any size. It integrates data from diverse sources, including databases, files, and applications, and handles the cleaning and transformation needed to prepare that data for analysis. It also provides tools for data mining, machine learning, and statistical analysis. Its maturity and active community of users and developers make it a reliable and cost-effective option. Features include a comprehensive ETL toolkit, data cleaning and transformation capabilities, data analysis tools, and deployment options for data integration and analytics solutions.

    Sample Customers
    Collibra Catalog: AXA XL, DNB, Adobe, PMI, Holland America Line, UC Davis Health, Cox Automotive
    Pentaho Data Integration and Analytics: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
    Top Industries
    Collibra Catalog (visitors reading reviews): Financial Services Firm 26%, Computer Software Company 14%, Energy/Utilities Company 6%, Manufacturing Company 6%
    Pentaho Data Integration and Analytics (reviewers): Healthcare Company 19%, Financial Services Firm 19%, Comms Service Provider 11%, Manufacturing Company 11%
    Pentaho Data Integration and Analytics (visitors reading reviews): Financial Services Firm 19%, Computer Software Company 14%, Comms Service Provider 12%, Government 7%
    Company Size
    Collibra Catalog (visitors reading reviews): Small Business 15%, Midsize Enterprise 9%, Large Enterprise 75%
    Pentaho Data Integration and Analytics (reviewers): Small Business 27%, Midsize Enterprise 31%, Large Enterprise 42%
    Pentaho Data Integration and Analytics (visitors reading reviews): Small Business 21%, Midsize Enterprise 11%, Large Enterprise 68%

    Collibra Catalog is ranked 3rd in Metadata Management with 5 reviews while Pentaho Data Integration and Analytics is ranked 15th in Data Integration with 48 reviews. Collibra Catalog is rated 7.8, while Pentaho Data Integration and Analytics is rated 8.0. The top reviewer of Collibra Catalog writes "A user-friendly for those who are not technically inclined and useful for cataloging various reports". On the other hand, the top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". Collibra Catalog is most compared with Informatica Enterprise Data Catalog, Ab Initio Co>Operating System, Talend Data Management Platform, Palantir Foundry and Denodo, whereas Pentaho Data Integration and Analytics is most compared with SSIS, Azure Data Factory, Talend Open Studio, Oracle Data Integrator (ODI) and AWS Glue.

    We monitor all Metadata Management reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.