Anonymous UserSenior Consulting Analyst at a construction company
Viplove KhushalaniInnovation Lead at a tech services company
We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
"The ability to collaborate and approve business terms for the business glossary data dictionary is excellent."
"From the perspective of a data user, the solution has a great search capability."
"It's not a complicated product."
"The diagram showing the limits is absolutely magnificent."
"I find the metadata feature valuable because it replaces the coding."
"The catalog feature is definitely valuable because it makes ingesting the metadata of any application quite simple. You don't have to do things manually. You can just schedule an import, and it will just refresh the metadata of whatever application you want. That's what I like. I usually work on the technical side, and other than that, I usually create integrations. I integrate Collibra with different environments or applications. I'm a developer, so I cannot vouch for the business, but for me, it is quite developer-friendly with the Java API interface and the REST API interface that they have provided. It is good for creating dashboards based on the needs of each and every role. It can be user-specific or group-specific. We usually create dashboards and give them to our business users, and they are quite happy with that."
"In terms of data governance, as I mentioned, it can be a one-stop solution for all of your data governance needs."
"The customization and the dashboards are pretty good."
"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have."
"The connectors for metadata ingestion need to be improved."
"No easy way to connect to different data sources."
"While connecting with the data source, it's not very easy. If there's a firewall, it is difficult to connect with the database. It's not easy when you are configuring on the database."
"The workflows and the language they use needs to be improved."
"If someone changes the metadata, we can't see who changed it."
"It should have more integrations with things like CyberArk because its main purpose is GDPR implementation. We have to have more scope for things that implement more privacy. CyberArk makes sure your credentials are vaulted and your things are secure when you're creating your integrations or connecting to an application. I do believe that they are working on this feature."
"Recently, I find that the default process of issue management in Collibra is really complex — It wasn't really helpful to us."
"It would be better if there was a way to import all data and metadata in an automatic way in one block form."
"In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."
"I am not so much aware of price details. Initially, there was an add-on NuSoft license to use the DVC connector that NuSoft gives to create integrations, but Collibra is now phasing out of it slowly. Collibra is cutting ties with them is what we have been led to believe, and we have started developing on Spring Boot, which is open source."
"I think it is on a yearly basis, but I'm not involved with the pricing session, so I have no idea. There are several licenses for different models. You get one license per product, but a lot of features are controlled separately through different licenses. So, as and when you want to use a feature, you have to procure the license for that feature."
"I think they have a trust issue. I did not like the way they recently went through the process. They were like, "Finish this SOW first, only then will we sign the other SOW." Or, "Finish this code." I didn't like that much."
"I would say it's probably in line with what other vendors charge for licensing."
"It is substantial, and we do pay yearly."
Collibra Data Governance Center is an enterprise-wide data governance solution that puts people and processes first, automating data governance and management to quickly and securely deliver trusted data to the business users who need it.
The SAP® Data Hub solution enables sophisticated data operations management. It gives you the capability and flexibility to connect enterprise data and Big Data and gain a deep understanding of data and information processes across sources and systems throughout the distributed landscape. The unified solution provides visibility and control into data opportunities, integrating cloud and on-premise information and driving data agility and business value. Distributed processing power enables greater speed and efficiency.
Collibra Governance is ranked 1st in Data Governance with 18 reviews while SAP Data Hub is ranked 7th in Data Governance with 1 review. Collibra Governance is rated 7.8, while SAP Data Hub is rated 6.0. The top reviewer of Collibra Governance writes "Offers many features for the best relative price". On the other hand, the top reviewer of SAP Data Hub writes "Good push-down approach, on-premise connection, and integration with SAP products, but needs better performance and integration with other solutions". Collibra Governance is most compared with Informatica Axon, Alation Data Catalog, Microsoft Azure Purview, Ataccama ONE Platform and Zaloni Data Platform, whereas SAP Data Hub is most compared with SAP Data Services, Palantir Foundry, Azure Data Factory, SAP Process Orchestration and Denodo.
See our list of best Data Governance vendors.
We monitor all Data Governance reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.