We performed a comparison between Databricks and Informatica PowerCenter based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.
Comparison Results: PeerSpot users consistently feel Databricks is a more complete solution, providing better integrations, features, and ease of use. The cloud-based architecture makes scaling seamless.
"Databricks helps crunch petabytes of data in a very short period of time."
"The fast data loading process and data storage capabilities are great."
"Databricks integrates well with other solutions."
"It is a cost-effective solution."
"Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours."
"Imageflow is a visual tool that helps make it easier for business people to understand complex workflows."
"The initial setup is pretty easy."
"Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user."
"It is easy to use, and it is quick for developing things. It is fairly powerful, and it can integrate with a lot of different platforms without much hassle."
"The most valuable features are the metadata repository and the data warehouse application console."
"Good product if you are trying implement data quality, data integration, and data management projects."
"The most complex task, in this case, was to read and transform BLOB data, and Java transformation in Informatica Power Center was a great solution."
"It is very comprehensive in terms of connector and transformation capabilities from both a source and target perspective."
"Complex transformations can be easily achieved by using PowerCenter. The processing layer does transformations and other things. About 80% of my transformations can be achieved by using the middle layer. For the remaining 15% to 20% transformations, I can go in and create stored procedures in the respective databases. Mapplets is the feature through which we can reuse transformations across pipelines. Transformations and caching are the key features that we have been using frequently. Informatica PowerCenter is one of the best solutions or products in the data integration space. We have extensively used PowerCenter for integration purposes. We usually look at the best bridge solution in our architecture so that it can sustain for maybe a couple of years. Usually, we go with the solution that fits best and has proven and time-tested technology."
"The most valuable features of Informatica PowerCenter are the ease of use, and development, and is simple to find resources."
"Enterprise-scale ETL solution that's very stable and is easy to scale. It integrates and connects with multiple new systems, both structured and semi-structured."
"It's not easy to use, and they need a better UI."
"The solution has some scalability and integration limitations when consolidating legacy systems."
"When I used the support, I had communication problems because of the language barrier with the agent. The accent was difficult to understand."
"The interface of Databricks could be easier to use when compared to other solutions. It is not easy for non-data scientists. The user interface is important before we had to write code manually and as solutions move to "No code AI" it is critical that the interface is very good."
"Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity."
"Generative AI is catching up in areas like data governance and enterprise flavor. Hence, these are places where Databricks has to be faster."
"The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration."
"Databricks doesn't offer the use of Python scripts by itself and is not connected to GitHub repositories or anything similar. This is something that is missing. if they could integrate with Git tools it would be an advantage."
"It would be nice to have all tools in one place. CDC needs more effort, as it's only easy to develop if you are familiar with Linux."
"PowerCenter could integrate better with cloud applications. We had to do a lot of configuration work using API integrations to connect with cloud applications. Informatica Cloud Data Integration has a generic connector that you can use directly, so it's much easier."
"As a connector to big data, it is not well developed. We've had problems connecting Informatica with Hadoop. The functionality to connect Informatica with Hadoop, for me it's not good."
"The initial setup is not straightforward. You need expertise to do it."
"Support could be better."
"Its interface can be modernized. It is an old product. I have been working with it for 14 years, and it still looks the same. It hasn't been modernized much. It also needs to handle more modern formats, such as JSON files. It works with the old text files and databases, but it does not always work with the newer, modern stuff. You need to make your own programs to support that kind of stuff. Support is also a kind of difficult with Informatica. They don't do direct support and rely on using their distributors around the globe for support, which means that you kind of have to go through this layer of different companies before you get help."
"Its licensing can be improved. It should be features-wise and not bundle-wise. A bundle will definitely be costly. In addition, we might use one or two features. That's why the pricing model should be based on the features. The model should be flexible enough based on the features. Their support should also be more responsive to premium customers."
"I would like to see improvements made to the custom transformations. It should be more open for users that want to write their own code and use cases."
Databricks is ranked 1st in Data Science Platforms with 78 reviews while Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews. Databricks is rated 8.2, while Informatica PowerCenter is rated 8.0. The top reviewer of Databricks writes "A nice interface with good features for turning off clusters to save on computing". On the other hand, the top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". Databricks is most compared with Amazon SageMaker, Dataiku, Microsoft Azure Machine Learning Studio, Dremio and Azure Stream Analytics, whereas Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, AWS Glue and Oracle Data Integrator (ODI). See our Databricks vs. Informatica PowerCenter report.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.