Databricks vs Informatica PowerCenter comparison

Cancel
You must select at least 2 products to compare!
Databricks Logo
28,492 views|18,008 comparisons
96% willing to recommend
Informatica Logo
19,607 views|16,419 comparisons
90% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Sep 5, 2022

We performed a comparison between Databricks and Informatica PowerCenter based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: Users feel the overall setup of both solutions is straightforward and simple. Some users with more complex architectures feel that Informatica PowerCenter can be challenging to set up and may require an experienced technician.
  • Features: Databricks gives users the option of working with several different programming languages. This solution is cloud-based so start-up time is easy and super fast. Users find that error messages can sometimes be vague and ambiguous. This makes it challenging to debug, slows the overall process down, and negatively affects productivity.

    Informatica Powercenter users tell us the solution is very good for integrating a huge amount of data in a very short period of time. The enterprise-scale ETL solution is robust, very stable, and easy to scale. Users would like to see the solution modernized, as it is a bit dated. It also should handle more modern formats, such as JSON, to make it more competitive in the marketplace.
  • Pricing: Users feel both solutions are expensive.
  • Service and Support: Users are satisfied and feel the support they get for both products is timely, knowledgeable, and helpful.

Comparison Results: PeerSpot users consistently feel Databricks is a more complete solution, providing better integrations, features, and ease of use. The cloud-based architecture makes scaling seamless.

To learn more, read our detailed Databricks vs. Informatica PowerCenter Report (Updated: February 2023).
771,212 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Databricks helps crunch petabytes of data in a very short period of time.""The fast data loading process and data storage capabilities are great.""Databricks integrates well with other solutions.""It is a cost-effective solution.""Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours.""Imageflow is a visual tool that helps make it easier for business people to understand complex workflows.""The initial setup is pretty easy.""Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user."

More Databricks Pros →

"It is easy to use, and it is quick for developing things. It is fairly powerful, and it can integrate with a lot of different platforms without much hassle.""The most valuable features are the metadata repository and the data warehouse application console.""Good product if you are trying implement data quality, data integration, and data management projects.""The most complex task, in this case, was to read and transform BLOB data, and Java transformation in Informatica Power Center was a great solution.""It is very comprehensive in terms of connector and transformation capabilities from both a source and target perspective.""Complex transformations can be easily achieved by using PowerCenter. The processing layer does transformations and other things. About 80% of my transformations can be achieved by using the middle layer. For the remaining 15% to 20% transformations, I can go in and create stored procedures in the respective databases. Mapplets is the feature through which we can reuse transformations across pipelines. Transformations and caching are the key features that we have been using frequently. Informatica PowerCenter is one of the best solutions or products in the data integration space. We have extensively used PowerCenter for integration purposes. We usually look at the best bridge solution in our architecture so that it can sustain for maybe a couple of years. Usually, we go with the solution that fits best and has proven and time-tested technology.""The most valuable features of Informatica PowerCenter are the ease of use, and development, and is simple to find resources.""Enterprise-scale ETL solution that's very stable and is easy to scale. It integrates and connects with multiple new systems, both structured and semi-structured."

More Informatica PowerCenter Pros →

Cons
"It's not easy to use, and they need a better UI.""The solution has some scalability and integration limitations when consolidating legacy systems.""When I used the support, I had communication problems because of the language barrier with the agent. The accent was difficult to understand.""The interface of Databricks could be easier to use when compared to other solutions. It is not easy for non-data scientists. The user interface is important before we had to write code manually and as solutions move to "No code AI" it is critical that the interface is very good.""Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity.""Generative AI is catching up in areas like data governance and enterprise flavor. Hence, these are places where Databricks has to be faster.""The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration.""Databricks doesn't offer the use of Python scripts by itself and is not connected to GitHub repositories or anything similar. This is something that is missing. if they could integrate with Git tools it would be an advantage."

More Databricks Cons →

"It would be nice to have all tools in one place. CDC needs more effort, as it's only easy to develop if you are familiar with Linux.""PowerCenter could integrate better with cloud applications. We had to do a lot of configuration work using API integrations to connect with cloud applications. Informatica Cloud Data Integration has a generic connector that you can use directly, so it's much easier.""As a connector to big data, it is not well developed. We've had problems connecting Informatica with Hadoop. The functionality to connect Informatica with Hadoop, for me it's not good.""The initial setup is not straightforward. You need expertise to do it.""Support could be better.""Its interface can be modernized. It is an old product. I have been working with it for 14 years, and it still looks the same. It hasn't been modernized much. It also needs to handle more modern formats, such as JSON files. It works with the old text files and databases, but it does not always work with the newer, modern stuff. You need to make your own programs to support that kind of stuff. Support is also a kind of difficult with Informatica. They don't do direct support and rely on using their distributors around the globe for support, which means that you kind of have to go through this layer of different companies before you get help.""Its licensing can be improved. It should be features-wise and not bundle-wise. A bundle will definitely be costly. In addition, we might use one or two features. That's why the pricing model should be based on the features. The model should be flexible enough based on the features. Their support should also be more responsive to premium customers.""I would like to see improvements made to the custom transformations. It should be more open for users that want to write their own code and use cases."

More Informatica PowerCenter Cons →

Pricing and Cost Advice
  • "Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
  • "I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
  • "Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
  • "We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
  • "The pricing depends on the usage itself."
  • "I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
  • "The price is okay. It's competitive."
  • "Databricks uses a price-per-use model, where you can use as much compute as you need."
  • More Databricks Pricing and Cost Advice →

  • "We have found the pricing very cost-effective. The licensing is CPU and data source-based."
  • "Cost could be improved."
  • "Licensing is a one time cost. But maintenance costs depend on what you want, how long you need it. Maintenance is a kind of insurance. With health insurance, you don't know whether you will get sick or need to go to hospital or not but you have to have insurance. It's the same thing with support. If you have that expertise in resolving issues, if you have enough experience in your IT department, I would say you don't need the support. But in practice, they recommend you go with the support. If you want support you have to pay for it."
  • "Price-wise, it's more expensive than SSIS, but it's a better tool, so it has more features. Licensing is on a yearly basis."
  • "Its maintenance is expensive."
  • "It's much more expensive, almost three times more expensive than most other solutions."
  • "We are satisfied with the pricing."
  • "I consider this to be an expensive product."
  • More Informatica PowerCenter Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
    771,212 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations There are many third-party vendors offering ETL solutions, but two of the most popular are PowerCenter Informatica and Microsoft SSIS (SQL Server Integration Services). Each technology has its advantages but there are also similarities on how they carry out the extract-transform-load processes and only differ in terminologies. If you’re in the process of choosing ETL tools and PowerCenter Informatica and Microsoft SSIS made it to your shortlist, here is a short comparative discussion detailing the differences between the two, as well as their benefits. Package Configuration Most enterprise data integration projects would require the capacity to develop a solution in one platform and test and deploy it in a separate environment without having to manually change the established workflow. In order to achieve this seamless movement between two environments, your ETL technology should allow the dynamic update of the project’s properties using the content or a parameter file or configuration. Both Informatica and SSIS support this functionality using different methodologies. In Informatica, every session can have more than one source and one or more destination connections. There are… Read more →
    Questions from the Community
    Top Answer:Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with… more »
    Top Answer:We researched AWS SageMaker, but in the end, we chose Databricks Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It… more »
    Top Answer:Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their… more »
    Top Answer:Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up and… more »
    Top Answer:SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge… more »
    Ranking
    1st
    Views
    28,492
    Comparisons
    18,008
    Reviews
    47
    Average Words per Review
    441
    Rating
    8.3
    3rd
    out of 101 in Data Integration
    Views
    19,607
    Comparisons
    16,419
    Reviews
    28
    Average Words per Review
    478
    Rating
    7.5
    Comparisons
    Also Known As
    Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
    PowerCenter
    Learn More
    Overview

    Databricks is an industry-leading data analytics platform which is a one-stop product for all data requirements. Databricks is made by the creators of Apache Spark, Delta Lake, ML Flow, and Koalas. It builds on these technologies to deliver a true lakehouse data architecture, making it a robust platform that is reliable, scalable, and fast. Databricks speeds up innovations by synthesizing storage, engineering, business operations, security, and data science.

    Databricks is integrated with Microsoft Azure, Amazon Web Services, and Google Cloud Platform. This enables users to easily manage a colossal amount of data and to continuously train and deploy machine learning models for AI applications. The platform handles all analytic deployments, ranging from ETL to models training and deployment.

    Databricks deciphers the complexities of processing data to empower data scientists, engineers, and analysts with a simple collaborative environment to run interactive and scheduled data analysis workloads. The program takes advantage of AI’s cost-effectivity, flexibility, and cloud storage.

    Databricks Key Features

    Some of Databricks key features include:

    • Cloud-native: Works well on any prominent cloud provider.
    • Data storage: Stores a broad range of data, including structured, unstructured, and streaming.
    • Self-governance: Built-in governance and security controls.
    • Flexibility: Flexible for small-scale jobs as well as running large-scale jobs like Big Data processing because it’s built from Spark and is specifically optimized for Cloud environments.
    • Data science tools: Production-ready data tooling, from engineering to BI, AI, and ML.
    • Familiar languages: While Databricks is Spark-based, it allows commonly used programming languages like R, SQL, Scala, and Python to be used.
    • Team sharing workspaces: Creates an environment that provides interactive workspaces for collaboration, which allow multiple members to collaborate for data model creation, machine learning, and data extraction.
    • Data source: Performs limitless Big Data analytics by connecting to Cloud providers AWS, Azure, and Google, as well as on-premises SQL servers, JSON and CSV.

    Reviews from Real Users

    Databricks stands out from its competitors for several reasons. Two striking features are its collaborative ability and its ability to streamline multiple programming languages.

    PeerSpot users take note of the advantages of these features. A Chief Research Officer in consumer goods writes, “We work with multiple people on notebooks and it enables us to work collaboratively in an easy way without having to worry about the infrastructure. I think the solution is very intuitive, very easy to use. And that's what you pay for.”

    A business intelligence coordinator in construction notes, “The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes.”

    An Associate Manager who works in consultancy mentions, “The technology that allows us to write scripts within the solution is extremely beneficial. If I was, for example, able to script in SQL, R, Scala, Apache Spark, or Python, I would be able to use my knowledge to make a script in this solution. It is very user-friendly and you can also process the records and validation point of view. The ability to migrate from one environment to another is useful.”

    Informatica PowerCenter is a data integration and data visualization tool. The solution works as an enterprise data integration platform that helps organizations access, transform, and integrate data from various systems. The product is designed to support companies in the full cycle of a project, from its initial rollout to critical deployments. Informatica PowerCenter allows developers and analysts to collaborate while accelerating the work process to deploy projects within days instead of months.

    The Advanced edition of the product provides an additional real-time engine which allows companies to have always-on enterprise data integration. This ensures seamless collaboration and increment of data lineage visibility and impacts analysis.

    The Premium edition of the solution offers an early warning system that detects unexpected behaviors or incorrect utilization of resources in the workflows and alerts companies in the case that these occur. This version of the product also offers automatic data validation, which ensures data accuracy and reduces testing time and expenditure of resources for by up to 90%.

    Informatica PowerCenter Features

    The product provides users with various features which allow them to execute data integration initiatives such as analytics, data warehousing, data governance, consolidation, and application migration. The features of the solution include:

    • Collaboration: Informatica PowerCenter offers role-based tools and processes which enable business self-service while benefiting from high-quality IT resources.

    • Automation: Through various automations and easy-to-use software, users can utilize graphical and codeless tools and initiate effective data integration without additional knowledge.

    • Scalability: The tool provides high scalability to users, which ensures seamless performance and minimum downtime. PowerCenter also has adaptive load balancing, pushdown optimization, and dynamic partitioning.

    • Monitoring: Through the extensive monitoring feature, the operations and governance of the solution are easily overseen by users. The tool also provides alerts that can prevent damage to the system.

    • Real-time data: Through real-time data, users can monitor applications and analytics, ensuring their efficient operation.

    • Prototyping: Informatica lets its users collaborate with information technology to prototype, profile, and validate results in a timely manner.

    • Connectivity: Users can access and integrate data from different types of sources through high-performance connectors.

    • Automated data validation testing: The product offers script-free automated and repeatable audit and validation of data.

    • Data transformation: This feature allows users to use comprehensive parsing of JSON, PDF, XML, Microsoft Office, and the Internet of Things (IoT) for non-relation data.

    • Cloud applications connectivity: The product allows for seamless connection to cloud application sources and targets.

    Informatica PowerCenter Benefits

    The benefits of using Informatica PowerCenter include:

    • The tool can work over a wide range of systems and platforms and also allows for lean integration.

    • It enhances the quality and speed of performance and optimizes the cost of the process for your organization.

    • PowerCenter supports multiple databases, including TPump, Parallel Transporter Fastload, and Teradata MLoad.

    • The tool is very easy to monitor and maintain, which simplifies the data integration process for companies.

    • The centralized error logging system allows users to locate errors in a timely manner and correct them.

    • The tool can convert data from an application to another format, as it serves as one of the most powerful data transformation solutions.

    • PowerCenter can also serve as middleware between two applications.

    • The solution offers both parallel processing and load balancing.

    • PowerCenter is a tool with a high level of security, which also minimizes essential administration activities.

    • The solution ensures the quality of information, as it does not allow invalid or unwanted data to be uploaded to the source.

    Reviews from Real Users

    Yahya T., a developer and architect at L'Oreal, says the product is stable, provides good support, and integrating it with other systems is very fast.

    Mohamed E., a senior manager for Data management and data governance at a tech company, says PowerCenter is stable, mature, and offers flexibility in building the pipeline and has a drag-and-drop mode because it's GUI-based; technical support is brilliant.

    Sample Customers
    Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
    University of Texas MD Anderson Cancer Center, LexisNexis, Rabobank
    Top Industries
    REVIEWERS
    Computer Software Company25%
    Financial Services Firm16%
    Manufacturing Company9%
    Retailer9%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    Computer Software Company12%
    Manufacturing Company9%
    Healthcare Company6%
    REVIEWERS
    Computer Software Company22%
    Financial Services Firm20%
    Retailer7%
    Insurance Company7%
    VISITORS READING REVIEWS
    Financial Services Firm18%
    Computer Software Company12%
    Manufacturing Company8%
    Insurance Company7%
    Company Size
    REVIEWERS
    Small Business27%
    Midsize Enterprise14%
    Large Enterprise59%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise11%
    Large Enterprise71%
    REVIEWERS
    Small Business16%
    Midsize Enterprise11%
    Large Enterprise73%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    Buyer's Guide
    Databricks vs. Informatica PowerCenter
    February 2023
    Find out what your peers are saying about Databricks vs. Informatica PowerCenter and other solutions. Updated: February 2023.
    771,212 professionals have used our research since 2012.

    Databricks is ranked 1st in Data Science Platforms with 78 reviews while Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews. Databricks is rated 8.2, while Informatica PowerCenter is rated 8.0. The top reviewer of Databricks writes "A nice interface with good features for turning off clusters to save on computing". On the other hand, the top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". Databricks is most compared with Amazon SageMaker, Dataiku, Microsoft Azure Machine Learning Studio, Dremio and Azure Stream Analytics, whereas Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, AWS Glue and Oracle Data Integrator (ODI). See our Databricks vs. Informatica PowerCenter report.

    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.