Pentaho Data Integration and Analytics vs SSIS comparison

Cancel
You must select at least 2 products to compare!
Hitachi Vantara Logo
3,346 views|1,127 comparisons
94% willing to recommend
Microsoft Logo
Read 69 SSIS reviews
19,568 views|15,878 comparisons
79% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Pentaho Data Integration and Analytics and SSIS based on real PeerSpot user reviews.

Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Pentaho Data Integration and Analytics vs. SSIS Report (Updated: March 2024).
768,857 professionals have used our research since 2012.
Q&A Highlights
Question: Which ETL tool would you recommend to populate data from OLTP to OLAP?
Answer: We have experiences only in Pentaho Data Integrator (open source competitor of the Oracle Data Integrator). But OLAP exporting wasn't in our scope until now.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule.""The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to troll through lines and lines of code and try and work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it.""One of the most valuable features is the ability to create many API integrations. I'm always working with advertising agents and using Facebook and Instagram to do campaigns. We use Pentaho to get the results from these campaigns and to create dashboards to analyze the results.""Pentaho Data Integration is quite simple to learn, and there is a lot of information available online.""Data transformation within Pentaho is a nice feature that they have and that I value.""We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic.""It has improved our data integration capabilities​.""Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."

More Pentaho Data Integration and Analytics Pros →

"We can connect with multiple data sources easily using an external connector in SSIS.""It has good data integration and good processes.""It's something I needed for bulk imports. I'm not a big fan of it, but I haven't seen anything better.""The simplicity of the solution is great. The solution also offers excellent integration.""The performance and stability are good.""The technical support is very good.""The workflow features have been very valuable. You can have automated workflows and all the steps are controlled. The workflow functionality of integration services is excellent.""It's saved time using visualization descriptions."

More SSIS Pros →

Cons
"Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools.""​I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse​.""I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking.""In terms of the flexibility to deploy in any environment, such as on-premise or in the cloud, we can do the cloud deployment only through virtual machines. We might also be able to work on different environments through Docker or Kubernetes, but we don't have an Azure app or an AWS app for easy deployment to the cloud. We can only do it through virtual machines, which is a problem, but we can manage it. We also work with Databricks because it works with Spark. We can work with clustered servers, and we can easily do the deployment in the cloud. With a right-click, we can deploy Databricks through the app on AWS or Azure cloud.""The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet.""If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was.""The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi.""I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."

More Pentaho Data Integration and Analytics Cons →

"We've had issues in terms of the amount of data that is transferred when we are scheduling.""It needs more integration tools, so you can connect to different sources.""The solution should work on the GPU, graphical processing unit. There should also be piping integration available.""Sometimes we need to connect to AWS to get additional data sources, so we have to install some external LAN and not a regular RDBMS. We need external tools to connect. It would be great if SSIS included these tools. I'd also like some additional features for row indexing and data conversion.""It's difficult to refactor SSIS. It gets cumbersome to reuse the solution.""I would like to see more features in terms of the integration with Azure Data Factory.""When I compare Talend and SSIS, Talend provides more features. With Talend, we can handle a large volume of data. Talend is usually used to treat a large volume of data, which makes it better than SSIS on the data side. Talend also has a very good Talend Management Console to schedule the jobs and do other things. It can also be easily connected to version control tools such as GitHub or SVN. The last time I used SSIS, it was connected through TSS for the Windows Console version. I am not sure it has been improved or not. If it is not improved, Microsoft should improve it. They should change the product to provide another console.""The solution could improve on integrating with other types of data sources."

More SSIS Cons →

Pricing and Cost Advice
  • "There is a good open source option (Community Edition)​."
  • "The price of the regular version is not reasonable and it should be lower."
  • "Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
  • "It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
  • "I think Lumada's price is fair compared to some of the others, like BusinessObjects, which is was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
  • "When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
  • "The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
  • "The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
  • More Pentaho Data Integration and Analytics Pricing and Cost Advice →

  • "This solution has provided an inexpensive tool, and it is easy to find experienced developers."
  • "My advice is to look at what your configuration will be because most companies have their own deals with Microsoft."
  • "This solution is included with the MSSQL server package."
  • "It would be beneficial if the solution had a less costly cloud offering."
  • "Based on my experience and understanding, Talend comes out to be a little bit expensive as compared to SSIS. The average cost of having Talend with Talend Management Console is around 72K per region, which is much higher than SSIS. SSIS works very well with Microsoft technologies, and if you have Microsoft technologies, it is not really expensive to have SSIS. If you have SQL Server, SSIS is free."
  • "We have an enterprise license for this solution."
  • "It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
  • "All of my clients have this product included as part of their Microsoft license."
  • More SSIS Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
    768,857 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations There are many third-party vendors offering ETL solutions, but two of the most popular are PowerCenter Informatica and Microsoft SSIS (SQL Server Integration Services). Each technology has its advantages but there are also similarities on how they carry out the extract-transform-load processes and only differ in terminologies. If you’re in the process of choosing ETL tools and PowerCenter Informatica and Microsoft SSIS made it to your shortlist, here is a short comparative discussion detailing the differences between the two, as well as their benefits. Package Configuration Most enterprise data integration projects would require the capacity to develop a solution in one platform and test and deploy it in a separate environment without having to manually change the established workflow. In order to achieve this seamless movement between two environments, your ETL technology should allow the dynamic update of the project’s properties using the content or a parameter file or configuration. Both Informatica and SSIS support this functionality using different methodologies. In Informatica, every session can have more than one source and one or more destination connections. There are… Read more →
    Answers from the Community
    RajneeshShukla
    AThiré - PeerSpot reviewerAThiré
    User

    There are two products I know about
    * TimeXtender : Microsoft based, Transformation logic is quiet good and can easily be extended with T-SQL , Has a semantic layer that generates metat data for cubes . price approx 40K$, works with tables
    . Attunity (Bought by Qlik) : technology agnostic , nice web interface , expensive > 100K€. Works with transaction logs

    There are many other pure ETL tools
    * ERWIN has a nice one ,

    Phil Wilkins - PeerSpot reviewerPhil Wilkins (Capgemini)
    Consultant

    Depends upon the technologies being used. If you're using Oracle for both OLTP and OLAP then you'll get a lot of value from an Oracle solution.


    The other question is how up to date do you want your OLAP DB to be? Goldengate is a good answer if you're looking to minimize latency, but it can be expensive. ODI is less expensive but better suited to bulkier data sets.  If an Oracle product wasn't the option I'd probably consider something like Informatica.

    Karoly Krokovay - PeerSpot reviewerKaroly Krokovay
    Real User

    Hi Rajneesh,
    yes here is the feature comparison between the community and enterprise edition : www.hitachivantara.com

    And a short description of the community edition: www.predictiveanalyticstoday.com

    And the download link: community.hitachivantara.com

    You can ask more from the great community: forums.pentaho.com

    Regards
    Károly

    Stefan Schäfer - PeerSpot reviewerStefan Schäfer
    User

    We usually use Talend.
    Look here: community.talend.com

    GaryM - PeerSpot reviewerGaryM
    Real User

    As someone mentioned, if you're purely Oracle shop and staying that way then there's value with prioritizing Oracle tools.  However, let me contrast that with this caveat...


    Consider expectations for tool and vendor longevity. Oracle has a long history of retiring and/or replacing tools leaving customers in the cold with prior versions/tools (I've been burned multiple times by Oracle product retirements or replacements including OWB, Oracle Designer2k, Oracle Express, Oracle OEDW, their purchase of Sagent ETL which as later abandoned).


    But I would also consider these questions and relative prioritization:  


    What is your organization's plans for moving to other database technologies?  


    Where is your org going with on-prem versus cloud solutions?  How important are PaaS versus IaaS solutions?  


    Where is your current staff's expertise?  


    Prioritize mature over immature tools. 


    How many sources do you have?  What are their technologies and does the integration tool support them?


    Is it just moving data from a single ERP such as Oracle EBS to Olap? When you say Olap what do you mean by that?  Are you talking Oracle Olap product or something else?  That makes a really big difference of course - if your ETL tool doesn't support your source(s) and target(s) then it shouldn't be considered.


    Given the industry's trajectory, I myself would highly prioritize PaaS solutions over others.

    Yeap Jiun Aing - PeerSpot reviewerYeap Jiun Aing
    Real User

    What is the OLAP that you are using? Hosted in Cloud or on-premise? 


    The target DB should have its tool to extract data.

    Efe Dagistanli - PeerSpot reviewerEfe Dagistanli
    User

    Pentaho is a really nice tool if opensource is the only option. 


    Please think about issues such as upgrade and disaster in the future. These operations are very easy in Pentaho.


    I can only suggest one thing for replication and that is Qlik. (ex-Attunity).

    RajneeshShukla - PeerSpot reviewerRajneeshShukla
    Real User

    Hi Karoly, Thanks for your input. community: forums.pentaho.com is not allowing new registrations for new users. I guess they accept queries from customers only and not from any one. Do you know any other forum, community, SMEs contacts who can help on queries?

    Questions from the Community
    Top Answer:Hi Rajneesh yes here is the feature comparison between the community and enterprise edition :… more »
    Top Answer: In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, it… more »
    Top Answer:My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could use… more »
    Top Answer:SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free… more »
    Top Answer:The product's deployment phase is easy.
    Top Answer:If you don't want to pay a lot of money, you can go for SSIS, as its open-source version is available. When it comes to licensing, SSIS can be expensive.
    Ranking
    16th
    out of 100 in Data Integration
    Views
    3,346
    Comparisons
    1,127
    Reviews
    15
    Average Words per Review
    1,193
    Rating
    7.7
    2nd
    out of 100 in Data Integration
    Views
    19,568
    Comparisons
    15,878
    Reviews
    35
    Average Words per Review
    471
    Rating
    7.7
    Comparisons
    Also Known As
    Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
    SQL Server Integration Services
    Learn More
    Overview

    Pentaho Data Integration stands as a versatile platform designed to cater to the data integration and analytics needs of organizations, regardless of their size. This powerful solution is the go-to choice for businesses seeking to seamlessly integrate data from diverse sources, including databases, files, and applications. Pentaho Data Integration facilitates the essential tasks of cleaning and transforming data, ensuring it's primed for meaningful analysis. With a wide array of tools for data mining, machine learning, and statistical analysis, Pentaho Data Integration empowers organizations to glean valuable insights from their data. What sets Pentaho Data Integration apart is its maturity and a vibrant community of users and developers, making it a reliable and cost-effective option. Pentaho Data Integration offers a range of features, including a comprehensive ETL toolkit, data cleaning and transformation capabilities, robust data analysis tools, and seamless deployment options for data integration and analytics solutions, making it a go-to solution for organizations seeking to harness the power of their data.

    SSIS is a versatile tool for data integration tasks like ETL processes, data migration, and real-time data processing. Users appreciate its ease of use, data transformation tools, scheduling capabilities, and extensive connectivity options. It enhances productivity and efficiency within organizations by streamlining data-related processes and improving data quality and consistency.

    Sample Customers
    66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
    1. Amazon.com 2. Bank of America 3. Capital One 4. Coca-Cola 5. Dell 6. E*TRADE 7. FedEx 8. Ford Motor Company 9. Google 10. Home Depot 11. IBM 12. Intel 13. JPMorgan Chase 14. Kraft Foods 15. Lockheed Martin 16. McDonald's 17. Microsoft 18. Morgan Stanley 19. Nike 20. Oracle 21. PepsiCo 22. Procter & Gamble 23. Prudential Financial 24. RBC Capital Markets 25. SAP 26. Siemens 27. Sony 28. Toyota 29. UnitedHealth Group 30. Visa 31. Walmart 32. Wells Fargo
    Top Industries
    REVIEWERS
    Healthcare Company19%
    Financial Services Firm19%
    Comms Service Provider11%
    Manufacturing Company11%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company13%
    Comms Service Provider12%
    Government7%
    REVIEWERS
    Financial Services Firm23%
    Government8%
    Retailer8%
    Healthcare Company8%
    VISITORS READING REVIEWS
    Financial Services Firm17%
    Computer Software Company12%
    Government7%
    Healthcare Company6%
    Company Size
    REVIEWERS
    Small Business27%
    Midsize Enterprise31%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business21%
    Midsize Enterprise11%
    Large Enterprise68%
    REVIEWERS
    Small Business27%
    Midsize Enterprise18%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise69%
    Buyer's Guide
    Pentaho Data Integration and Analytics vs. SSIS
    March 2024
    Find out what your peers are saying about Pentaho Data Integration and Analytics vs. SSIS and other solutions. Updated: March 2024.
    768,857 professionals have used our research since 2012.

    Pentaho Data Integration and Analytics is ranked 16th in Data Integration with 48 reviews while SSIS is ranked 2nd in Data Integration with 69 reviews. Pentaho Data Integration and Analytics is rated 8.0, while SSIS is rated 7.6. The top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". On the other hand, the top reviewer of SSIS writes "Maintaining the solution and contacting its support team is easy". Pentaho Data Integration and Analytics is most compared with Azure Data Factory, Talend Open Studio, Oracle Data Integrator (ODI), AWS Glue and SAP Data Services, whereas SSIS is most compared with Informatica PowerCenter, Talend Open Studio, IBM InfoSphere DataStage, Oracle Data Integrator (ODI) and AWS Glue. See our Pentaho Data Integration and Analytics vs. SSIS report.

    See our list of best Data Integration vendors.

    We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.