Pentaho Data Integration and Analytics vs SSIS comparison

Cancel
You must select at least 2 products to compare!
Hitachi Vantara Logo
3,247 views|1,075 comparisons
94% willing to recommend
Microsoft Logo
Read 69 SSIS reviews
19,105 views|15,500 comparisons
79% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Pentaho Data Integration and Analytics and SSIS based on real PeerSpot user reviews.

Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Pentaho Data Integration and Analytics vs. SSIS Report (Updated: May 2024).
771,212 professionals have used our research since 2012.
Q&A Highlights
Question: Which ETL tool would you recommend to populate data from OLTP to OLAP?
Answer: We have experiences only in Pentaho Data Integrator (open source competitor of the Oracle Data Integrator). But OLAP exporting wasn't in our scope until now.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us.""I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source.""We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic.""I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created.""One of the most valuable features is the ability to create many API integrations. I'm always working with advertising agents and using Facebook and Instagram to do campaigns. We use Pentaho to get the results from these campaigns and to create dashboards to analyze the results.""Pentaho Data Integration is quite simple to learn, and there is a lot of information available online.""The amount of data that it loads and processes is good.""The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."

More Pentaho Data Integration and Analytics Pros →

"The UI is very user-friendly.""The main value of any Microsoft product is the ease of use. You can achieve more with less time. That's what's beneficial for me. With many competitors, you might need to spend more time coming up with a solution because you have to focus on taking care of the product.""I have used most of the standard SQL features, but the ones that stand out are the Data Flows and Bulk Import.""The initial setup was easy.""Data Flows are the main component we use. These can range from a simple source to sink ETL, to many source to many sink dataflows.""SSIS' best feature is SFTP connectivity.""SSIS integrates well with SQL servers and Microsoft products.""The simplicity of the solution is great. The solution also offers excellent integration."

More SSIS Pros →

Cons
"In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version.""Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying.""I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors.""​I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support.​""I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it.""Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in.""I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking.""Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools."

More Pentaho Data Integration and Analytics Cons →

"Microsoft should offer an on-premises support warranty for those using that deployment. They seem to be withdrawing from on-premises options.""The solution could improve on integrating with other types of data sources.""SSIS is cumbersome despite its drag-and-drop functionality. For example, let's say I have 50 tables with 30 columns. You need to set a data type for each column and table. That's around 1,500 objects. It gets unwieldy adding validation for every column. Previously, SSIS automatically detected the data type, but I think they removed this feature. It would automatically detect if it's an integer, primary key, or foreign key column. You had fewer problems building the model.""We'd like them to develop data exploration more.""The high prices attached to the product can be an area of concern where improvements are required.""We have a stability problem because when something works, it works one time. The next time, it doesn't work.""SSIS is stable, but extensive ETL data processing can have some performance issues.""A change in the metadata source cripples the whole ETL process, requiring each module to be manually reopened."

More SSIS Cons →

Pricing and Cost Advice
  • "There is a good open source option (Community Edition)​."
  • "The price of the regular version is not reasonable and it should be lower."
  • "Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
  • "It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
  • "I think Lumada's price is fair compared to some of the others, like BusinessObjects, which is was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
  • "When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
  • "The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
  • "The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
  • More Pentaho Data Integration and Analytics Pricing and Cost Advice →

  • "This solution has provided an inexpensive tool, and it is easy to find experienced developers."
  • "My advice is to look at what your configuration will be because most companies have their own deals with Microsoft."
  • "This solution is included with the MSSQL server package."
  • "It would be beneficial if the solution had a less costly cloud offering."
  • "Based on my experience and understanding, Talend comes out to be a little bit expensive as compared to SSIS. The average cost of having Talend with Talend Management Console is around 72K per region, which is much higher than SSIS. SSIS works very well with Microsoft technologies, and if you have Microsoft technologies, it is not really expensive to have SSIS. If you have SQL Server, SSIS is free."
  • "We have an enterprise license for this solution."
  • "It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
  • "All of my clients have this product included as part of their Microsoft license."
  • More SSIS Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
    771,212 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations There are many third-party vendors offering ETL solutions, but two of the most popular are PowerCenter Informatica and Microsoft SSIS (SQL Server Integration Services). Each technology has its advantages but there are also similarities on how they carry out the extract-transform-load processes and only differ in terminologies. If you’re in the process of choosing ETL tools and PowerCenter Informatica and Microsoft SSIS made it to your shortlist, here is a short comparative discussion detailing the differences between the two, as well as their benefits. Package Configuration Most enterprise data integration projects would require the capacity to develop a solution in one platform and test and deploy it in a separate environment without having to manually change the established workflow. In order to achieve this seamless movement between two environments, your ETL technology should allow the dynamic update of the project’s properties using the content or a parameter file or configuration. Both Informatica and SSIS support this functionality using different methodologies. In Informatica, every session can have more than one source and one or more destination connections. There are… Read more →
    Answers from the Community
    RajneeshShukla
    AThiré - PeerSpot reviewerAThiré
    User

    There are two products I know about
    * TimeXtender : Microsoft based, Transformation logic is quiet good and can easily be extended with T-SQL , Has a semantic layer that generates metat data for cubes . price approx 40K$, works with tables
    . Attunity (Bought by Qlik) : technology agnostic , nice web interface , expensive > 100K€. Works with transaction logs

    There are many other pure ETL tools
    * ERWIN has a nice one ,

    Phil Wilkins - PeerSpot reviewerPhil Wilkins (Capgemini)
    Consultant

    Depends upon the technologies being used. If you're using Oracle for both OLTP and OLAP then you'll get a lot of value from an Oracle solution.


    The other question is how up to date do you want your OLAP DB to be? Goldengate is a good answer if you're looking to minimize latency, but it can be expensive. ODI is less expensive but better suited to bulkier data sets.  If an Oracle product wasn't the option I'd probably consider something like Informatica.

    Karoly Krokovay - PeerSpot reviewerKaroly Krokovay
    Real User

    Hi Rajneesh,
    yes here is the feature comparison between the community and enterprise edition : www.hitachivantara.com

    And a short description of the community edition: www.predictiveanalyticstoday.com

    And the download link: community.hitachivantara.com

    You can ask more from the great community: forums.pentaho.com

    Regards
    Károly

    Stefan Schäfer - PeerSpot reviewerStefan Schäfer
    User

    We usually use Talend.
    Look here: community.talend.com

    GaryM - PeerSpot reviewerGaryM
    Real User

    As someone mentioned, if you're purely Oracle shop and staying that way then there's value with prioritizing Oracle tools.  However, let me contrast that with this caveat...


    Consider expectations for tool and vendor longevity. Oracle has a long history of retiring and/or replacing tools leaving customers in the cold with prior versions/tools (I've been burned multiple times by Oracle product retirements or replacements including OWB, Oracle Designer2k, Oracle Express, Oracle OEDW, their purchase of Sagent ETL which as later abandoned).


    But I would also consider these questions and relative prioritization:  


    What is your organization's plans for moving to other database technologies?  


    Where is your org going with on-prem versus cloud solutions?  How important are PaaS versus IaaS solutions?  


    Where is your current staff's expertise?  


    Prioritize mature over immature tools. 


    How many sources do you have?  What are their technologies and does the integration tool support them?


    Is it just moving data from a single ERP such as Oracle EBS to Olap? When you say Olap what do you mean by that?  Are you talking Oracle Olap product or something else?  That makes a really big difference of course - if your ETL tool doesn't support your source(s) and target(s) then it shouldn't be considered.


    Given the industry's trajectory, I myself would highly prioritize PaaS solutions over others.

    Yeap Jiun Aing - PeerSpot reviewerYeap Jiun Aing
    Real User

    What is the OLAP that you are using? Hosted in Cloud or on-premise? 


    The target DB should have its tool to extract data.

    Efe Dagistanli - PeerSpot reviewerEfe Dagistanli
    User

    Pentaho is a really nice tool if opensource is the only option. 


    Please think about issues such as upgrade and disaster in the future. These operations are very easy in Pentaho.


    I can only suggest one thing for replication and that is Qlik. (ex-Attunity).

    RajneeshShukla - PeerSpot reviewerRajneeshShukla
    Real User

    Hi Karoly, Thanks for your input. community: forums.pentaho.com is not allowing new registrations for new users. I guess they accept queries from customers only and not from any one. Do you know any other forum, community, SMEs contacts who can help on queries?

    Questions from the Community
    Top Answer:Hi Rajneesh yes here is the feature comparison between the community and enterprise edition :… more »
    Top Answer: In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, it… more »
    Top Answer:My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could use… more »
    Top Answer:SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free… more »
    Top Answer:The product's deployment phase is easy.
    Top Answer:If you don't want to pay a lot of money, you can go for SSIS, as its open-source version is available. When it comes to licensing, SSIS can be expensive.
    Ranking
    15th
    out of 101 in Data Integration
    Views
    3,247
    Comparisons
    1,075
    Reviews
    10
    Average Words per Review
    1,105
    Rating
    7.5
    2nd
    out of 101 in Data Integration
    Views
    19,105
    Comparisons
    15,500
    Reviews
    35
    Average Words per Review
    474
    Rating
    7.8
    Comparisons
    Also Known As
    Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
    SQL Server Integration Services
    Learn More
    Overview

    Pentaho Data Integration stands as a versatile platform designed to cater to the data integration and analytics needs of organizations, regardless of their size. This powerful solution is the go-to choice for businesses seeking to seamlessly integrate data from diverse sources, including databases, files, and applications. Pentaho Data Integration facilitates the essential tasks of cleaning and transforming data, ensuring it's primed for meaningful analysis. With a wide array of tools for data mining, machine learning, and statistical analysis, Pentaho Data Integration empowers organizations to glean valuable insights from their data. What sets Pentaho Data Integration apart is its maturity and a vibrant community of users and developers, making it a reliable and cost-effective option. Pentaho Data Integration offers a range of features, including a comprehensive ETL toolkit, data cleaning and transformation capabilities, robust data analysis tools, and seamless deployment options for data integration and analytics solutions, making it a go-to solution for organizations seeking to harness the power of their data.

    SSIS is a versatile tool for data integration tasks like ETL processes, data migration, and real-time data processing. Users appreciate its ease of use, data transformation tools, scheduling capabilities, and extensive connectivity options. It enhances productivity and efficiency within organizations by streamlining data-related processes and improving data quality and consistency.

    Sample Customers
    66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
    1. Amazon.com 2. Bank of America 3. Capital One 4. Coca-Cola 5. Dell 6. E*TRADE 7. FedEx 8. Ford Motor Company 9. Google 10. Home Depot 11. IBM 12. Intel 13. JPMorgan Chase 14. Kraft Foods 15. Lockheed Martin 16. McDonald's 17. Microsoft 18. Morgan Stanley 19. Nike 20. Oracle 21. PepsiCo 22. Procter & Gamble 23. Prudential Financial 24. RBC Capital Markets 25. SAP 26. Siemens 27. Sony 28. Toyota 29. UnitedHealth Group 30. Visa 31. Walmart 32. Wells Fargo
    Top Industries
    REVIEWERS
    Healthcare Company19%
    Financial Services Firm19%
    Comms Service Provider11%
    Manufacturing Company11%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company14%
    Comms Service Provider12%
    Government7%
    REVIEWERS
    Financial Services Firm23%
    Government8%
    Retailer8%
    Healthcare Company8%
    VISITORS READING REVIEWS
    Financial Services Firm17%
    Computer Software Company12%
    Government7%
    Healthcare Company6%
    Company Size
    REVIEWERS
    Small Business27%
    Midsize Enterprise31%
    Large Enterprise42%
    VISITORS READING REVIEWS
    Small Business21%
    Midsize Enterprise11%
    Large Enterprise68%
    REVIEWERS
    Small Business27%
    Midsize Enterprise17%
    Large Enterprise56%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise69%
    Buyer's Guide
    Pentaho Data Integration and Analytics vs. SSIS
    May 2024
    Find out what your peers are saying about Pentaho Data Integration and Analytics vs. SSIS and other solutions. Updated: May 2024.
    771,212 professionals have used our research since 2012.

    Pentaho Data Integration and Analytics is ranked 15th in Data Integration with 48 reviews while SSIS is ranked 2nd in Data Integration with 69 reviews. Pentaho Data Integration and Analytics is rated 8.0, while SSIS is rated 7.6. The top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". On the other hand, the top reviewer of SSIS writes "Maintaining the solution and contacting its support team is easy". Pentaho Data Integration and Analytics is most compared with Azure Data Factory, Talend Open Studio, Oracle Data Integrator (ODI), AWS Glue and SAP Data Services, whereas SSIS is most compared with Informatica PowerCenter, Talend Open Studio, IBM InfoSphere DataStage, Oracle Data Integrator (ODI) and AWS Glue. See our Pentaho Data Integration and Analytics vs. SSIS report.

    See our list of best Data Integration vendors.

    We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.