Which one is best for ETL - Pentaho or Jaspersoft?

547
46

I need to design reports using Jaspersoft iReport. Is it better to use Pentaho or Jaspersoft in order to clean the data first?

Thanks for your help.

Anonymous avatar x30
Guest
As seen in
Logosasseeninsmall

46 Answers

Francesco principe li?1418641588

Hi,
I've used all tools that other persons have spoken above, but if you really want an integrated open source tool, you have to use pentaho. Infact with it you can:
- read data
- clean data
- present it, in olap analisys or flat mode
so without try to integrate different software (talend, jasper and so on) you can choose a already integrated platform with 0 integration cost

Like (0)21 September 15
0 aluwgyd8da6h0ys8sqmboj5cetneyus8uxlxujjuq9ziushaazv5juwmnq5?1414330594
Greg DecklerReal User

In general, Jaspersoft has more of a focus on reporting and analysis and Pentaho is much better at data integration, ETL and workflow. My experience with Jaspersoft is that it uses Talend whereas Pentaho uses Pentaho Data Integrator (Kettle). Kettle was developed by Pentaho. Jaspersoft uses Talend Open Studio but has some missing functionality.

Like (0)17 September 15
Anonymous avatar x30

Pentaho definitely offers stronger data cleansing \ preparation when compared to Jaspersoft, but then you've got the frustration of having to get both Pentaho and Jasper iReport to speak the same language every time you add an additional dataset. Neither of these possibilities is a good solution for really messy data.

Are you limited only to those two tools? I have had some good experience with Sisense's cube features for data preparation, and then you have all the visualization and dashboard features in the same platform. Otherwise I would look into DW as suggested by others.

Like (0)17 September 15
David wolpert avatar 1431454098?1431454096

Both JasperETL and Pentaho Kettle can help you clean and prepare your data for reporting. Since you plan on using Jaspersoft for creating the reports you'd be better off using JasperETL. Two reasons for that; if later on you decide to upgrade to a commercial version of Jaspersoft you'd simply import the ETL jobs into the enterprise edition and take advantage of the additional features. Additionally there are plug-ins for JasperETL that can be helpful for reporting. One last tip: try Jaspersoft Studio it has replaced iReport with a lot more functionality while backwards compatible with iReport created jrxmls.

Like (0)16 September 15
Anonymous avatar x30

according to my view Pentaho is good for data integration or etl

Like (0)16 September 15
Patrick de witt li?1414329470
Patrick de WittConsultantTOP 5

Hi,

As already mentioned by others Jaspersoft uses a version of Talend and call it Jaspersoft ETL. The version is an older version normally than is available from Talend itself. I would suggest using the latest version of Talend itself.

There are two versions, TOS (Talend Open Studio) and the Commercial versions.

TOS is free and also in different versions like DI (Data Integration) and MDM (Meta Data Management). The last one includes the possibilities of DI (ETL) but also Data Profiling where you can monitor the quality of your data.

Talend has several versions and the Commercial versions have as benefit the integration and sharing between the versions and also scheduling and monitoring of jobs included which is real neat.

So for starting the best is to take TOS MDM get to know it and how to use it. If you are going for real professional use in production I would suggest to take the commercial versions because it save you a lot on management and development on scheduling and monitoring of jobs.

I am currently working with TOS MDM, and also have worked with TOS DI in combination with Jaspersoft Server, because we already have the management and monitoring arranged in another way and is out of my scope.

Kind regards,
Patrick

Like (1)16 September 15
Anonymous avatar x30
DirectorBA027Real User

Dear Ariel,

Hello.

We had not used Jaspersoft or Pentaho before. However based on brief reading on Gartner BI magic quadrant report, it sounds like Pentaho could better.

Again it is much dependent on which part of the data needs to be cleaned (e.g. misspelling of name vs. dates), and the nature of data cleaning process (names formatting vs date formatting) needs to be done.

Alternatively your colleague can also try QlikView, which has basic ETL functionalities and QlikView is a magic quadrant leader.

A perpetually free QlikView personal edition for desktop license can be downloaded and used from http://www.qlik.com/us/explore/products/free-download?ga-link=hero

Warm regards,

Teoh Sin Hin

Like (0)16 September 15
3919b651 995e 4a2c b8d9 6a2cd2916479 avatar?1437899131

I recommend to use Talend Open Studio as it is tied up with Jaspersoft.

Like (0)16 September 15
Anonymous avatar x30

I have not used Jaspersoft to comment on it.

Pentaho Kettle is a good ETL tool and open source too. Learning curve is
small and has a GUI IDE. Kettle is written in Java.

At a generic level, for ETL selection
a. What is the volume of data?
b. Complexity of the transformation needed?
c. Performance of the overall system
d. Format of data (Pentaho Kettle is good to support all the 4 pretty
well)

Hope this helps.

Good luck

Prashanth

Like (0)16 September 15
Anonymous avatar x30
Jun WuReal User

Hi there,

I have no experience on Jaspersoft, but with Pentaho Spoon I can have my job done. With a couple of hours learning, I can start to author my own transformation. If your job is staight-forward and relatively simple, you can definitely use Pentaho. I am not sure about more complex jobs. FYI.

Jerry

Like (0)16 September 15
Picture 742 1361119649
MartySmithReal User

The data cleansing capability of the two products is very comparable. But it really depends on the volume of data and the time allowed to clean and load the data.. I would not suggest either product for large data volumes

Like (0)16 September 15
Carolyn french li?1419984313
Carolyn FrenchConsultantTOP 10

It is better to use Pentaho for unstructured big data ETLs but both tool should be sufficient for data cleanup.

Like (0)15 September 15
Anonymous avatar x30
Ted WichmanReal User

Neither really specialize in data cleansing. Data cleansing is as much a process as it is a technology. Depending on the size of your dataset, Excel can be a powerful tool in the identification of the "unclean" data. From there you will want to set up a process around making sure your data either conforms to your cleanliness standard from the source (fix it before it is a problem) or make filtering out unclean data part of an ETL process that is invoked on it's way to your reports.
I would venture that Pentaho (Kettle) would have a more mature set of tools that could assist you in your quest. Either way, you will need a detailed understanding of your source data.

Like (0)15 September 15
Gaur pan li?1429195572
Gaur PanConsultant

My vote goes to Jaspersoft. Here are my thoughts.

Community support is more in Jaspersoft as compared to Pentaho. Also, the
community members are more active of Jaspersoft community, hence if you are
using CE, the chances of problem resolution is much more.

Jaspesoft uses R. R is more advanced and more capable in Predictive
analysis.

Jaspersoft scores higher than Pentaho in Mobile BI compatibility. They have
BI for iOS, Android platforms. Functionality wise and user interface wise
Jaspersoft mobile BI is better. They even have an opensouce Apple iphone
application. Pentaho users have to often rely on third party tools to
enhance mobile BI experience.

Summarizing Jasper has a heavy focus on reporting and analysis. It has a
better web 2.0 user interface than Pentaho and is generally easier to use.
It benefits from better marketing, informational web sites, and
documentation. Overall, it makes researching and finding what users need
easier than Pentaho. Whereas Pentaho focuses on data integration, ETL, and
workflow automation. It has a user interface that is more difficult for
business users than is Jasper’s.

Hope this helps.

For any additional information, revert to me please.

Thanks
Gary

Like (0)15 September 15
Ca215cc1 dad1 4b51 9338 5fd882a36e4f avatar?1437466881
Eric SmetsReal User

Hello,

For cleaning the data the ETL of pentaho works like a charm, this is the better way to go.

As we are using Pentaho (community edition) for ETL jobs, we used this also for building reports.

Jasper reports is easier to integrate in an other software from a developpers point, But the pentaho report it self

are OK and easy to use.

Regards,

Eric Smets

Like (0)15 September 15
Anonymous avatar x30
vpresear79230Real User

Pentaho is going to be much better for ETL especially if dealing with any sort of big data. Their Pentaho Data Integration (PDI) tool is solid. They also have a reporting tool.

Like (0)15 September 15
27551a2d e1ec 409a 9c71 52a5b4363ae9 avatar?1442333167
Roberto Camacho ConsultantTOP 10

I don't use any of them. I use Microsoft SSIS. Regards

Like (0)15 September 15
Ben inkster li?1414335324
Ben InksterReal UserPOPULAR

If it's just a matter of cleaning up data to output (regardless of the
reporting system), then it mainly depends on whether you are comparing the
community version or subscription version. I think Pentaho is stronger in
the community version while Jaspersoft is stronger is the subscription
version.

Since you are looking to design reports using iReport (Jaspersoft), I would
lean towards using Jaspersoft to clean the data. The outputs of the cleanup
process will line up nicely with the inputs for iReport.

Ben Inkster

Like (0)15 September 15
Gareth crawshaw li?1414337963
Gareth CrawshawConsultantTOP 5POPULAR

Only have background in Pentaho usage so can’t speak to how easy Jaspersoft is to use, however, I have to say Pentaho DI is excellent and is one of the most straightforward ETL tools I’ve seen in 15 years of working in BI. It is an extremely user friendly interface and has an impressive and passionate user community who can help you if you get stuck, as well as excellent documentation on how to do things. I’d thoroughly recommend using Pentaho DI

Gareth P Crawshaw | BIS Technical Analyst | Equifax UK

Like (0)15 September 15
Anonymous avatar x30

You report design and generation tools are not generally ideally
suited for ETL purpose. Depending on the complexity of data and
volume, you can choose one of the leading ETL tools in the market.
There are some Open Source system based ETL tools are also available.
Please check it under Eclipse platform. Otherwise I would recommend
one of these tools: Informatica MDM / Power Center, MS SQL Data
Extractor, SAP Data Integrator or Oracle Warehouse Builder. You may
also choose to write your own script and PL/SQL if the cleansing does
not have much challenges and straightforward.

Like (0)15 September 15
Anonymous avatar x30

Here you go..

Jaspersoft has moved on now and the latest version of iReport is called
Jaspersoft Studio, an IDE based on Eclipse. Jaspersoft ETL is not a bolt-on
ETL tool likes Pentaho's, it is an OEM of the Talend Data Integration
Product. Talend is an all-encompassing ETL tool which not only allows you
to bring data from virtually anywhere (including Google Analytics,
Salesforce, SAP etc) but the intuitive workflow designer makes it easy to
get up and running. The deployment of completed workflows can only be
described as a breeze and its worth downloading and having a play ..
"Seeing is Believing".

Pentaho, on the other hand, primarily (I think) is a BI Company and
therefore as a consequence its ETL tool does not stack up to the Jaspersoft
Offering. But like all these things the proof is in the pudding ..

I hope that helps!

Will

Like (0)15 September 15
Ivan de vargas lopes jr cbpp pmp avatar 1433254730?1433254728

Ideally, you use an ETL tool like Talend or SSIS (SQL Server Integration Services), because they are specialized tools for data processing. But between the two tools mentioned, the Pentaho is simpler than the Jaspersoft.

Like (0)15 September 15
Reviewer183738 li?1421772048
Angel MonjarásReal UserTOP 10

Hello Ariel!
Sorry. Not familiar enough with those tools.
Have a nice day!

On 15/09/15 06:46, Ariel Lindenfeld - Community Manager at IT Central

Like (0)15 September 15
0 xkxrfbvfqvj3m6uezkooe6nf1wwtv92ez sgd6k3jjogjcy6v85pwhz1pl7?1414330426

JasperSoft unlike Pentaho does not have its own ETL tool. It bundles Talend
Studio.
As for Talend vs Pentaho ETL (Kettle), Kettle is more intuitive (easier to
start), but less feature-rich. Kettle is better integrated with Pentaho
than Talend with JasperSoft.
One can find a lot of comparisons like this:
http://www.robertomarchetto.com/talend_studio_vs_kettle_pentao_pdi_comparison

Like (0)15 September 15
Picture 2326 1370856731
Nikhilesh TiwariReal UserTOP REVIEWERPOPULAR

Jaspersoft is having tieup with talend. With jaspersoft commercial version they provide talend. Hence if you plan to use jaspersoft it would be more recommended to use talend rather than kettle.

Like (1)15 September 15
Anonymous avatar x30
chaferUser

I personally use Talend for Data Integration and ETL on a daily basis. It is versatile and Jaspersoft uses it in their 3rd-party product for ETL called JaspersoftETL.

There are free versions of Talend products, called "Open Studio," as well as Enterprise versions. Talend Open Studio is their ETL tool and Talend Data Quality is their cleansing data product. What is really nice about the Open Studio products are the many data connecting components and the ease of use in doing data integration.

https://www.talend.com/products/talend-open-studio

Hope this helps.

Thank you,
Christopher Hafer

Like (2)15 September 15
5bcd1d62 0eca 403c b7ed df8611fcf47f avatar
Avinash MendseConsultantTOP 20

Dear Sir,

You can use Crystal Reports for Java. It have reach UI and more options as Drill Down, Hyperlink etc.

Thanks

Like (0)15 September 15
Anonymous avatar x30
Saudi MostafaConsultantTOP 20

Pentaho is good
It open source but it needs trining
Kind Regards
Dr. Saudi A. Baset
IT Advisor
Tel +2010-1621778

--------------------------------------------

Like (0)15 September 15
Anonymous avatar x30
David DReal User

I had used Talend and is quite easy to learn and to make complex process with files or data, Even Big Data. if you wanna start to develop I recommened Talend because learning curve. it´s faster to understand.

Like (0)15 September 15
Nikos tsantanis li?1427801887

Hi,
Actually we have been using Pentaho ETL and we are quite satisfied with this. We wouldn’t have the minimum hesitation to use in even in complex projects.
As per Jasper, even if we are aware of it we haven’t used it yet; however, if the front end is from Jasper as well, I would consider using it first.
I hope I as kind of helpful.

Like (0)15 September 15
A3d6b732 0d91 4200 a2f5 1500a9d7c9d6 avatar
Steve CornettReal UserTOP 5

I have never worked with nor seen Pentaho, but Jaspersoft is for embedding analytics/reports within applications predominantly in a Java application framework. I don't believe it has any "clean the data first" capabilities, which is more of a data quality/ETL requirement.

Like (0)15 September 15
Anonymous avatar x30
John TwistReal User

I would rather use Pentaho, but I may be biased because I have been using
it for such a long time now.

Like (0)15 September 15
179b8e92 620d 4bcb ab3c a075cbbbe6a5 avatar?1441719278

If i would start a new Integration/ETL project today with an open source tool, I will use Talend DI, for sure.

Like (0)15 September 15
23aee285 59b8 429e 9aca f2e6f3d8682f avatar
Igor Korelic PhDReal UserTOP 5

Jaspersoft is strict reporting tool, has no ETL capability. Pentaho has some ETL capabilities, but out of my experience I Will always use Talend DI.

Like (0)15 September 15
179b8e92 620d 4bcb ab3c a075cbbbe6a5 avatar?1441719278

Hello,
Like Bruce said, Jaspersoft is a visualization tool, I does not apply for cleasing and transformation. If your need is to clean and transform your data first and you need an open source solution, two ETL tools there are very used are Talend and Pentaho, in my experience I recommend you the first one.

Like (0)15 September 15
Naresh babu li?1423826335
Naresh BabuReal UserTOP 20

use Jaspersoft , easy integration and reporting is also easy.if you are going for datawarehouse and needs faster execution go for Netezza DB , ETL Data stage / Informatica(MDM serves your purpose of data cleaning, clensing). for reporting can use MSTR/Cognos/Tableau.SSRS/SSIS also serves your purpose.

Like (0)15 September 15
Efe30d2d fa77 4924 9b4b 6d861abe53e3 avatar?1456215423

I would prefer Jaspersoft over pentaho for reporting.

I prefer pentaho over jaspersoft for creating dashboards..

Both the tools are community not enterprise.

Thanks,
Sadakar

Like (0)15 September 15
D9591144 9d45 42cd 9eed 22e9f69a9634 avatar

Makes no difference really from a design and report point of view.

They both are created from the Kettle codebase however Pentaho with its
latest release is revamping its data ingestion solutions so its ahead of
the curve against Jasper ETL.

If you already paid for the full Jasper licences then its a no brainer to
use Jasper toolkit.

REgards

Like (0)15 September 15
Anonymous avatar x30
Amrish BadalConsultant

Pentaho. Performance wise and adaptability to existing architecture. Datacleaning is easy

Like (0)15 September 15
Anonymous avatar x30

Hello Ariel,

Pentaho is a more complete suite.

Like (0)15 September 15
Yusuf fachruddin li?1414338222

Hi,

using Pentaho Spoon for ETL, easy to use with a lot of features for clean
and transform your data from source to target.

Regards,

Yusuf Fachruddin

Like (0)15 September 15
Ruslan podgaets avatar 1433964457?1433964455

I've used Talend ETL in the past (Which is what's included in Jaspersoft
package as an ETL tool). Attempted pentaho in the past but for a few
reasons it didn't work out. Talend is buggy at times but handles small to
medium volumes well.

Like (0)15 September 15
Anonymous avatar x30
Bruce WesterReal User

I have limited knowledge of Jaspersoft, from my research it is primarily a visualization tool and it does not have the back end cleansing capabilities that Pentaho offers. If you need to clean up your data, I think Pentaho is the better of these options.

Like (1)15 September 15
Anonymous avatar x30
Haroon SarfrazConsultant

Pentaho has more data integration and analytics options, however it also depends what budget you have for your task in hand.

Regards,

Haroon Sarfraz Janjua

Project Manager

Like (0)15 September 15
Aa7a7ea4 b06f 439b a44a 455a0f6d2326 avatar?1436271351

Hello,

For ETL, I would suggest using Pentaho as it does have some advantages over Jaspersoft.

Regards,

Mohammad Aamer

BI Functional Consultant

Like (0)15 September 15
Ca215cc1 dad1 4b51 9338 5fd882a36e4f avatar?1437466881
Eric SmetsReal User

Hello,

For cleaning the data the ETL of pentaho works like a charm, this is the
better way to go.

As we are using Pentaho (community edition) for ETL jobs, we used this also
for building reports.

Jasper reports is easier to integrate in an other software from a
developpers point, But the pentaho report it self

are OK and easy to use.

Regards,

Eric Smets

Like (0)15 September 15
As seen in
Logosasseeninsmall

Sign Up with Email