Compare Amazon EMR vs. Pentaho Data Integration

Find out what your peers are saying about Apache, Cloudera, IBM and others in Hadoop. Updated: March 2021.
473,605 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
Amazon EMR:
"The initial setup is pretty straightforward."

Pentaho Data Integration:
"The solution has a free to use community version."
"The amount of data that it loads and processes is good."
"Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."

Cons
Amazon EMR:
"The dashboard management could be better. Right now, it's lacking a bit."

Pentaho Data Integration:
"It's not very stable, at least not in the case of the community edition. I'm working with the community edition right now and I think perhaps it is because of that it is not very stable, it causes the system to sometimes hang. I'm not sure if this is the case for paid tiers."
"I would like to see improvements made for real-time data processing."
"I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."

Pricing and Cost Advice
Amazon EMR:
Information Not Available

Pentaho Data Integration:
"The price of the regular version is not reasonable and it should be lower."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."

Questions from the Community
Top Answer: The initial setup is pretty straightforward.
Top Answer: The price of the solution may be a bit more than other competitors, such as Microsoft.
Top Answer: The dashboard management could be better. Right now, it's lacking a bit. I'd like more of a remote connection between my computer and the solution. We have multi-factor authentication, and at one…
Top Answer: Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition :…
Top Answer: Depends upon the technologies being used. If you're using Oracle for both OLTP and OLAP then you'll get a lot of value from an Oracle solution. The other question is how up to date do you want your…
Top Answer: Pentaho Data Integration is quite simple to learn, and there is a lot of information available online.
Ranking
Amazon EMR: 7th out of 22 in Hadoop
Views: 2,514
Comparisons: 2,091
Reviews: 1
Average Words per Review: 492
Rating: 4.0

Pentaho Data Integration: 15th
Views: 10,067
Comparisons: 8,185
Reviews: 3
Average Words per Review: 652
Rating: 7.7
Also Known As
Amazon EMR: Amazon Elastic MapReduce
Pentaho Data Integration: Kettle
Overview
Amazon Elastic MapReduce (Amazon EMR) is a web service that makes it easy to quickly and cost-effectively process vast amounts of data. Amazon EMR simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective for you to distribute and process vast amounts of your data across dynamically scalable Amazon EC2 instances.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights. The complete data integration platform delivers accurate, "analytics ready" data to end users from any source. With visual tools to eliminate coding and complexity, Pentaho puts big data and all data sources at the fingertips of business and IT users alike.

Sample Customers
Amazon EMR: Yelp
Pentaho Data Integration: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Top Industries
Amazon EMR, visitors reading reviews:
Computer Software Company: 27%
Media Company: 24%
Comms Service Provider: 13%
Insurance Company: 5%

Pentaho Data Integration, reviewers:
Government: 15%
Comms Service Provider: 15%
Healthcare Company: 15%
Financial Services Firm: 15%

Pentaho Data Integration, visitors reading reviews:
Computer Software Company: 27%
Comms Service Provider: 20%
Financial Services Firm: 8%
Government: 7%
Company Size
Amazon EMR: No Data Available

Pentaho Data Integration, reviewers:
Small Business: 25%
Midsize Enterprise: 25%
Large Enterprise: 50%

Amazon EMR is ranked 7th in Hadoop with 1 review while Pentaho Data Integration is ranked 15th in Data Integration Tools with 3 reviews. Amazon EMR is rated 4.0, while Pentaho Data Integration is rated 7.6. The top reviewer of Amazon EMR writes "Stable but could offer better dashboard management and workarounds for multi-factor authentication". On the other hand, the top reviewer of Pentaho Data Integration writes "Free to use, easy to set up, and has a great metadata injection feature". Amazon EMR is most compared with Cloudera Distribution for Hadoop, Hortonworks Data Platform, Apache Spark, HPE Ezmeral Data Fabric and BlueData, whereas Pentaho Data Integration is most compared with Talend Open Studio, SSIS, Informatica PowerCenter, IBM InfoSphere DataStage and Oracle Data Integrator (ODI).


We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.