We performed a comparison between Amazon EMR and Pentaho Business Analytics based on real PeerSpot user reviews.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop."In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"It allows users to access the data through a web interface."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"The solution helps us manage huge volumes of data."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"This is the best tool for hosts and it's really flexible and scalable."
"The initial setup is pretty straightforward."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
"We were able to install it without any assistance from tech support."
"The initial setup is pretty straightforward."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
"Easy to use components to create the job."
"The most valuable feature of Pentaho is the Tableau report."
"The initial setup was time-consuming."
"The problem for us is it starts very slow."
"The product must add some of the latest technologies to provide more flexibility to the users."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"There is room for improvement in pricing."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"Pentaho Business Analytics' user interface is outdated."
"Logging capability is needed."
"Version control would be a good addition."
"Pentaho, at the general level, should greatly improve the easy construction of its dashboards and easy integration of information from different sources without technical user intervention."
"Another concern is that Pentaho is not customizable or interactive."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
"The repository should be improved."
Amazon EMR is ranked 3rd in Hadoop with 20 reviews while Pentaho Business Analytics is ranked 19th in BI (Business Intelligence) Tools with 42 reviews. Amazon EMR is rated 7.8, while Pentaho Business Analytics is rated 8.0. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Pentaho Business Analytics writes "Flexible, easy to understand, and simple to set up". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Amazon Redshift and Apache Spark, whereas Pentaho Business Analytics is most compared with Microsoft Power BI, Databricks, KNIME, SAP Crystal Reports and Microsoft SQL Server Reporting Services.
We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.