We performed a comparison between Amazon EMR and Pentaho Business Analytics based on real PeerSpot user reviews.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop."The initial setup is pretty straightforward."
"The solution is pretty simple to set up."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"This is the best tool for hosts and it's really flexible and scalable."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"It allows users to access the data through a web interface."
"The project management is very streamlined."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The most valuable feature of Pentaho is the Tableau report."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
"The initial setup is pretty straightforward."
"Easy to use components to create the job."
"We were able to install it without any assistance from tech support."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
"The problem for us is it starts very slow."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"The product's features for storing data in static clusters could be better."
"Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."
"The dashboard management could be better. Right now, it's lacking a bit."
"The legacy versions of the solution are not supported in the new versions."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"There is room for improvement in pricing."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
"Pentaho, at the general level, should greatly improve the easy construction of its dashboards and easy integration of information from different sources without technical user intervention."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"Version control would be a good addition."
"Another concern is that Pentaho is not customizable or interactive."
"Pentaho Business Analytics' user interface is outdated."
"The repository should be improved."
"Logging capability is needed."
Amazon EMR is ranked 3rd in Hadoop with 20 reviews while Pentaho Business Analytics is ranked 21st in BI (Business Intelligence) Tools with 42 reviews. Amazon EMR is rated 7.8, while Pentaho Business Analytics is rated 8.0. The top reviewer of Amazon EMR writes "Provides efficient data processing features and has good scalability ". On the other hand, the top reviewer of Pentaho Business Analytics writes "Flexible, easy to understand, and simple to set up". Amazon EMR is most compared with Snowflake, Cloudera Distribution for Hadoop, Azure Data Factory, Amazon Redshift and Apache Spark, whereas Pentaho Business Analytics is most compared with Microsoft Power BI, Databricks, Microsoft SQL Server Reporting Services, SAP Crystal Reports and Tableau.
We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.