Compare Apache Spark vs. SAP HANA

Apache Spark is ranked 1st in Hadoop with 7 reviews while SAP HANA is ranked 1st in Embedded Database Software with 11 reviews. Apache Spark is rated 7.8, while SAP HANA is rated 8.2. The top reviewer of Apache Spark writes "Fast performance and has an easy initial setup". On the other hand, the top reviewer of SAP HANA writes "We can get more tenders because of the lower cost while providing a better product or service". Apache Spark is most compared with Spring Boot, AWS Lambda and Azure Stream Analytics, whereas SAP HANA is most compared with SQL Server, Oracle Database and Teradata.
Cancel
You must select at least 2 products to compare!
Apache Spark Logo
11,048 views|9,201 comparisons
SAP HANA Logo
10,414 views|8,535 comparisons
Most Helpful Review
Find out what your peers are saying about Apache, Cloudera, Hortonworks and others in Hadoop. Updated: December 2019.
383,162 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
The solution is very stable.The most valuable feature of this solution is its capacity for processing large amounts of data.I found the solution stable. We haven't had any problems with it.The scalability has been the most valuable aspect of the solution.Features include machine learning, real time streaming, and data processing.The fault tolerant feature is provided.It provides a scalable machine learning library.

Read more »

The memory is the solution's most valuable feature. It's the main feature of HANA. Others are still the regular IT databases that are on storage and are therefore much slower than HANA. The solution is quite fast.The user interface is very good. You can do any kind of reporting analytics from the platform.The data storage requirement is reduced from the original database to the HANA database.The functionality is of the solution is very good.One feature I find very valuable, is the response time of the application on the database memory.Integration is the most valuable feature we use SAP HANA for.The most value for us was in terms of using it to issue tenders online. We host our server, but it is open to the public, so clients who want to buy those tenders were able to go online, put their tender documents up, and we could evaluate them using SAP.If you want to scale with new processes and new reports, that's fairly easy.

Read more »

Cons
The solution needs to optimize shuffling between workers.When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data.It needs a new interface and a better way to get some data. In terms of writing our scripts, some processes could be faster.The management tools could use improvement. Some of the debugging tools need some work as well. They need to be more descriptive.It should support more programming languages.Needs to provide an internal schedule to schedule spark jobs with monitoring capability.

Read more »

Unlike other databases, it lacks management features that legacy databases like Oracle or SQL servers have. They need to make the solution easier to manage and offer tools that make management more effective. A lot of things you have on traditional databases you have to develop into HANA.When you do a report on a non-SAP platform, you face some compatibility problems.I think that the pricing is high and it needs improvement.The challenge right now is all databases are on S4 HANA architecture. You're running it for HANA, but not all the functionalities are available. If they can speed up getting all the databases on S4 HANA that would help.If the developers were to enhance or improve the application logic while processing the transactions, that would be great.FI, or the financial module of SAP, has room for improvement. It has to have some better localization for the Middle East, especially in regards to taxes and the letter of credit cycle. I would like to see better localization from the HCM.The interface is a little bit hard to customize. You almost have to consult the SAP original developer to change it.In terms of improvement, the speed is not as good as we thought it would be. That is why we are trying different solutions that will be built with different technologies.

Read more »

Pricing and Cost Advice
Information Not Available
People who are technical will accept the cost, but financially they will assess whether this solution will bring them revenue or not. People often ask, how will I profit when the cost is so high?Set up a consortium of consulting partners and hardware vendors to define your tech. Landscape TCO (total cost of ownership) and then approach the OEM for pricing (on-premise or on cloud or a hybrid model). Check if you can bring your own licenses for some of the existing application licenses on the new platform, to reduce TCO.

Read more »

report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
383,162 professionals have used our research since 2012.
Ranking
1st
out of 24 in Hadoop
Views
11,048
Comparisons
9,201
Reviews
7
Average Words per Review
205
Avg. Rating
8.0
Views
10,414
Comparisons
8,535
Reviews
11
Average Words per Review
428
Avg. Rating
8.3
Top Comparisons
Compared 33% of the time.
Compared 11% of the time.
Compared 43% of the time.
Compared 9% of the time.
Compared 7% of the time.
Also Known As
SAP High-Performance Analytic Appliance, HANA
Learn
Apache
SAP
Overview

Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

The SAP HANA® platform helps you reimagine business by combining a robust database with services for creating innovative applications. It enables real-time business by converging trans-actions and analytics on one in-memory platform. Running on premise or in the cloud, SAP HANA untangles IT complexity, bringing huge savings in data management and empowering decision makers everywhere with new insight and predictive power.

Offer
Learn more about Apache Spark
Learn more about SAP HANA
Sample Customers
NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi SolutionsUnilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Top Industries
REVIEWERS
Financial Services Firm29%
Software R&D Company29%
Non Profit14%
Marketing Services Firm14%
VISITORS READING REVIEWS
Software R&D Company30%
Comms Service Provider13%
Financial Services Firm9%
Media Company9%
REVIEWERS
Energy/Utilities Company29%
Construction Company14%
Transportation Company14%
Non Tech Company14%
VISITORS READING REVIEWS
Software R&D Company38%
Comms Service Provider10%
Retailer7%
Media Company7%
Find out what your peers are saying about Apache, Cloudera, Hortonworks and others in Hadoop. Updated: December 2019.
383,162 professionals have used our research since 2012.
We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.
Sign Up with Email