Compare Apache Spark vs. SAP HANA

Cancel
You must select at least 2 products to compare!
Apache Spark Logo
11,296 views|9,230 comparisons
SAP HANA Logo
10,572 views|8,519 comparisons
Most Helpful Review
Find out what your peers are saying about Apache, Cloudera, IBM and others in Hadoop. Updated: January 2021.
456,495 professionals have used our research since 2012.
Quotes From Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros
"I found the solution stable. We haven't had any problems with it.""The scalability has been the most valuable aspect of the solution.""The most valuable feature of this solution is its capacity for processing large amounts of data.""The solution is very stable.""I feel the streaming is its best feature.""The features we find most valuable are the machine learning, data learning, and Spark Analytics.""The main feature that we find valuable is that it is very fast.""The processing time is very much improved over the data warehouse solution that we were using."

More Apache Spark Pros »

"It has a very huge bandwidth and data transfer.""This solution is very fast.""If you want to scale with new processes and new reports, that's fairly easy.""The most value for us was in terms of using it to issue tenders online. We host our server, but it is open to the public, so clients who want to buy those tenders were able to go online, put their tender documents up, and we could evaluate them using SAP.""Integration is the most valuable feature we use SAP HANA for.""One feature I find very valuable, is the response time of the application on the database memory.""The functionality is of the solution is very good.""The data storage requirement is reduced from the original database to the HANA database."

More SAP HANA Pros »

Cons
"It needs a new interface and a better way to get some data. In terms of writing our scripts, some processes could be faster.""The management tools could use improvement. Some of the debugging tools need some work as well. They need to be more descriptive.""When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data.""The solution needs to optimize shuffling between workers.""When you want to extract data from your HDFS and other sources then it is kind of tricky because you have to connect with those sources.""We've had problems using a Python process to try to access something in a large volume of data. It crashes if somebody gives me the wrong code because it cannot handle a large volume of data.""We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time.""I would like to see integration with data science platforms to optimize the processing capability for these tasks."

More Apache Spark Cons »

"The solution is very expensive, however. The pricing depends on the number of users and many other factors that affect licensing.""The inclusion of a well-performing Time Machine is vital.""In terms of improvement, the speed is not as good as we thought it would be. That is why we are trying different solutions that will be built with different technologies.""The interface is a little bit hard to customize. You almost have to consult the SAP original developer to change it.""FI, or the financial module of SAP, has room for improvement. It has to have some better localization for the Middle East, especially in regards to taxes and the letter of credit cycle. I would like to see better localization from the HCM.""If the developers were to enhance or improve the application logic while processing the transactions, that would be great.""The challenge right now is all databases are on S4 HANA architecture. You're running it for HANA, but not all the functionalities are available. If they can speed up getting all the databases on S4 HANA that would help.""I think that the pricing is high and it needs improvement."

More SAP HANA Cons »

Pricing and Cost Advice
Information Not Available
"People who are technical will accept the cost, but financially they will assess whether this solution will bring them revenue or not. People often ask, how will I profit when the cost is so high?""It is expensive, which isn't a problem for us because SAP HANA is processing the data so fast.""SAP HANA is an expensive product.""It is expensive.""Setup and licensing require planning and proper budgeting, as it is not cheap."

More SAP HANA Pricing and Cost Advice »

report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
456,495 professionals have used our research since 2012.
Questions from the Community
Top Answer: SQreamDB is a GPU DB. It is not suitable for real-time oltp of course. Cassandra is best suited for OLTP database use cases, when you need a scalable database (instead of SQL server, Postgres)… more »
Top Answer: I love every core functionality of Apache Spark Initially they have only provided RDD basic interface to process the data across distributed cluster. Then it evolved to dataframe and dataset interface… more »
Top Answer: Apache spark is available in cloud services like AWS cloud, Azure. We have to use the specific service for our use case. For example we can use AWS Glue which runs spark for ETL process, AWS EMR… more »
Ask a question

Earn 20 points

Ranking
1st
out of 22 in Hadoop
Views
11,296
Comparisons
9,230
Reviews
12
Average Words per Review
388
Rating
8.3
Views
10,572
Comparisons
8,519
Reviews
21
Average Words per Review
508
Rating
8.0
Popular Comparisons
Compared 29% of the time.
Compared 9% of the time.
Compared 6% of the time.
Compared 5% of the time.
Compared 34% of the time.
Compared 17% of the time.
Compared 9% of the time.
Compared 5% of the time.
Also Known As
SAP High-Performance Analytic Appliance, HANA
Learn
Apache
SAP
Overview

Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflowstructure on distributed programs: MapReduce programs read input data from disk, map a function across the data, reduce the results of the map, and store reduction results on disk. Spark's RDDs function as a working set for distributed programs that offers a (deliberately) restricted form of distributed shared memory

The SAP HANA® platform helps you reimagine business by combining a robust database with services for creating innovative applications. It enables real-time business by converging trans-actions and analytics on one in-memory platform. Running on premise or in the cloud, SAP HANA untangles IT complexity, bringing huge savings in data management and empowering decision makers everywhere with new insight and predictive power.

Offer
Learn more about Apache Spark
Learn more about SAP HANA
Sample Customers
NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi SolutionsUnilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Top Industries
REVIEWERS
Financial Services Firm44%
Computer Software Company22%
Marketing Services Firm11%
Non Profit11%
VISITORS READING REVIEWS
Computer Software Company26%
Comms Service Provider18%
Media Company11%
Financial Services Firm10%
REVIEWERS
Energy/Utilities Company18%
Financial Services Firm12%
Consumer Goods Company12%
Construction Company6%
VISITORS READING REVIEWS
Computer Software Company40%
Comms Service Provider18%
Media Company5%
Manufacturing Company5%
Company Size
REVIEWERS
Small Business39%
Midsize Enterprise19%
Large Enterprise42%
REVIEWERS
Small Business38%
Midsize Enterprise10%
Large Enterprise52%
Find out what your peers are saying about Apache, Cloudera, IBM and others in Hadoop. Updated: January 2021.
456,495 professionals have used our research since 2012.

Apache Spark is ranked 1st in Hadoop with 12 reviews while SAP HANA is ranked 1st in Embedded Database Software with 21 reviews. Apache Spark is rated 8.2, while SAP HANA is rated 8.0. The top reviewer of Apache Spark writes "Good Streaming features enable to enter data and analysis within Spark Stream". On the other hand, the top reviewer of SAP HANA writes "Very robust solution with good data access". Apache Spark is most compared with Spring Boot, Azure Stream Analytics, AWS Batch, AWS Lambda and Apache NiFi, whereas SAP HANA is most compared with SQL Server, Oracle Database, MySQL, Oracle Database In-Memory and IBM Db2 Database.

See our list of .

We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.