2018-08-14T07:42:00Z

What do you like most about Apache Hadoop?

Julia Miller - PeerSpot reviewer
PeerSpot user

22 Answers

GM
Real User
Top 5
2023-12-29T12:05:06Z
Dec 29, 2023

It's open-source, so it's very cost-effective.

Miodrag Milojevic - PeerSpot reviewer
Real User
Top 5 Leaderboard
2023-08-01T14:27:00Z
Aug 1, 2023

It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.

YM
Real User
Top 10
2023-08-01T12:17:28Z
Aug 1, 2023

The most valuable features are scalability, the ability to work with large volumes of information, and the fact that it is open source.

Aria Amini - PeerSpot reviewer
Real User
Top 5 Leaderboard
2023-07-26T11:57:00Z
Jul 26, 2023

Integration is Hadoop's best feature because it allows us to support different tools in a big data platform.

AM
Real User
Top 20
2022-09-29T11:28:03Z
Sep 29, 2022

Apache Hadoop can manage large volumes of data with relative ease, which is a beneficial feature.

YT
Real User
Top 20
2022-09-05T12:51:45Z
Sep 5, 2022

One valuable feature is that we can download data.

MB
Real User
Top 20
2022-07-21T16:29:00Z
Jul 21, 2022

I liked that Apache Hadoop was powerful, had a lot of tools, and the fact that it was free and community-developed.

Juliet Hoimonthi - PeerSpot reviewer
Real User
Top 5
2022-07-05T06:27:00Z
Jul 5, 2022

What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies.

DM
Real User
Top 20
2022-04-27T08:19:03Z
Apr 27, 2022

The scalability of Apache Hadoop is very good.

DK
Real User
2022-01-14T10:24:00Z
Jan 14, 2022

We selected Apache Hadoop because it is not dependent on third-party vendors.

DD
Real User
2021-10-05T18:57:00Z
Oct 5, 2021

Hadoop is extensible — it's elastic.

GA
Real User
2020-12-08T22:10:56Z
Dec 8, 2020

Hadoop is designed to be scalable, so I don't think it has limitations with regard to scalability.

SS
Real User
2020-10-19T09:33:27Z
Oct 19, 2020

The performance is pretty good.

JP
Real User
2020-07-14T08:15:56Z
Jul 14, 2020

The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding ever since we started using it, because we started out so small. Companies that need to scale shouldn't have a problem doing so.

Abhik Ray - PeerSpot reviewer
Real User
Top 5
2020-02-07T02:52:00Z
Feb 7, 2020

The most valuable features are the powerful tools for ingestion, since data resides in multiple systems.

it_user1093134 - PeerSpot reviewer
Real User
2019-12-16T08:14:00Z
Dec 16, 2019

The most valuable feature is the database.

it_user1208307 - PeerSpot reviewer
Real User
2019-12-16T08:13:00Z
Dec 16, 2019

It's good for storing historical data and handling analytics on a huge amount of data.

YM
Real User
Top 10
2019-11-27T05:42:00Z
Nov 27, 2019

The ability to add multiple nodes without any restriction is the solution's most valuable aspect.

Lucas Dreyer - PeerSpot reviewer
Real User
Top 5 Leaderboard
2019-09-29T07:27:00Z
Sep 29, 2019

What comes with the standard setup is what we mostly use, but Ambari is the most important.

MB
Real User
Top 20
2019-07-28T07:35:00Z
Jul 28, 2019

The best thing about this solution is that it is very powerful and very cheap.

Real User
2019-07-16T01:59:00Z
Jul 16, 2019

The most valuable features are the ability to process machine data at high speed and to add structure to our data so that we can generate relevant analytics.

SF
Real User
2018-08-14T07:42:00Z
Aug 14, 2018

Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect...