Apache Hadoop Logo

Apache Hadoop pros and cons

Vendor: Apache
3.9 out of 5
1,776 followers
Post review
 

Apache Hadoop Pros review quotes

JP
Jul 14, 2020
The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so.
Abhik Ray - PeerSpot reviewer
Jul 18, 2022
The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable.
AM
Sep 29, 2022
Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial.
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,415 professionals have used our research since 2012.
Juliet Hoimonthi - PeerSpot reviewer
Jul 5, 2022
What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies.
Mar 6, 2018
Initially, with RDBMS alone, we had a lot of work and few servers running on-premise and on cloud for the PoC and incubation. With the use of Hadoop and ecosystem components and tools, and managing it in Amazon EC2, we have created a Big Data "lab" which helps us to centralize all our work and solutions into a single repository. This has cut down the time in terms of maintenance, development and, especially, data processing challenges.
MB
Jul 28, 2019
The best thing about this solution is that it is very powerful and very cheap.
SF
Aug 14, 2018
Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.
GM
Dec 29, 2023
It's open-source, so it's very cost-effective.
Lucas Dreyer - PeerSpot reviewer
Sep 29, 2019
What comes with the standard setup is what we mostly use, but Ambari is the most important.
Aria Amini - PeerSpot reviewer
Jul 26, 2023
Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform.
 

Apache Hadoop Cons review quotes

JP
Jul 14, 2020
The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.
Abhik Ray - PeerSpot reviewer
Jul 18, 2022
It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it.
AM
Sep 29, 2022
I mentioned it definitely, and this is probably the only feature we can improve a little bit because the terminal and coding screen on Hadoop is a little outdated, and it looks like the old C++ bio screen. If the UI and UX can be improved slightly, I believe it will go a long way toward increasing adoption and effectiveness.
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,415 professionals have used our research since 2012.
Juliet Hoimonthi - PeerSpot reviewer
Jul 5, 2022
What could be improved in Apache Hadoop is its user-friendliness. It's not that user-friendly, but maybe it's because I'm new to it. Sometimes it feels so tough to use, but it could be because of two aspects: one is my incompetency, for example, I don't know about all the features of Apache Hadoop, or maybe it's because of the limitations of the platform. For example, my team is maintaining the business glossary in Apache Atlas, but if you want to change any settings at the GUI level, an advanced level of coding or programming needs to be done in the back end, so it's not user-friendly.
Mar 6, 2018
Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them.
MB
Jul 28, 2019
The upgrade path should be improved because it is not as easy as it should be.
SF
Aug 14, 2018
I would like to see more direct integration of visualization applications.
GM
Dec 29, 2023
The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support.
Lucas Dreyer - PeerSpot reviewer
Sep 29, 2019
In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency.
Aria Amini - PeerSpot reviewer
Jul 26, 2023
It could be more user-friendly.