Apache Hadoop Scalability

Syed Afroz Pasha - PeerSpot reviewer
Head Of Data Governance at Alibaba Group

Around 70 to 80 people use the product in our organization.

View full review »
Miodrag Milojevic - PeerSpot reviewer
Senior Data Archirect at Yettel

The scalability includes adding nodes and it is not so easy to do. It is a detailed process that requires precision. 

There are almost 25 users, including data engineers and others, but no specialists. We plan to increase endpoint users and introduce running reports, automated reports, or reports based on some tools. 

View full review »
Juliet Hoimonthi - PeerSpot reviewer
Manager at Robi Axiata Limited

I'm not sure how scalable Apache Hadoop is.

View full review »
Buyer's Guide
Apache Hadoop
March 2024
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
765,234 professionals have used our research since 2012.
GM
Data Architect at a computer software company with 51-200 employees

It can be scalable in certain cases. Typically, for startups or product-based companies with limited budgets during product development, Apache Hadoop is often the only viable option. They cannot afford the costs of other cloud-based systems, so Apache Hadoop plays a main role in those scenarios.

View full review »
AM
Credit & Fraud Risk Analyst at a financial services firm with 10,001+ employees

According to what I have seen in my current enterprise, once I joined the organization, it was fairly simple to have it for an employee, and this is true for everyone who's been onboarded in my own designation. I would imagine that it is fairly scalable across an enterprise.

I am fairly certain that we have between 10 and 15,000 employees who use it.

View full review »
Abhik Ray - PeerSpot reviewer
Co-Founder at Quantic

Its scalability is very good. Most of our clients have used it on-prem. So, to a large extent, it is up to them to provide hardware for large data, which they have. Its scalability is linear. As long as the hardware is given to it, there are no complaints.

About 70% of its users are from a client's IT in terms of setting it up and providing support to make sure that the pipeline is there. Business users are about 30%. They are the people who use the analytics derived from the warehouse or data lake. Collectively, there are about 120 users. The size of the data is mostly in terms of the number of records it handles, which could be 30 or 40 million.

View full review »
RC
Senior Associate at a financial services firm with 10,001+ employees

No. This is by default a cluster-based setup and hence scaling is just a matter of adding on new data nodes.

View full review »
Aria Amini - PeerSpot reviewer
Data Engineer at Behsazan Mellat

Apache Hadoop is very good for scalability because one of its main features is its scalability tool. For all the big data infrastructure, we have about ten employees working in the Hadoop environment as engineers and developers. One of our clients is a bank, and the Hadoop environment can retrieve a lot of data, so we could have an unlimited number of end users.

View full review »
YM
CEO at AM-BITS LLC

We may have 15 people working on this solution.

I rate the solution’s scalability a ten out of ten.

View full review »
DM
Data Analytics Practice head at bse

The scalability of Apache Hadoop is very good.

View full review »
it_user340983 - PeerSpot reviewer
Infrastructure Engineer at Zirous, Inc.

We have scaled two of the clusters that we have implemented; one in the cloud, one on-premise. Neither ran into any problems, but I can say with certainty that it is much, much easier to scale in a cloud environment than it is on-premise.

View full review »
Lucas Dreyer - PeerSpot reviewer
Data Engineer at BBD

This solution is scalable, and I can scale it almost indefinitely.

We have approximately two thousand users, half of the users are using it directly and another thousand using the products and systems running on it. Fifty are data engineers, fifteen direct appliances, and the rest are business users.

View full review »
AM
CEO

Since this is primarily for customer incubation, there is a need to process huge volumes of data, based on the proof of value engagement. During these processes, we scale the number of instances on demand (using Amazon spot instances), use them for a defined period, and scale down when the PoC is done. This gives us good flexibility and we pay only for usage.

View full review »
YT
Business data analyst at RBSG Internet operations

The scalability of the solution is good. Approximately 100 people are currently using this solution within our company. 

View full review »
JP
Vice President - Finance & IT at a consumer goods company with 1-10 employees

The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so.

We are supporting a multitenancy model and we get the data on supporting the users. I would say, per organization, we have eight to 10 users and probably have a total of around 40 users across the board.

View full review »
it_user265830 - PeerSpot reviewer
Senior Hadoop Engineer with 1,001-5,000 employees

Hadoop is known for its scalability. Yahoo stores approx. 455 PB in their Hadoop cluster.

View full review »
DD
Partner at a tech services company with 11-50 employees

Scalability is one of Hadoop's strong suits.

View full review »
MB
IT Expert at a comms service provider with 1,001-5,000 employees

Apache Hadoop is scalable. We had about 150 people using it at the organization. Some were data scientists, others were from the engineering side, and people from management because Apache Hadoop provided some reports.

View full review »
YM
CEO at AM-BITS LLC

It is possible to scale the solution. We work with companies that have hundreds of users.

View full review »
MB
IT Expert at a comms service provider with 1,001-5,000 employees

This is a scalable solution and we like what it does. It is currently serving about 100 users at our organization and it seems like it can handle more easily.

View full review »
SS
Technical Lead at a government with 201-500 employees

You can scale the solution if you need to. We find that it's pretty easy to expand it out.

There were about 13-20 people using it at any given time.

View full review »
GA
Founder & CTO at a tech services company with 1-10 employees

Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability.

View full review »
it_user1093134 - PeerSpot reviewer
Technical Architect at RBSG Internet Operations

The solution is scalable. From a payments perspective, we're using the solution on a large scale.

View full review »
it_user693231 - PeerSpot reviewer
Big Data Engineer at a tech vendor with 5,001-10,000 employees

We have not had scalability issues.

View full review »
Buyer's Guide
Apache Hadoop
March 2024
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
765,234 professionals have used our research since 2012.