Cloudera Distribution for Hadoop Scalability

LS
Head of Big Data and Analytics Competency center at OTP Bank Hungary

The scalability is fairly good, but we don't have much experience with it. Our users are mainly technical, automated users who are like second-service users. We have hundreds of service users who are taking and pushing forward the data from the data lake. Then we have 20 to 30 data scientists and advanced analytics people. We have a few users interested in Google Analytics logs. In total, we'd have a maximum of 50 users.

We push them toward using BI tools, though we try to make it clear that it's sometimes necessary to go to the raw data, because we are not always able to push it to the data warehouse or to a data mart for them. That is always risky from a production process perspective, however, because the solution then becomes sensitive to source system changes.

Shahan Rehman - PeerSpot reviewer
Senior Business Development Manager at BBI Consultancy

It is a scalable solution. Scalability-wise, I rate the solution a nine out of ten. Scalability depends on the environment, but it can scale up in an on-premises environment. There are challenges with its scalability on the cloud.

My company deals with around seven customers who use the product.

Miodrag Milojevic - PeerSpot reviewer
Senior Data Architect at Yettel

The scalability of Cloudera Distribution for Hadoop is excellent. It is not a straightforward process to scale, but it is easy to deploy a new server and connect it. Once connected, the data can be distributed effectively.

Buyer's Guide
Cloudera Distribution for Hadoop
May 2024
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: May 2024.
769,599 professionals have used our research since 2012.
Hamid M. Hamid - PeerSpot reviewer
Data architect at Banking Sector

The solution is scalable. You can add many whenever you need.

We have hundreds of users using this solution. We have plans to increase the usage.

Miodrag-Stanic - PeerSpot reviewer
Senior Architect at Yettel

We have a team of four data engineers and seven data scientists. We all work on the system, but machine learning models are delivered to other systems in our company. We have 10-15 people using this solution in total. There are five data discovery engineers using Jupyter portals.

We scaled up from six data nodes to 12 data nodes. It took considerable time because we did it during COVID. We had to wait an eternity to get the servers. When the servers came, it took maybe three weeks.

I rate the solution’s scalability a seven out of ten.

Thishen Govender - PeerSpot reviewer
BI Manager at Discovery Health

The solution is highly scalable. I rate it a perfect ten. We have 20 users for the solution in our company.

KY
Senior IT Application Architect at an insurance company with 5,001-10,000 employees

We have 2000 to 3000 users in our organization.

Thishen Govender - PeerSpot reviewer
BI Manager at Discovery Health

Cloudera is dependable, and it's completely scalable.

RS
AD - Associate Director at a financial services firm with 10,001+ employees

Scalability is good. The data is replicated, and by default, with big data, there is a replication factor.

We have grown over the years. When we started, we had 10 nodes; now we have increased to a large number of nodes.

Thishen Govender - PeerSpot reviewer
BI Manager at Discovery Health

While we have not yet done a lot to scale the solution, we think it is going to be quite scalable because it works on a distributed architecture.

We will probably start with 10 or 15 users once we roll the solution out into production, which will probably be at the end of this week. Afterward, the user base will grow by double-digit percentages. But that is just to start with. Over a few years, we plan to start thinking about rolling out our experiences to our international businesses as well. This would be a substantial increase in user base.

KG
Vice President at a financial services firm with 10,001+ employees

Cloudera Distribution for Hadoop is really easy to scale. We can add more servers to it, so it's scalable.

YM
CEO at AM-BITS LLC

We have ten customers using the product. They include data engineers, performance engineers, and environment engineers.

I rate its stability a ten out of ten.

AK
Senior Data Architect Manager at Unifonic

The scalability is very good. 

Hamid M. Hamid - PeerSpot reviewer
Data architect at Banking Sector

It is a scalable solution. There were about 20 users of this solution in my company. Ten people were required for the deployment and maintenance of the solution, including developers.

Sayyed Aadil - PeerSpot reviewer
Hadoop Admin at Tata Consultancy

It is a scalable solution. It is the best solution for larger companies. There are about 3000 users, and there are medical teams for medical data with about 1000 users. We require about six people for maintenance and deployment.

Suresh_Srinivasan - PeerSpot reviewer
Co-Founder at FORMCEPT Technologies

This solution is scalable enough for us. 

We have created a product using HDFS, and when our engineers install it for themselves or for customers, we use this solution. There are about 15 to 20 people using it at any point in time.

EricLin - PeerSpot reviewer
Chairman at Athemaster co.,ltd.

The tool is scalable. I rate the scalability an eight out of ten. It is easy to scale the product. Almost 20 to 25 people use the tool in our organization. We maintain the solution ourselves. We have nine engineers in our maintenance team.

Mohammed Hamad - PeerSpot reviewer
AI & Data Engineering Lead at a tech services company with 10,001+ employees

CDH is scalable, but it's expensive to scale.

DS
DBA team manager at a financial services firm with 1,001-5,000 employees

Not many people are currently using this solution at my organization, but I do believe it is scalable. I don't, however, have experience with upgrading or adding users. 

KG
Associate Manager at a consultancy with 501-1,000 employees

This solution is scalable. We have 40 users for different projects in our organization.

We will continue to use this solution.

it_user900987 - PeerSpot reviewer
Data Management at BCX

In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues. Currently, only about 10 people in total are using the solution. So we have about four business users and then four technical people. It's only limited to two environments.

SC
Lead Consultant - Product Development at FIS (http://www.fisglobal.com/)

It is very stable; we didn't face any performance issues.

NK
Senior Software Engineer at a tech services company with 10,001+ employees

The scalability is good and it works on commodity hardware. One of the problems we have right now is that there is a lot of data and we're moving it from our Oracle solution. This means that there is a double cost, in terms of storage, during our transition to working with big data.

We are using a data lake that is a store for all of the data in our organization. There are more than 25 projects, with between 25 and 30 people in each one, for a total of almost 1,000 people. All of them are dependent on this solution.

Most of our users are technicians who have problems to solve using the data available to them. A couple of them are data scientists and the remainder are upper management, who do the analysis.

it_user347172 - PeerSpot reviewer
System Engineer at a tech company with 10,001+ employees

It's very easy to deploy and scale as large as you want. Once created on the CM management cluster, it is difficult to scale up as needed as you add more clusters to the same CM instance.

it_user374058 - PeerSpot reviewer
Vice President - Big Data and Delivery at a computer software company with 51-200 employees
AG
Engineering Manager/Solution architect at a computer software company with 201-500 employees

This is a scalable solution. We have clients that have a large installation of Cloudera.

it_user364473 - PeerSpot reviewer
R&D Solutions Architect at a tech vendor with 10,001+ employees

No issues encountered.

it_user364431 - PeerSpot reviewer
Consultant at a tech consulting company with 51-200 employees

No issues encountered.

it_user356769 - PeerSpot reviewer
Director of Data Architecture at a financial services firm with 501-1,000 employees

No issues with the current version.

EricLin - PeerSpot reviewer
Chairman at Athemaster co.,ltd.

The Cloudera Distribution for Hadoop can be scaled. Our customers are enterprise-level companies and they have about 100 users for this solution.

MG
Data engineer at a tech services company with 11-50 employees

It's scalable. You can add more nodes and you can expand your cluster easily.

it_user347535 - PeerSpot reviewer
Software Engineer at a tech services company with 501-1,000 employees

No issues encountered.
