Apache Hadoop Primary Use Case

Syed Afroz Pasha - PeerSpot reviewer
Head Of Data Governance at Alibaba Group

We use the Hadoop File System. We usually keep the data for our tables or big data on it. Hadoop has a query engine called Hive. We write SQL queries, and the tool usually processes in a parallel environment and gets us the data on Hive.

View full review »
Miodrag Milojevic - PeerSpot reviewer
Senior Data Archirect at Yettel

I have been using the latest version of Apache Hadoop. It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.

View full review »
Juliet Hoimonthi - PeerSpot reviewer
Manager at Robi Axiata Limited

I'm from the data governance team, and this is how my team uses Apache Hadoop: there's a GUI called Apache Atlas, then there's an option called the "business glossary". My team uses the business glossary from Apache Atlas and also uses Apache Ranger. Apache Ranger is another GUI where you can check who is using which data source through the Apache Hadoop platform. My team also uses the Apache Hadoop platform for AI-related use cases and relevant data, so the data required from any kind of AI use case, that data is processed with ETL, specifically with the Talend tool. My team then loads the data in Apache Hadoop, uses that data by making some clusters, and uses the data for AI/ML cases.

View full review »
Buyer's Guide
Apache Hadoop
April 2024
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,246 professionals have used our research since 2012.
GM
Data Architect at a computer software company with 51-200 employees

We work on Apache Hadoop for various customers. 

View full review »
Anand Viswanath - PeerSpot reviewer
Project Manager at Unimity Solutions

I use the solution in my company for security purposes.

In my company, we have intranet portals that we need to ensure are not accessible by outsiders. All the data that are within the internal applications is only accessible with valid credentials within the domain. In general, my company uses Apache Hadoop to secure our internal applications.

View full review »
AM
Credit & Fraud Risk Analyst at a financial services firm with 10,001+ employees

We use Apache Hadoop for analytics purposes.

View full review »
Abhik Ray - PeerSpot reviewer
Co-Founder at Quantic

Its main use case is to create a data warehouse or data lake, which is a collection of data from multiple product processors used by a banking organization. They have core banking, which has savings accounts or deposits as one system, and they have a CRM or customer information system. They also have a credit card system. All of them are separate systems in most cases, but there is a linkage between the data. So, the main motivation is to consolidate all that data in one place and link it wherever required so that it acts as a single version of the truth, which is used for management reporting, regulatory reporting, and various forms of analyses.

We have done two or three projects with Hadoop, and we have taken the latest version available at that time. So far, it was deployed on-premises.

View full review »
Aria Amini - PeerSpot reviewer
Data Engineer at Behsazan Mellat

We use the Apache Hadoop environment for use cases involving big data engineering. We have many applications, such as collecting, transforming, loading, and storing lag event data for big organizations.

View full review »
YM
CEO at AM-BITS LLC

This solution is used for a variety of purposes, including managing enterprise data hubs, monitoring network quality, implementing an AntiFraud system, and establishing a conveyor system.

View full review »
SF
Analytics Platform Manager at a consultancy with 10,001+ employees

We use it as a data lake for streaming analytical dashboards.

View full review »
Lucas Dreyer - PeerSpot reviewer
Data Engineer at BBD

The primary use case of this solution is data engineering and data files.

The deployment model we are using is private, on-premises.

View full review »
AM
CEO

Big Data analytics, customer incubation. 

We host our Big Data analytics "lab" on Amazon EC2. Customers are new to Big Data analytics so we do proofs of concept for them in this lab. Customers bring historical, structured data, or IoT data, or a blend of both. We ingest data from these sources into the Hadoop environment, build the analytics solution on top, and prove the value and define the roadmap for customers.

View full review »
YT
Business data analyst at RBSG Internet operations

We use the solution as a data link for our customer payment and SaaS information. We get data from various sources and then utilize and leverage that data.

View full review »
JP
Vice President - Finance & IT at a consumer goods company with 1-10 employees

As an example of a use case, when I was a contractor for Cisco, we were processing mobile network data and the volume was too big. RDBMS was not supporting anything. We started using the Hadoop framework to improve the process and get the results faster.

View full review »
MS
Works

We use this solution for our Enterprise Data Lake.

View full review »
DD
Partner at a tech services company with 11-50 employees

There are several use cases for Hadoop. Sometimes it's used for data warehousing. Other times, it's analytics. And In some cases, it's used to do transformation. For example, I have one client using it to decompress, compress, or encrypt data on ingestion. So, he used it like an ETL engine.

View full review »
MB
IT Expert at a comms service provider with 1,001-5,000 employees

We used Apache Hadoop mainly for ETL and data analysis.

View full review »
YM
CEO at AM-BITS LLC

We primarily use the solution for the enterprise data hub and big data warehouse extension.

View full review »
MB
IT Expert at a comms service provider with 1,001-5,000 employees

We primarily use this product to integrate legacy systems.

View full review »
CB
Database/Middleware Consultant (Currently at U.S. Department of Labor) at a tech services company with 51-200 employees
  • Content management solution
  • Unified Data solution
  • Apache Hadoop running on Linux
View full review »
GA
Founder & CTO at a tech services company with 1-10 employees

We mainly use Apache Hadoop for real-time streaming. Real-time streaming and integration using Spark streaming and the ecosystem of Spark technologies inside Hadoop.

View full review »
it_user1093134 - PeerSpot reviewer
Technical Architect at RBSG Internet Operations

We are primarily dumping all the prior payment transaction data into a loop system and then we use some of the plug and play analytics tools to translate it.

View full review »
Abhik Ray - PeerSpot reviewer
Co-Founder at Quantic

The primary use is as a data lake. 

View full review »
it_user576504 - PeerSpot reviewer
Software Architect at a tech services company with 10,001+ employees

Data aggregation for KPIs. The sources of data come in all forms so the data is unstructured. We needed high storage and aggregation of data, in the background.

View full review »
Buyer's Guide
Apache Hadoop
April 2024
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,246 professionals have used our research since 2012.