quartz/content/BigData/Hadoop/Hadoop.md
2025-07-23 20:36:04 +03:00

!Screenshot 2025-07-23 at 12.20.09.png

Hadoop is an open-source framework for:

  • Distributed storage (across many machines)
  • Distributed processing (run programs on many machines in parallel)

It is not a database; it is an ecosystem for managing and analyzing Big Data.
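
To make "distributed storage" concrete, here is a minimal sketch of how a large file could be cut into fixed-size blocks, with each block copied to several machines. The 128 MB block size and the replication factor of 3 match the usual defaults of Hadoop's storage layer (HDFS). The function name `place_blocks`, the node names, and the round-robin placement are assumptions made for illustration only; real HDFS uses a rack-aware placement policy.

```python
def place_blocks(file_size, block_size, nodes, replication=3):
    """Split a file into fixed-size blocks and pick `replication` distinct nodes per block.

    Hypothetical helper: simple round-robin placement, not the real HDFS policy.
    """
    if replication > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    num_blocks = -(-file_size // block_size)  # ceiling division
    placement = {}
    for block in range(num_blocks):
        # Consecutive nodes (wrapping around) hold the copies of this block.
        placement[block] = [nodes[(block + r) % len(nodes)] for r in range(replication)]
    return placement


MB = 1024 * 1024
layout = place_blocks(300 * MB, 128 * MB, ["node1", "node2", "node3", "node4"])
for block, holders in layout.items():
    print(f"block {block} -> {holders}")
```

Because every block lives on three different machines in this sketch, losing any single machine still leaves two copies of each block available.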

Hadoop Components Overview

!Screenshot 2025-07-23 at 11.58.48.png

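  1. HDFS (Hadoop Distributed File System): distributed storage
  2. MapReduce: distributed, parallel processing
  3. YARN (Yet Another Resource Negotiator): cluster resource management and job scheduling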
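
The MapReduce programming model can be sketched in a few lines of plain Python. This is a single-process simulation of the map, shuffle, and reduce phases for a word count, not Hadoop's actual API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical. On a real cluster, the map and reduce calls would run in parallel on many machines and the framework would perform the shuffle over the network.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["big data big cluster", "data node"])
print(counts)
```

Each call to `map_phase` depends only on its own line, and each call to `reduce_phase` depends only on its own key, which is what lets a framework spread the work across machines.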

Hadoop Ecosystem