YARN (Yet Another Resource Negotiator) is Hadoop’s cluster resource management system. It exists because, on a shared cluster:
- Multiple jobs run simultaneously
- Those jobs compete for the same resources (disk, CPU, memory)
- Resources must be assigned to individual jobs and tasks exclusively
YARN is in charge of:
- Allocating resources
- Scheduling jobs
- Prioritizing jobs according to a scheduling policy: FIFO scheduler, Fair scheduler, or Capacity scheduler
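To make the difference between the three policies concrete, here is a minimal toy sketch in Python. It is not YARN code: the `Job` class, the function names, and the queue names are made up for illustration, resources are reduced to a single count of containers, and the Capacity scheduler is simplified to fixed per-queue shares (queues here cannot borrow idle capacity from each other).

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    demand: int  # number of containers the job asks for (hypothetical unit)


def fifo_allocate(jobs, capacity):
    """FIFO: serve jobs in arrival order until capacity runs out."""
    allocation = {}
    remaining = capacity
    for job in jobs:
        granted = min(job.demand, remaining)
        allocation[job.name] = granted
        remaining -= granted
    return allocation


def fair_allocate(jobs, capacity):
    """Fair: hand out one container at a time, round-robin, to jobs with unmet demand."""
    allocation = {job.name: 0 for job in jobs}
    remaining = capacity
    while remaining > 0:
        progressed = False
        for job in jobs:
            if remaining == 0:
                break
            if allocation[job.name] < job.demand:
                allocation[job.name] += 1
                remaining -= 1
                progressed = True
        if not progressed:  # every job is fully served
            break
    return allocation


def capacity_allocate(queues, share_percent, capacity):
    """Capacity (simplified): each queue gets a fixed share, FIFO inside the queue."""
    allocation = {}
    for queue_name, jobs in queues.items():
        queue_capacity = capacity * share_percent[queue_name] // 100
        allocation.update(fifo_allocate(jobs, queue_capacity))
    return allocation


jobs = [Job("A", 8), Job("B", 4), Job("C", 4)]

print(fifo_allocate(jobs, 10))  # {'A': 8, 'B': 2, 'C': 0}
print(fair_allocate(jobs, 10))  # {'A': 4, 'B': 3, 'C': 3}
print(capacity_allocate(
    {"prod": [jobs[0]], "dev": [jobs[1], jobs[2]]},
    {"prod": 70, "dev": 30},
    10,
))  # {'A': 7, 'B': 3, 'C': 0}
```

With FIFO, the large job A starves C. The fair policy spreads containers across all three jobs, and the capacity policy guarantees the `dev` queue its 30% no matter how much `prod` asks for.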
Components:
- ResourceManager
  - Oversees resource allocation across the cluster.
- NodeManager
  - Each node in the cluster runs a NodeManager.
  - It manages the execution of containers on its node.
- ApplicationMaster
  - Manages the lifecycle of a single application; each application gets its own ApplicationMaster.
  - Requests containers from the ResourceManager, schedules the tasks of the application in them, and monitors progress.
- Resource Container
  - A logical bundle of resources (e.g., CPU, memory) on a single node, allocated by the ResourceManager.
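How the components fit together can be sketched with a small Python model. This is only an illustration, not the YARN API: the class and method names mirror the component names above but are otherwise invented, placement is a naive first-fit, and real YARN details are left out (the ApplicationMaster itself runs in a container, and containers are launched through the NodeManager).

```python
from dataclasses import dataclass, field


@dataclass
class Container:
    """A logical bundle of resources on one node."""
    node: str
    vcores: int
    memory_mb: int


@dataclass
class NodeManager:
    """Tracks free resources and running containers on a single node."""
    name: str
    vcores: int
    memory_mb: int
    containers: list = field(default_factory=list)

    def can_fit(self, vcores, memory_mb):
        return self.vcores >= vcores and self.memory_mb >= memory_mb

    def launch(self, vcores, memory_mb):
        # Reserve the resources and record the new container.
        self.vcores -= vcores
        self.memory_mb -= memory_mb
        container = Container(self.name, vcores, memory_mb)
        self.containers.append(container)
        return container


class ResourceManager:
    """Cluster-wide allocator: picks the first node with enough free resources."""

    def __init__(self, nodes):
        self.nodes = nodes

    def allocate(self, vcores, memory_mb):
        for node in self.nodes:
            if node.can_fit(vcores, memory_mb):
                return node.launch(vcores, memory_mb)
        return None  # no node can satisfy the request right now


class ApplicationMaster:
    """Per-application coordinator: requests one container per task."""

    def __init__(self, resource_manager, tasks, vcores, memory_mb):
        self.resource_manager = resource_manager
        self.tasks = tasks
        self.vcores = vcores
        self.memory_mb = memory_mb

    def run(self):
        placement = {}
        for task in self.tasks:
            container = self.resource_manager.allocate(self.vcores, self.memory_mb)
            placement[task] = container.node if container else None
        return placement


nodes = [NodeManager("node1", 4, 8192), NodeManager("node2", 4, 8192)]
rm = ResourceManager(nodes)
am = ApplicationMaster(rm, ["t1", "t2", "t3", "t4", "t5"], vcores=2, memory_mb=4096)
print(am.run())
# {'t1': 'node1', 't2': 'node1', 't3': 'node2', 't4': 'node2', 't5': None}
```

The fifth task gets no container because both nodes are full. In a real cluster the request would stay pending until resources are released.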
YARN ecosystem
YARN can also run applications other than Hadoop MapReduce that integrate with the Hadoop ecosystem:
- Apache Storm (data streaming engine)
- Apache Spark (batch and streaming data engine)
- Apache Solr (search platform)
