quartz/content/BigData/Hadoop/Hadoop Eccosystem.md
2025-07-23 20:36:04 +03:00

### Systems based on [[MapReduce]]
> Early-generation frameworks for big data processing.
* [[Apache Hive]]
### Systems that replace MapReduce
> Newer, faster frameworks with different architectures and performance improvements.

**Motivation**: [[MapReduce]] and [[Apache Hive|Hive]] are too slow, especially for interactive queries and iterative jobs.
- [[Google Dremel]]
- [[Apache Spark]]
    - Replaces MapReduce with its own execution engine, which runs much faster without compromising consistency.
    - The architecture is not based on MapReduce but on two concepts:
        - RDD (Resilient Distributed Dataset)
        - DAG (Directed Acyclic Graph)
    - Pros:
        - Much faster than MapReduce, largely because intermediate results can stay in memory instead of being written to disk between stages.
        - Fast-growing community.
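
The RDD and DAG ideas can be illustrated with a small toy model in plain Python. This is only a sketch, not Spark's API: the `ToyRDD` class and all of its methods are hypothetical names invented for illustration. It assumes the usual description of Spark, where transformations such as `map` and `filter` only record a step in a lineage graph, and work happens when an action such as `collect` is called.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not the Spark API).

    A node holds either source partitions or a link to its parent
    plus the per-partition function that derives it from the parent.
    """

    def __init__(self, partitions=None, parent=None, op=None, name="source"):
        self.partitions = partitions  # set only on the source node
        self.parent = parent          # upstream node in the lineage graph
        self.op = op                  # function applied to one partition
        self.name = name

    # Transformations: lazy, they only add a node to the graph.
    def map(self, fn):
        return ToyRDD(parent=self, op=lambda part: [fn(x) for x in part], name="map")

    def filter(self, pred):
        return ToyRDD(parent=self, op=lambda part: [x for x in part if pred(x)], name="filter")

    def lineage(self):
        """Return the recorded steps from the source to this node."""
        node, names = self, []
        while node is not None:
            names.append(node.name)
            node = node.parent
        return list(reversed(names))

    def num_partitions(self):
        node = self
        while node.parent is not None:
            node = node.parent
        return len(node.partitions)

    def compute_partition(self, index):
        """Rebuild one partition by replaying the lineage.

        This is the "resilient" part: a lost partition can be
        recomputed from its parents without touching the others.
        """
        if self.parent is None:
            return self.partitions[index]
        return self.op(self.parent.compute_partition(index))

    # Action: triggers the actual computation.
    def collect(self):
        out = []
        for i in range(self.num_partitions()):
            out.extend(self.compute_partition(i))
        return out


source = ToyRDD(partitions=[[1, 2, 3], [4, 5, 6]])
squares_of_evens = source.filter(lambda x: x % 2 == 0).map(lambda x: x * x)

print(squares_of_evens.lineage())   # recorded plan, nothing computed yet
print(squares_of_evens.collect())   # action: runs the plan per partition
```

The lineage here is a simple chain, which is the smallest case of a DAG. In a real engine, operations that combine several datasets give a node more than one parent, and the scheduler splits the graph into stages.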