-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathprogram_design.txt
50 lines (11 loc) · 957 Bytes
/
program_design.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
File 1 File 2 File 3 File 4 File 5 File 6
Mappers compete with each other to grab files to map()
Mapper 1 Mapper 2 Mapper 3
<key,{value,value,value}> <key,{value,value,value}> <key,{value,value}>
<key,{value}>
↓ ↓ ↓
BLACK BOX: Given a key, return the partition number this key should go to
Partition 1 Partition 2 Partition 3 Partition 4
↓ ↓ ↓ ↓
Reducer 1 Reducer 2 Reducer 3 Reducer 4
Reducer threads sort the keys in the partitions and then reduce()