Reaves.dev

MapReduce

Stephen M. Reaves :: 2024-04-11

Notes about Lecture 9b for CS-6210

Summary

MapReduce

[Figure: MapReduce dataflow. Input files f1, f2, f3, ..., fn (each with its contents) feed map tasks m1, m2, m3, ..., mn, one file per map task. Every map task sends output to every reduce task r1, r2, ..., rm. Each reduce task ri produces one output oi holding a value.]
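The dataflow in the figure can be sketched in a few lines of Python. This is a minimal, single-process illustration and not how a real runtime is built: word count is an assumed example workload, and the names `map_fn`, `reduce_fn`, `partition`, and `map_reduce` are hypothetical. Choosing the reducer with a CRC32 hash is also an assumption, used only so that a given key always lands on the same reducer.

```python
import zlib
from collections import defaultdict


def map_fn(filename, contents):
    """User-supplied map: emit (word, 1) for every word in one input file."""
    for word in contents.split():
        yield word.lower(), 1


def reduce_fn(key, values):
    """User-supplied reduce: combine all values seen for one key."""
    return sum(values)


def partition(key, num_reducers):
    """Pick the reducer for a key (assumed scheme: CRC32 modulo reducer count)."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def map_reduce(files, num_reducers=2):
    """Run map over every file, route pairs to reducers, then reduce."""
    # One bucket per reducer; each bucket groups values by key.
    buckets = [defaultdict(list) for _ in range(num_reducers)]

    # Map phase: conceptually one map task per input file (f1 -> m1, ...).
    for filename, contents in files.items():
        for key, value in map_fn(filename, contents):
            buckets[partition(key, num_reducers)][key].append(value)

    # Reduce phase: each reducer sees every value for the keys it owns.
    return [
        {key: reduce_fn(key, values) for key, values in sorted(bucket.items())}
        for bucket in buckets
    ]
```

Calling `map_reduce({"f1": "the quick fox", "f2": "the lazy dog"})` returns a list with one dictionary per reducer. Every map task can feed every reducer, which matches the all-to-all edges between the map and reduce stages in the figure.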

Why MapReduce?

Heavy Lifting done by runtime

[Figure: MapReduce runtime. The user program forks the master and the worker processes. The master designates workers as mappers or reducers. Input splits split0 through split5 are read by three map workers, which produce intermediate results. Two reduce workers read those intermediate results and write the output files file0 and file1.]
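To make the division of labor concrete, here is a rough sketch of a master that hands splits to map workers and then hands partitions to reduce workers. It is an assumption-heavy toy: thread pools stand in for worker machines, in-memory lists stand in for intermediate files, and the names `master`, `map_task`, `reduce_task`, `word_map`, and `word_reduce` are hypothetical. The point is only that the user writes the two small functions at the bottom, while everything else belongs to the runtime.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor

NUM_REDUCERS = 2  # the figure shows two reduce workers (file0 and file1)


def map_task(split, map_fn):
    """Map worker: run map_fn on one split, bucket output per reducer."""
    buckets = [[] for _ in range(NUM_REDUCERS)]
    for key, value in map_fn(split):
        r = zlib.crc32(key.encode("utf-8")) % NUM_REDUCERS
        buckets[r].append((key, value))
    return buckets  # stands in for this worker's intermediate results


def reduce_task(r, intermediate, reduce_fn):
    """Reduce worker r: gather bucket r from every map worker, then reduce."""
    grouped = {}
    for buckets in intermediate:
        for key, value in buckets[r]:
            grouped.setdefault(key, []).append(value)
    return {k: reduce_fn(k, vs) for k, vs in sorted(grouped.items())}


def master(splits, map_fn, reduce_fn, num_map_workers=3):
    """Assign map tasks, wait for all of them, then assign reduce tasks."""
    with ThreadPoolExecutor(max_workers=num_map_workers) as mappers:
        intermediate = list(mappers.map(lambda s: map_task(s, map_fn), splits))
    with ThreadPoolExecutor(max_workers=NUM_REDUCERS) as reducers:
        outputs = list(
            reducers.map(
                lambda r: reduce_task(r, intermediate, reduce_fn),
                range(NUM_REDUCERS),
            )
        )
    # One output per reducer, named like the files in the figure.
    return {f"file{r}": out for r, out in enumerate(outputs)}


# Everything below is what the user would write (word count, as an example).
def word_map(split):
    for word in split.split():
        yield word, 1


def word_reduce(key, values):
    return sum(values)
```

A call such as `master(["a b", "b c", "c a"], word_map, word_reduce)` returns a dictionary keyed by `file0` and `file1`. The reduce phase starts only after every map task has finished, which is the simplest possible schedule and leaves out failure handling entirely.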

Issues to be Handled by the Runtime

Master data structures: