TDSM 12.5

From The Data Science Design Manual Wikia
Revision as of 13:54, 12 December 2017 by Trtnguyen (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Definition

Map skew happens when some map tasks take much longer time to be completed than average.

Answer

When we combine counts from each file before emitting them, the information needs to be sent to reducers will reduce, hence the problem is mitigated.