TDSM 12.5

From The Data Science Design Manual Wikia
Jump to: navigation, search

Definition

Map skew happens when some map tasks take much longer time to be completed than average.

Answer

When we combine counts from each file before emitting them, the information needs to be sent to reducers will reduce, hence the problem is mitigated.