TDSM 12.5
From The Data Science Design Manual Wikia
Definition
Map skew happens when some map tasks take much longer time to be completed than average.
Answer
When we combine counts from each file before emitting them, the information needs to be sent to reducers will reduce, hence the problem is mitigated.