Difference between revisions of "TDSM 12.5"
From The Data Science Design Manual Wikia
(Created page with "===Definition=== '''Map skew''' happens when some map tasks take much longer time to be completed than average. ===Answer===") |
|||
Line 4: | Line 4: | ||
===Answer=== | ===Answer=== | ||
+ | |||
+ | When we combine counts from each file before emitting them, the information needs to be sent to reducers will reduce, hence the problem is mitigated. |
Latest revision as of 13:54, 12 December 2017
Definition
Map skew happens when some map tasks take much longer time to be completed than average.
Answer
When we combine counts from each file before emitting them, the information needs to be sent to reducers will reduce, hence the problem is mitigated.