TDSM 2.11
From The Data Science Design Manual Wikia
Let X be the annual salaries of high school graduates
Y be the annual salaries of high school graduates
n be the number of job positions
a) For each possible job title, the college graduates always made 5,000 dollars more than high school grads
⇒ˉY=ˉX+5000 and ∀i(1≤i≤n):Yi=Xi+5000
Correlation efficient of X and Y:
τ=∑ni=1(Xi−ˉX)(Yi−ˉY)√∑ni=1(Xi−ˉX)2√∑ni=1(Yi−ˉY)2=∑ni=1(Xi−ˉX)(Xi+5000−(ˉX+5000))√∑ni=1(Xi−ˉX)2√∑ni=1(Xi+5000−(ˉX+5000))2=∑ni=1(Xi−ˉX)2(√∑ni=1(Xi−ˉX)2)2=1
b) For each possible job title, the college graduates always made 25% more than high school grads
⇒
c) For each possible job title, the college graduates always made 15% less than high school grads
⇒