Covariance and Correlation
Last updated
Last updated
Covariance is a measure of how much two random variables vary together.
If X and Y are independent then . Warning: The converse is false: zero covariance does not always imply independence.
The key point is that measures the linear relationship between X and Y . In the above example and have a quadratic relationship that is completely missed by .
The units of covariance are ‘units of times units of ’. This makes it hard to compare covariances: if we change scales then the covariance changes as well. Correlation is a way to remove the scale from the covariance
ρ is dimensionless (it’s a ratio!).
−1 ≤ ρ ≤ 1. Furthermore, ρ = +1 if and only if Y = aX + b with a > 0, ρ = −1 if and only if Y = aX + b with a < 0.
Knowing squared correlation( ) helps to determine the variance in one variable given the other variable. If is 0.7 between x and y, it means that x predict 70% variation in y.
Correlation doesn't equal causation i.e if x and y are highly correlated doesn't means that either of x or y is cause of other.
Causation can be only determined by probabilty graphs.