Variance

The average squared distance of each value from the mean β€” a measure of spread.

Same mean, different variance
3040506070tightmean = 50variance = 6.0spreadmean = 50variance = 125.7

Variance ignores where the mean sits and measures how far the values are from it, after squaring those distances.

Definition

Variance measures how spread out the values in a dataset are around the mean. A small variance means the values cluster tightly together; a large variance means they are widely scattered.

To compute variance:

  1. Find the mean xˉ\bar{x}.
  2. Subtract the mean from each value to get the deviation.
  3. Square each deviation (so negatives become positive).
  4. Average the squared deviations.
Οƒ2=1nβˆ‘i=1n(xiβˆ’xΛ‰)2\sigma^2 = \frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2
Computing variance

Dataset: 2, 4, 4, 4, 5, 5, 7, 9. Mean =xˉ=5= \bar{x} = 5.

Deviations: βˆ’3,βˆ’1,βˆ’1,βˆ’1,0,0,2,4-3, -1, -1, -1, 0, 0, 2, 4

Squared deviations: 9,1,1,1,0,0,4,169, 1, 1, 1, 0, 0, 4, 16

Οƒ2=9+1+1+1+0+0+4+168=328=4\sigma^2 = \frac{9+1+1+1+0+0+4+16}{8} = \frac{32}{8} = 4
Variance has squared units

If the data is in metres, variance is in metresΒ². That makes variance hard to interpret directly. The standard deviation Οƒ=Οƒ2\sigma = \sqrt{\sigma^2} brings things back to the original units.

Try it

Find the variance of: 1, 3, 5, 7, 9.

Solution

Mean =5= 5. Deviations: βˆ’4,βˆ’2,0,2,4-4, -2, 0, 2, 4. Squared: 16,4,0,4,1616, 4, 0, 4, 16.

Οƒ2=16+4+0+4+165=405=8\sigma^2 = \frac{16+4+0+4+16}{5} = \frac{40}{5} = 8

Related concepts