Section 1
Introduction to Measures of Variation
Property
A measure of center for a numerical data set summarizes all of its values with a single number, while a measure of variation describes with a single number how its values vary.
The first, and easiest, measure is the range. This is given by the difference between the highest and lowest data values.
While easy to compute, it only tells us how far apart the extreme values are and gives no indication about the spread of the data values between the two extremes.
Examples
- A student's test scores are 85, 92, 78, 95, and 88. The highest score is 95 and the lowest is 78. The range is .
- Two basketball players' points per game are recorded. Player A: {15, 17, 16, 18}. Player B: {5, 10, 25, 30}. Player A's range is , while Player B's is , showing Player B's scoring is less consistent.
- The data sets {2, 8, 8, 9, 12} and {2, 3, 4, 5, 12} both have a range of 10. However, the first set is clustered high while the second is more evenly spread, showing a limitation of using only the range.
Explanation
Measures of variability, like range, tell you about the spread of your data. While the mean or median tells you the center, variability describes if the data points are all clustered together or widely scattered apart.