Five-number summary, box-and-whisker plots, and measuring spread with IQR
Reserve & Extensions • K-12
A box plot (also called a box-and-whisker plot) gives you a visual snapshot of how data is spread out. In one compact diagram, you can see the center, spread, skewness, and outliers of a data set -- making it one of the most useful tools in statistics.
Every box plot is built from five key values:
| Statistic | What It Is |
|---|---|
| Minimum | Smallest value (excluding outliers) |
| Q1 (First Quartile) | Median of the lower half (25th percentile) |
| Median (Q2) | Middle value (50th percentile) |
| Q3 (Third Quartile) | Median of the upper half (75th percentile) |
| Maximum | Largest value (excluding outliers) |
Data (already sorted): 2, 4, 5, 7, 8, 10, 11, 13, 15, 18, 20
There are 11 values.
Five-number summary: 2, 5, 10, 15, 20
The IQR measures the spread of the middle 50% of the data. It is not affected by extreme values, making it a more robust measure of spread than the range.
From Example 1: IQR = 15 − 5 = 10.
A data point is an outlier if it falls:
Data: 3, 5, 7, 8, 9, 10, 11, 12, 14, 35
Q1 = 7, Q3 = 12, IQR = 5.
Lower fence: 7 − 1.5(5) = 7 − 7.5 = −0.5
Upper fence: 12 + 1.5(5) = 12 + 7.5 = 19.5
Any value below −0.5 or above 19.5 is an outlier. The value 35 is an outlier.
On the box plot, the right whisker extends to 14 (the largest non-outlier), and 35 is plotted as a separate dot.
Class A scores: Min=55, Q1=68, Median=75, Q3=84, Max=95
Class B scores: Min=40, Q1=72, Median=80, Q3=88, Max=98
Comparing:
The whiskers do NOT always extend to the minimum and maximum of the data. If there are outliers, the whiskers stop at the most extreme non-outlier values. Outliers are shown as separate points. Many students draw whiskers to every data point, which defeats the purpose of outlier detection.
If the median line is closer to Q1, the data is right-skewed (tail stretches right). If closer to Q3, it is left-skewed. If roughly centered in the box, the data is approximately symmetric. Longer whiskers also indicate skewness in that direction.
1. Find the five-number summary: 12, 15, 18, 22, 25, 28, 30, 35, 40.
Min = 12. Q1 = (15+18)/2 = 16.5. Median = 25. Q3 = (30+35)/2 = 32.5. Max = 40. Five-number summary: 12, 16.5, 25, 32.5, 40.
2. For the data in Problem 1, find the IQR and determine if there are any outliers.
IQR = 32.5 − 16.5 = 16. Lower fence: 16.5 − 24 = −7.5. Upper fence: 32.5 + 24 = 56.5. All values are within these fences. No outliers.
3. Data: 1, 2, 3, 4, 5, 6, 7, 8, 9, 50. Find Q1, Q3, IQR, and identify outliers.
Q1 = 3, Q3 = 8, IQR = 5. Upper fence: 8 + 7.5 = 15.5. The value 50 exceeds 15.5, so 50 is an outlier.
4. Two box plots show: Group X has IQR = 8 and Group Y has IQR = 20, but both have the same median. What does this tell you?
Both groups have the same typical value (same center), but Group Y has much more variability in the middle 50% of its data. Group Y's scores are more spread out around the median.
5. A box plot shows the median very close to Q3 and a long left whisker. Describe the shape of the distribution.
The distribution is left-skewed (negatively skewed). Most data is concentrated at the higher end, with a long tail stretching toward lower values.