Skip to main content icon/video/no-internet

The stem-and-leaf display is an exploratory data analysis technique developed by John Tukey to summarize graphically the characteristics of a distribution. It is especially easy to produce by hand, and it is effective for examining distributions for small sample sizes. (All major statistical packages provide point- and-click routines that can be used readily to examine the distributions in the case of large sample sizes.) At the top of the next page are side-by-side stem-and-leaf displays for 100 randomly selected IQ scores and the subset of all scores greater than 99.

To construct a stem-and-leaf display, we place the data in ascending order and break each score into two parts, a stem (the leading digit[s], which contains the most salient information) and a leaf (the trailing digit[s], which contains less salient information). Two common line depths (i.e., “stem widths”) are created by forming stems that include the digits 0–4 and 5–9 (see Random IQ display) or by forming stems that include the digits 0–1, 2–3, 4–5, 6–7, and 8–9 (see IQ>99 display).

In the stem-and-leaf display for the variable Random IQ, the first line (1 6 4) indicates there is one score in the line (“line depth”), the IQ score 64, which is partitioned into a stem (6) and a leaf (4). For scores below the median, line depths are calculated by counting the total number of scores from the bottom (lowest score) of the distribution up to, and including, the scores in that line. For scores above the median, line depths are calculated by counting the total number of scores from the top (highest score) of the distribution down to, and including, the scores in that line. For the line that contains the median (IQ score 104), the depth of the line is reported in parentheses (20). Note that the center three depths (31, 20, 49) add to N = 100 since they account for all the scores within the distribution. Note also that some computer programs report leaf frequency instead of depths for all lines.

Stem-and-leaf of Random IQ N = 100 Stem-and-leaf of IQ > 99 N = 69
Leaf Unit = 1.0 Leaf Unit = 1.0
Depth Stem Leaf 6 10 000111
1 6 4 15 10 222333333
1 6 25 10 4444455555
3 7 23 29 10 6777
6 7 678 (8) 10 88888899
7 8 3 32 11 0011111
13 8 556789 25 11 22333
22 9 011133444 20 11 55
31 9 667788999 18 11 667
(20) 10 00011122233333344444 15 11 88899
49 10 55555677788888899 10 12 01
32 11 001111122333 8 12 22
20 11 5566788899 6 12 445
10 12 012244 3 12
4 12 58 3 12 8
2 13 01 2 13 01

We can locate the center of the distribution (i.e., find the median) by eye or by counting in from either end until half of the observations are counted. We can examine the range of the data by locating the minimum and maximum values. We can locate the quartiles of the distribution either by eye or by counting. Turned on its side, the stem-and-leaf display forms a “digital histogram” that can be used to examine distribution shape, locate peaks and gaps, and identify unusual observations.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading