Measures of Variability

Nicole M. Nickens

doi:10.4135/9781506326139

Entry
Reader's guide
Entries A-Z
Subject index

Return to Entries

Measures of Variability

By: Nicole M. Nickens
In:The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation
Chapter DOI:https://doi.org/10.4135/9781506326139.n423
Subject:Education

Request Permissions

Show page numbers Hide page numbers

Observations within a data set are not of equal value; they vary along a given scale. The extent to which they vary between and among themselves can be indicated by measures of variation or variability. Measures of variability show the amount of dispersion in the data set or, in other words, how much the observations or values are spread out along the scale. Dispersion within a data set can be measured or described in several ways, including the range, interquartile range, and standard deviation. This entry provides a definition, description, and calculation of each measure of variability along with advantages and disadvantages in using each. It also includes a discussion of standard deviation in a normal distribution, or the empirical rule, and Chebyshev’s theorem.

Measures of central tendency (such as mean, median, and mode) each provide a single value that represents or is descriptive of the whole data set; measures of variability each provide a single value that represents how much spread exists in the data set. Measures of central tendency combined with measures of variability can offer an accurate summary description of a whole data set. A data set that contains values or observations that are much higher or much lower than the mean (extreme scores or outliers) has high dispersion. One measure of variability may be more appropriate than another depending upon the data set.

Range

The range of a set of data is the difference between the largest and smallest value in the data set; it is calculated very simply by using this formula:

$Range = maximum value - minimum value,$

and it is very useful not only for showing dispersion in a data set but also for comparing variability between and among similar data sets. In the previous example, the professor of the college algebra course may want to compare dispersion of Exam 1 scores for three different classes or to compare dispersion of scores for Exams 1, 2, and 3 for the same class.

Although the range is the easiest measure of variability to find, using range to describe dispersion is not always the most appropriate choice. Calculation of range relies on only two values, so it is very sensitive to extreme values. If the highest and/or lowest value is/are an outlier(s), the range will provide an inaccurate picture of how much spread actually exists in the data set. Using the previous college algebra example, the professor wants to find the amount of dispersion in the set of scores from the first exam; there are 25 students in the class. If the highest score in the class is 90 of 100 and the lowest score is 20 of 100, the range is 90 – 20 = 70. This indicates a great deal of dispersion on a 100-point scale. However, if the majority of the scores cluster around the class average of 75 on the exam, and the scores of 20 and 90 are both outliers, then the range of 70 is misleading.

Interquartile Range

For data sets that contain outliers, the interquartile range can be calculated as a measure of variability; the interquartile range is not sensitive to outliers because it considers variability only within the middle 50% of the data set. The interquartile range is related to the median, which is a measure of central tendency that divides a distribution in half with 50% of the scores below and 50% of the scores above it. The distribution or data set can be further divided into fourths or quartiles. If there are 16 values in the data set, each quartile will contain 4 values; the data are not divided into 25% portions of total value but rather 25% of the number of observations. If the values in the data set are in order of ascending value, the first quartile contains the lowest 25% of the values; the second quartile contains the next 25% of the values; the third quartile contains the second highest 25% of the values; and the fourth quartile contains the highest 25% of the values. The interquartile range can be calculated by subtracting the first quartile from the third

...

Sign in to access this content

Get a 30 day FREE TRIAL

Watch videos from a variety of sources bringing classroom topics to life
Read modern, diverse business cases
Explore hundreds of books and reference titles

No internet connection.

All search filters on the page have been cleared.

Your search has been saved.

Entry

Reader's guide

Entries A-Z

Subject index

Measures of Variability

Range

Interquartile Range

Sign in to access this content

Get a 30 day FREE TRIAL

Read next

More like this

Sage Recommends