Skip to main content icon/video/no-internet

Simple Descriptive Statistics

Descriptive statistics allow a researcher to quantify and describe the basic characteristics of a data set. As such, descriptive statistics serve as a starting point for data analysis, allowing researchers to organize, simplify, and summarize data. A data set, which contains hundreds or thousands of individual data points or observations, for example, can be condensed into a series of statistics that provide useful information on the population of interest. Moreover, descriptive statistics determine which advanced statistical tests are appropriate. Descriptive statistics do not allow the researcher to make presumptive conclusions about the population of interest, however, as this is reserved for more advanced, inferential statistics.

Descriptive statistics provide information along two main dimensions: measures of central tendency and measures of spread. In addition, it is also helpful to consider the distribution of the data, as distribution impacts statistical analysis and is considered when deciding which measures are the most appropriate for a given data set. This entry introduces simple descriptive statistics, including measures of central tendency, distribution, and variance; their usefulness to researchers; and their corresponding limitations.

Measures of Central Tendency

Consider a data set comprised of 100 public speaking midterm exams. A professor is interested in learning more about students’ performances on the midterm; however, looking at each score individually does not provide any useful information about the collective data set. Moreover, it is tedious to look at each individual score. Measures of central tendency allow the researcher to learn more about students’ performances by calculating a single score, which is representative of the performances of this particular group of students. Hence, central tendency is a specific measure aimed at finding one score that resides in the center of a distribution. The three most common measures of central tendency are mean, median, and mode; it should be noted that among these, no single measure is considered ideal for all situations.

Mean

The mean (X¯) is one of the most common statistical measures used to describe a sample population. Commonly referred to as the “average,” the mean is calculated by finding the sum of all points in a data set and dividing it by the total number of observations. The formula for X¯ is as follows:

X ¯ = Σ ( X 1 + X 2 . . . X n ) / N

where X represents each observation in the data set, and N is the total number of observations. Using this formula, the professor can take the sum of all 100 of the individual midterm exam scores (score student 1 + score student 2 + score student 3 + . . . score student 100) and divide by the total number of scores (N = 100). The resulting sample mean (X¯) provides useful information related to the average performance on the midterm exam by this group of students (i.e., one score that resides in the center of a distribution). The professor can also compare the means from semester to semester or professor to professor, assuming that the same or very comparable exam is being used, to identify basic trends. It is important to remember, however, that inferring why scores may vary or remain unchanged is beyond the capabilities of descriptive statistics.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading