Skip to main content icon/video/no-internet

Bootstrapping, or the bootstrap, is a statistical methodology that is frequently used in situations where standard distributional assumptions, such as normality, do not hold. In addition, the bootstrap can be used to estimate standard errors and confidence intervals for parameter estimates. It is particularly useful where there is not a known sampling distribution for the statistic of interest, thereby making calculation of standard errors difficult or impossible. There are a number of variations in the bootstrap that make it useful in a wide variety of situations. Regardless of context or application, the bootstrap is based upon a basic framework of resampling with replacement from the original sample. This entry discusses the basic nonparametric bootstrap, bootstrap confidence intervals, variations in the bootstrap, and when to use the bootstrap.

Basic Nonparametric Bootstrap

As an example, we will consider the problem of estimating the standard error for the mean, x. This statistic can be calculated in a straightforward manner using the equation:

SEx=S/Squareroot(N),

where S is the standard deviation of the sample and N is the sample size.

Equation 1 is based upon an assumption that the population distribution underlying the variable x is normal. However, if this is not the case, then Equation 1 no longer yields the appropriate standard error estimate of x. The bootstrap offers an alternative approach for calculating the standard error. The basic nonparametric bootstrap operates using the following steps:

  • Calculate sample statistic of interest (e.g., x) for the original sample.
  • Randomly sample n individuals from the original sample of size n, with replacement; individuals can appear multiple times in the bootstrap sample, while others may not appear at all.
  • Calculate the mean, xB*, for the bootstrap sample.
  • Repeat Steps 2 and 3 many times (e.g., B = 10,000) to create a sampling distribution for the statistic of interest.
  • Calculate the bootstrap standard error: SB = 1 BxB*x* 2B−1.

To illustrate the bootstrap, consider the following simple example involving a sample of five individuals with scores 8, 3, 6, 1, and 5. The mean of these values is 4.6, and the standard deviation is 2.7. Based on Equation 1, the standard error is Sx = 2.75 = 1.2. Now, let’s draw five bootstrap samples (which would be far too few in practice but helps to illustrate how the bootstrap works) and calculate the mean for each. The samples appear below.

The standard deviation of the means for the five bootstrap samples is 0.68. Thus, based on these five samples, we would report that the bootstrap standard error of the mean is 0.68. In actual practice, we would use many more than five bootstrap samples, perhaps as many as 10,000. To finish this illustration of the basic bootstrap, a total of 1,000 bootstrap samples of the five data points were drawn using the software package SPSS, yielding a bootstrap standard error estimate of 1.06.

Bootstrap Confidence Intervals

Standard errors are frequently used to construct a confidence interval for the statistic of interest, in this case the mean. Confidence intervals reflect the neighborhood of values within which the population parameter is likely to reside. Using normal theory methods, the confidence interval for the mean is calculated

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading