Skip to main content icon/video/no-internet

Stepwise Regression

Stepwise regression is an automatic computational procedure that attempts to find the “best” multiple regression model using only statistically significant predictors from a larger set of potential predictive variables. The regression model describes the relationship between a dependent (outcome) variable (Y) and two or more independent (predictor or explanatory) variables (Xj), using a model where Y=B0+B1X1+B2X2++BkXK+e. Computer programs are used to find the Bj weight for each Xj variable so as to minimize the sum of the squared error (e) for cases used to generate the model. The unique contribution of each Xj variable can be tested with a t test and associated p value, using the null hypothesis that Bj = 0 in the population. Stepwise regression attempts to find the best regression model by adding or deleting variables one at a time based solely on the p values for the individual predictors at each step. Although stepwise regression can be useful if it is applied and interpreted appropriately, it has been heavily criticized because it is often misused and misinterpreted. This entry presents the stepwise estimation procedures, reviews the common criticisms, and provides a recommended alternative.

Stepwise Estimation Procedures

There are two main stepwise approaches:

  • Forward selection begins by selecting from a pool of potential predictors the single predictor variable with the smallest statistically significant p value (if any), and then on each successive step selecting the individual variable with the smallest statistically significant p value for its added contribution to the model, until no remaining potential predictor would make a statistically significant contribution.
  • Backward elimination begins with all potential predictors in the model and deletes variables one by one beginning with the variable with the largest nonstatistically significant p value, until every variable remaining in the model (if any) is statistically significant.

Stepwise regression combines the two approaches by testing every variable at each step for both inclusion and exclusion, with criteria set to allow variables already in the model to be eliminated more easily than new variables to be added. For example, one might require p < .05 for a variable to be added, while a variable already in the model would be eliminated if p > .10 at any step. As an option, specific variables may be forced into the model or be given priority consideration before other variables are considered for stepwise inclusion. A related method is all possible subsets regression, whereby a computer program tests all possible combinations of potential predictors and identifies the best model based on some criterion.

All of these methods of allowing a computer program to select the best model according to some statistical criteria have come under severe criticism from statisticians. Common criticisms of these procedures are that the R2 values are biased to be too large, incorrect tests of statistical significance are often used, models are unstable when potential predictors are correlated, and final models may not be the most useful for practical or theoretical purposes.

Criticisms of Stepwise Regression

Criticism 1: The Observed Multiple R Is Inflated

Selecting the best predictors from a larger set of potential predictors capitalizes on chance variation in the observed data set, and the model is unlikely to do as well when applied to new data sets. Statistical programs offer “adjusted” or “shrunken” R2 as a better estimate of the population R2. Consider a situation with N = 40 cases, where 18 potential predictors were considered, and the final stepwise regression model has three predictors and R2 = .500. Adjusted R2 would be reported as .458 based on 40 cases with three predictors. But this adjustment does not take into account that 18 variables were considered. If 18 predictors were used to generate an R2 value of .500, the adjusted R2 would be only .071! Thus, even the reported adjusted R2 is inflated. Stepwise regression capitalizes on random variations in the sample data, so the model is unlikely to fit a new sample as well as it fits the sample that was used to generate the model. Inflation of R2 is greater with smaller samples and more variables.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading