Skip to main content icon/video/no-internet

Linear regression is a powerful tool for testing theories about relationships among observables, and it is also useful for researchers interested in the predictive power of a set of variables. The terms linear regression analysis and general linear model are often used in the same contexts. The general linear model (GLM) is a broad class of interrelated statistical procedures focusing on linear relationships among variables or variable composites. The term linear is used because these techniques can be represented visually by plotting one variable against another on two-dimensional charts and using mathematical formulae for determining where to draw one or more lines that will represent the relationships visually among the variables.

Regression analysis is the most broad, or general, form of the GLM. Hence, regression analysis forms the basis for many other statistical techniques, or, stated differently, a number of other common statistical procedures (e.g., analysis of variance, analysis of covariance, t test, Pearson product-moment correlation, Spearman rho correlation) are all specially designed versions of regression analysis. Furthermore, regression serves as a general framework for understanding a host of related multivariate statistics, most generally subsumed under canonical correlation analysis. An excellent nontechnical introduction to linear regression is provided by Schroeder, Sjoquist, and Stephan, and additional useful information may be found on the Web site of the Multiple Linear Regression Special Interest Group of the American Educational Research Association (http://www.coe.unt.edu/mlrv/).

Simple Linear Regression

Simple linear regression examines the relationship between two variables, one of which is referred to as the predictor variable (i.e., the variable that usually precedes the other), and the other of which is referred to as the criterion variable (i.e., the variable that the researcher is interested in explaining, predicting, or better understanding). Because simple regression results provide an understanding of the patterns of relationships between the two variables of interest in a given context, we often use the term prediction in describing the relationship. The procedure is called “simple” because it includes only one predictor variable.

As in studies employing Pearson product-moment correlation (r), the linear relationship between the predictor (X) and the criterion (Y) variables can be shown on a two-way scatterplot. A line of best fit drawn through the scatterplot is called the regression line, and the statistic representing the relationship between the two sets of points is called multiple R. In Pearson product-moment correlation, the coefficient r is used to show the strength and directionality of a relationship between two variables. A value of r close to zero indicates a low or negligible correlation between two variables, whereas a value closer to |1| indicates a more appreciable amount of correlation between the variables. Negative r values depict inverse relationships between variables, whereas positive r values depict direct relationships. Multiple R is very similar to the Pearson r with the exception that it is always positive in value (0 ≤ R ≤ 1) because of a set of variable weights developed as part of the analysis. Because there is only one predictor in simple linear regression, R will be equivalent to the absolute value of the Pearson correlation coefficient (|r|) between the two variables.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading