Skip to main content icon/video/no-internet

Generalized Linear Mixed Models

The generalized linear mixed model (GLMM) is a statistical framework that broadens the traditional general linear model to include variables that are not normally distributed, relationships that are not strictly linear, and data that have dependency. The general linear model is the foundational statistical structure that includes almost all parametric statistical procedures such as linear regression and analysis of variance. Two of this model’s offshoots are the generalized linear model and the linear mixed model. This entry describes these three models and then explores their most flexible of offspring, the GLMM.

General Linear Model

The general linear model is useful in answering research questions about the impact of one or multiple predictor variables on an outcome variable. A predictor variable is a variable that is being manipulated (ideally) in an experiment to observe its impact on the outcome variable, whereas the outcome variable is a variable that is hypothesized to be changed by the predictor variable(s). Alternative names for the predictor variable are explanatory variable or independent variable. Likewise, the outcome variables are sometimes referred to as response variables or dependent variables.

Statistically, a general linear model represents conventional linear regression models with a continuous outcome variable predicted by one or more continuous and/or categorical variables. A general linear model includes a simple linear model, a multiple linear model, as well as the analysis of variance and the analysis of covariance. In a general linear model, the model can be expressed in the following equation:

yi=β0+β1x1i++βjxji+ei.

The outcome variable yi is modeled by a linear function of the predictor variables xji, plus an error term (ei). The subscript i indicates the ith observation, whereas the subscript j indicates the jth predictor variable.

The word linear in a general linear model implies that a combination of parameters β0 … βj could predict the observed values of the outcome variable. The word general refers to the fact that the outcome variable is dependent on potentially more than one predictor variable and it is normally distributed.

The previous equation can be rewritten with a shortcut as:

y=Xβ+e.

Each bold letter is a matrix. The matrix y represents the values of the outcome variable for all N observations, with a size of N × 1. The matrix X stores the values of all j predictor variables for all N observations, with a size of N × j. The matrix β indicates the estimated values of the general linear model parameters, with a size of j × 1. The matrix e shows the difference between the predicted and the observed value of the outcome variable (i.e., residual/error) for all N observations, with a size of N × 1.

A general linear model is the most widely used statistical model, while it has to meet certain assumptions: (a) linearity, (b) data independency, and (c) the residuals are independent of each other and normally distributed, eiN (0, σ2). The linearity assumption means there is a linear relationship between the predictor variables and the outcome variable. In other words, a one-unit change in the predictor variable is expected to bring about the same amount of change in the outcome variable for all observations. The data independency assumption means each observation is independent of another.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading