Skip to main content icon/video/no-internet

ANALYSIS OF VARIANCE (ANOVA) models can be used to study how a dependent variable Y depends on one or several factors. A factor here is defined as a categorical independent variable; in other words, an explanatory variable with a nominal LEVEL OF MEASUREMENT. A fixed effects ANOVA model (also called a Type I ANOVA model) has fixed (i.e., nonrandom) parameters expressing the effect of the categories of each factor. Such a model is appropriate if the statistical inference aims at finding conclusions that hold for precisely the categories of the factors that are present in the data set at hand. This contrasts with RANDOM EFFECTS MODELS, which are described in another article and can be appropriate if the observations of the categories of the factor are a sample from a population. An example of a fixed effects ANOVA model could be a study about work satisfaction of employees depending on gender and a classification in a small number of job categories.

Fixed effects ANOVA models are special cases of the GENERAL LINEAR MODEL, which is basic to REGRESSION analysis. The effects of the categories of a factor in an ANOVA can be expressed as regression coefficients of dummy variables in a regression analysis. The general linear model can be formulated as

None

where i indicates the case, Y is the dependent variable, X1 to Xp are the independent (or explanatory) variables, and E is the unexplained part, usually called the residual or error term. The quantity bh is the regression coefficient of variable Xh. The regression coefficients are fixed (i.e., nonrandom) quantities and also are called fixed effects. They are characteristics of the population for which the regression model is defined. Fixed effects occur similarly in other, more complicated models, such as generalized linear models or nonlinear regression models.

There is another reason for modeling effects of a given factor as fixed rather than random, irrespective of whether the categories of the factor can be regarded as a sample from a population. This is for factors or other variables that are not of primary interest for the researcher but that have to be included in the model to achieve a good model specification; such variables are called nuisance variables or control variables. In many cases (depending on the correct model specification), such nuisance variables are better controlled for by including them with fixed effects than with random effects. “Better control” means that the parameters of primary interest can be estimated with less bias, or without bias altogether. One class of examples is panel data, used in econometrics, where, in some studies, the time variable (often the year date) has the role of a nuisance variable, which is controlled for by including it as a fixed effect (see Greene, 2003). Another example is the study about work satisfaction of employees depending on gender and job categories mentioned earlier, now conducted in a (small or large) number of different companies. The companies, if regarded as a nuisance variable, could be treated as a fixed effect. On the other hand, if there is interest in the amount and kind of variability between the companies, they could be included as a random effect, which would lead to a MIXED-EFFECTS model.

...

locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading