Skip to main content icon/video/no-internet

Marginal Maximum Likelihood Estimation

Maximum likelihood estimation is one of the backbones of statistical analysis. It is used to obtain parameter estimates for a wide variety of models, including regression, factor analysis, and item response theory (IRT) analyses, among many others. When these estimates are based on data that are only marginally or partially observed, the procedure is called marginal maximum likelihood estimation (MMLE).

This article uses IRT analyses, a context in which the MMLE strategy is most common, to describe the assumptions, mathematics, and procedures of MMLE. The various models contained within the IRT family allow researchers to obtain estimates of an individual’s level of a latent trait of interest, typically referred to as θ, as well as information about the items, including their location on the θ scale (difficulty), their ability to differentiate individuals with different levels of θ (discrimination), and the likelihood that an individual would respond to an item correctly due solely to chance (pseudo-guessing). A distinct advantage of using IRT to estimate these quantities is that it is able to do so in such a way that item parameter estimates are independent of the individuals actually responding to the items, and person ability estimates are independent of the items that they complete. This property, along with the fact that item location and person ability are on the same scale, makes IRT a particularly attractive modeling tool for researchers working with item response data. It is the use of MMLE that supports these properties.

Maximum Likelihood Estimation

Maximum likelihood estimation procedures rest on assumptions about the data distribution of the population from which the sample was drawn. For example, in IRT, it is generally assumed that the latent trait of interest, θ, is normally distributed, although it is sometimes possible to relax this assumption. Consider a reading test made up of 20 multiple-choice items, each of which has a specific level of difficulty. Furthermore, let’s assume that the difficulty values are known. Given this information, one can use the item response pattern to estimate an examinee’s reading ability. If Examinees A and B both answer 8 of the 20 items correctly, each will have a raw score of 8. However, if the 8 items answered correctly by Examinee A were more difficult than the 8 items answered correctly by Examinee B, then the value of ability level, θ, for Examinee A should be estimated as higher than that of Examinee B. The ability to account for differences in the difficulty of items answered correctly by respondents is one of the great strengths of using maximum likelihood estimation in the context of IRT parameter estimation.

Maximum likelihood estimation works by finding the values of the model parameters that maximize a likelihood function for the item responses. For the reading test example, the item response pattern is simply the combination of 1s and 0s that reflect the correct and incorrect responses to the items. Thus, for the 20-item reading assessment, one possible response pattern would be 11010011101110001110. In short, the maximum likelihood approach can be thought of as a sophisticated search algorithm that seeks to find the combination of item and person parameters that as closely as possible reproduces the various item response patterns in the data.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading