Skip to main content icon/video/no-internet

Establishing construct relationships is at the heart of social scientific research. To estimate associations between continuous (interval/ratio) variables, the Pearson’s product-moment correlation coefficient (r; developed by Karl Pearson) is often used. An r estimate indicates the direction (+/–) and the magnitude (0 to |1|) of the association between two continuous variables. However, such estimates are inappropriate for categorical-variable associations, given categorical (nominal/ordinal) variables do not follow the normal distribution nor do they have equal distances between intervals. For estimating associations between categorical variables, the Pearson chi-squared statistic (χ2; also developed by Pearson) is often used. A χ2 estimate (minimum of 0) indicates the in/dependence between two nominal variables (chi-squared statistic tends to lose power when estimating ordinal-variable associations). A significant χ2 estimate is an evidence for the claim that the detection of an association between two nominal variables is not by mere change. Yet, interpreting χ2 estimates can be difficult. Unlike r estimates, χ2 estimates do not have an upper bound of +1. Cramér’s statistic (VC; developed by Harald Cramér) facilitates the interpretation of nominal-variable association estimates, given this index ranges from 0 to +1. A higher VC indicates a stronger association. Hence, VC estimates are often (and should be) reported in addition to χ2 estimates, as an effect size index.

This entry reviews Cramér’s V with a focus on explaining common reasons for and conditions of its use. To provide appropriate context for Cramér’s V, this entry extends discussions of broader topics of effect size and the chi-squared statistic. In addition, examples of Cramér’s V are provided and related key concepts are discussed. Finally, this entry reviews some potential issues with Cramér’s V, particularly as it relates to similar statistics.

Tests of Nominal-Variable Associations

In order to understand the utility of Cramér’s V, it is important to understand the ways in which statistical tests of significance differ from measures of association for categorical variables. The chi-squared (χ2) test for independence provides a statistical test of association between two categorical (nominal) variables from a single population. The test is used to determine whether the association between two variables is significant, with the null hypothesis being that the two variables are not dependent on one another.

In chi-square analysis, expected frequencies are generated following the null hypothesis and compared against the observed frequencies. The chi-square statistic tests whether one variable is independent of the other variable. If the fit is good—that is, if the difference between the expected frequencies and the observed frequencies is small—the chi-square statistic will be small and one would conclude that the two variables are independent. Conversely, a poor fit yields a large chi-square statistic and rejection of the null hypothesis, and suggests the two variables are related.

Although a significant chi-squared statistic suggests a relationship exists between two variables, it does not describe the strength of association. Variation in the size of the chi-square statistic influences the level of confidence in rejecting or retaining null hypotheses (p values), but higher and lower chi-square statistics do not necessarily correspond with varying strengths of associations. Because the chi-square statistic is sensitive to sample size, very weak relationships can produce very large chi-square values in large sample sizes. For example, flipping a coin only 10 times and correctly predicting two of four heads-up results and five of six tails-up results yields a chi-square statistic of 1.27 and p-value of .260, whereas flipping a coin 1,000 times with results of the same proportions—i.e., correctly predicting 200 of the 400 heads and 500 of the 600 tails—yields a chi-square statistic of approximately 127 and p-value of less than .001. As this example illustrates, despite the proportions of frequencies remaining the same, the chi-square statistic and its corresponding probability vary as a function of sample size.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading