Skip to main content

Simpson’s Paradox

Encyclopedia
Edited by: Published: 2018
+- LessMore information
Download PDF

Simpson’s paradox, first defined by Edward H. Simpson in 1951, is a statistical phenomenon in which the association between two variables reverses or disappears when examining aggregate versus disaggregate data of a population via a third variable. Alternative known names of Simpson’s paradox are Yule effect, reversal paradox, or amalgamation paradox.

The practical implication to decision making that Simpson’s paradox raises is the question of which level of data aggregation presents the results of interest. This question further raises the challenge of identifying potential variables and then establishing a criterion for deciding if and which of the potential variables should influence the decision making.

Figure 1 Simpson’s paradox illustration for categorical cause and outcome variables

Simpson’s paradox is commonly defined for a categorical cause variable (C) and a ...

Looks like you do not have access to this content.

Reader's Guide