Saved in:
Bibliographic Details
Main Authors: Dai, Guisheng, Wang, Weizhen
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.12864
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Simpson's Paradox is a well-known phenomenon in statistical science, where the relationship between the response variable $X$ and a certain explanatory factor of interest $A$ reverses when an additional factor $B_1$ is considered. This paper explores the extension of Simpson's Paradox to any given number $n$ of factors, referred to as the $n$-factor Simpson's Paradox. We first provide a rigorous definition of the $n$-factor Simpson's Paradox, then demonstrate the existence of a probability distribution through a geometric construction. Specifically, we show that for any positive integer $n$, it is possible to construct a probability distribution in which the conclusion about the effect of $A$ on $X$ reverses each time an additional factor $B_i$ is introduced for $i=1,...,n$. A detailed example for $n = 3$ illustrates the construction. Our results highlight that, contrary to the intuition that more data leads to more accurate inferences, the inclusion of additional factors can repeatedly reverse conclusions, emphasizing the complexity of statistical inference in the presence of multiple confounding variables.