Saved in:
Bibliographic Details
Main Authors: Landsiedel, Kirsten E., Abbott, Rachel, Mucunguzi, Atukunda, Mwangwa, Florence, Kakande, Elijah, Charlebois, Edwin D., Marquez, Carina, Kamya, Moses R., Balzer, Laura B.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.03336
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908960797229056
author Landsiedel, Kirsten E.
Abbott, Rachel
Mucunguzi, Atukunda
Mwangwa, Florence
Kakande, Elijah
Charlebois, Edwin D.
Marquez, Carina
Kamya, Moses R.
Balzer, Laura B.
author_facet Landsiedel, Kirsten E.
Abbott, Rachel
Mucunguzi, Atukunda
Mwangwa, Florence
Kakande, Elijah
Charlebois, Edwin D.
Marquez, Carina
Kamya, Moses R.
Balzer, Laura B.
contents Missing data are ubiquitous in public health research. When estimating causal effects, there are well-established methods to address bias to due missing outcomes. Commonly, causal estimands are defined under hypothetical interventions to "set" the exposure and to prevent missingness. We demonstrate how this framework can be extended to missing exposures. We further extend this framework to incorporate missingness on the baseline outcome, which induces missingness on the population of interest. To do so, we highlight the use of Counterfactual Strata Effects: causal estimands where the focus population is subject to missingness and/or impacted by the exposure. Our work is motivated by SEARCH-TB's investigation of the effect of alcohol consumption on the risk of incident tuberculosis (TB) infection in rural Uganda. This study posed several real-world challenges: confounding, missingness on the exposure (alcohol use), missingness on the baseline outcome (defining who was at-risk of TB and, thus, in the focus population), and missingness on the outcome at follow-up (capturing who acquired TB). We present a series of causal models and identification results to demonstrate the handling of missingness in these settings. We highlight the use of TMLE with Super Learner and the real-world consequences of our approach.
format Preprint
id arxiv_https___arxiv_org_abs_2506_03336
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Causal Inference with Missing Exposures and Missing Outcomes
Landsiedel, Kirsten E.
Abbott, Rachel
Mucunguzi, Atukunda
Mwangwa, Florence
Kakande, Elijah
Charlebois, Edwin D.
Marquez, Carina
Kamya, Moses R.
Balzer, Laura B.
Methodology
Missing data are ubiquitous in public health research. When estimating causal effects, there are well-established methods to address bias to due missing outcomes. Commonly, causal estimands are defined under hypothetical interventions to "set" the exposure and to prevent missingness. We demonstrate how this framework can be extended to missing exposures. We further extend this framework to incorporate missingness on the baseline outcome, which induces missingness on the population of interest. To do so, we highlight the use of Counterfactual Strata Effects: causal estimands where the focus population is subject to missingness and/or impacted by the exposure. Our work is motivated by SEARCH-TB's investigation of the effect of alcohol consumption on the risk of incident tuberculosis (TB) infection in rural Uganda. This study posed several real-world challenges: confounding, missingness on the exposure (alcohol use), missingness on the baseline outcome (defining who was at-risk of TB and, thus, in the focus population), and missingness on the outcome at follow-up (capturing who acquired TB). We present a series of causal models and identification results to demonstrate the handling of missingness in these settings. We highlight the use of TMLE with Super Learner and the real-world consequences of our approach.
title Causal Inference with Missing Exposures and Missing Outcomes
topic Methodology
url https://arxiv.org/abs/2506.03336