Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Adhikari, Shishir, Muscioni, Guido, Shapiro, Mark, Petrov, Plamen, Zheleva, Elena
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2503.11477
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866910261878718464
author Adhikari, Shishir
Muscioni, Guido
Shapiro, Mark
Petrov, Plamen
Zheleva, Elena
author_facet Adhikari, Shishir
Muscioni, Guido
Shapiro, Mark
Petrov, Plamen
Zheleva, Elena
contents Understanding the factors that trigger or prevent undesirable health outcomes across patient subpopulations is essential for designing targeted interventions. While randomized controlled trials and expert-led patient interviews are standard methods for identifying these factors, they can be time-consuming or infeasible. Causal discovery offers an alternative to conventional approaches by generating cause-and-effect hypotheses from observational data, yet its practical utility is limited by strong or untestable assumptions. This work presents a novel, end-to-end framework that uniquely integrates an ensemble of causal structure learning (CSL) algorithms with heterogeneous causal effect estimation. By aggregating results across multiple algorithms, the framework identifies robust causal relationships that persist under different modeling assumptions while simultaneously revealing how these effects vary across specific patient contexts. The proposed heterogeneous causal discovery framework improves robustness and provides practitioners with a prioritized set of actionable, clinically interpretable hypotheses. We demonstrate the framework's effectiveness through two large-scale healthcare applications: identifying drivers and inhibitors of repeat emergency department visits among diabetic patients and hospital readmissions among ICU patients, using insurance claims and electronic health record datasets. Our results, across both settings, identify chronic disease management and care coordination as key interventions, while revealing that intervention effectiveness depends on specific patient-level modifiers. We employ a multi-layered validation strategy, including ground-truth recovery via simulations, alignment with clinical literature, validation by expert clinicians, and portability in modern healthcare systems using an external dataset, to demonstrate the framework's practical utility.
format Preprint
id arxiv_https___arxiv_org_abs_2503_11477
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Heterogeneous Causal Discovery of Repeated Undesirable Health Outcomes
Adhikari, Shishir
Muscioni, Guido
Shapiro, Mark
Petrov, Plamen
Zheleva, Elena
Artificial Intelligence
Understanding the factors that trigger or prevent undesirable health outcomes across patient subpopulations is essential for designing targeted interventions. While randomized controlled trials and expert-led patient interviews are standard methods for identifying these factors, they can be time-consuming or infeasible. Causal discovery offers an alternative to conventional approaches by generating cause-and-effect hypotheses from observational data, yet its practical utility is limited by strong or untestable assumptions. This work presents a novel, end-to-end framework that uniquely integrates an ensemble of causal structure learning (CSL) algorithms with heterogeneous causal effect estimation. By aggregating results across multiple algorithms, the framework identifies robust causal relationships that persist under different modeling assumptions while simultaneously revealing how these effects vary across specific patient contexts. The proposed heterogeneous causal discovery framework improves robustness and provides practitioners with a prioritized set of actionable, clinically interpretable hypotheses. We demonstrate the framework's effectiveness through two large-scale healthcare applications: identifying drivers and inhibitors of repeat emergency department visits among diabetic patients and hospital readmissions among ICU patients, using insurance claims and electronic health record datasets. Our results, across both settings, identify chronic disease management and care coordination as key interventions, while revealing that intervention effectiveness depends on specific patient-level modifiers. We employ a multi-layered validation strategy, including ground-truth recovery via simulations, alignment with clinical literature, validation by expert clinicians, and portability in modern healthcare systems using an external dataset, to demonstrate the framework's practical utility.
title Heterogeneous Causal Discovery of Repeated Undesirable Health Outcomes
topic Artificial Intelligence
url https://arxiv.org/abs/2503.11477