Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pryce, Matthew, Diaz-Ordaz, Karla, Keogh, Ruth H., Vansteelandt, Stijn
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2412.19711
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910910230036480
author	Pryce, Matthew Diaz-Ordaz, Karla Keogh, Ruth H. Vansteelandt, Stijn
author_facet	Pryce, Matthew Diaz-Ordaz, Karla Keogh, Ruth H. Vansteelandt, Stijn
contents	When estimating heterogeneous treatment effects, missing outcome data can complicate treatment effect estimation, causing certain subgroups of the population to be poorly represented. In this work, we discuss this commonly overlooked problem and consider the impact that missing at random (MAR) outcome data has on causal machine learning estimators for the conditional average treatment effect (CATE). We propose two de-biased machine learning estimators for the CATE, the mDR-learner and mEP-learner, which address the issue of under-representation by integrating inverse probability of censoring weights into the DR-learner and EP-learner respectively. We show that under reasonable conditions, these estimators are oracle efficient, and illustrate their favorable performance through simulated data settings, comparing them to existing CATE estimators, including comparison to estimators which use common missing data techniques. We present an example of their application using the GBSG2 trial, exploring treatment effect heterogeneity when comparing hormonal therapies to non-hormonal therapies among breast cancer patients post surgery, and offer guidance on the decisions a practitioner must make when implementing these estimators.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_19711
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data Pryce, Matthew Diaz-Ordaz, Karla Keogh, Ruth H. Vansteelandt, Stijn Machine Learning When estimating heterogeneous treatment effects, missing outcome data can complicate treatment effect estimation, causing certain subgroups of the population to be poorly represented. In this work, we discuss this commonly overlooked problem and consider the impact that missing at random (MAR) outcome data has on causal machine learning estimators for the conditional average treatment effect (CATE). We propose two de-biased machine learning estimators for the CATE, the mDR-learner and mEP-learner, which address the issue of under-representation by integrating inverse probability of censoring weights into the DR-learner and EP-learner respectively. We show that under reasonable conditions, these estimators are oracle efficient, and illustrate their favorable performance through simulated data settings, comparing them to existing CATE estimators, including comparison to estimators which use common missing data techniques. We present an example of their application using the GBSG2 trial, exploring treatment effect heterogeneity when comparing hormonal therapies to non-hormonal therapies among breast cancer patients post surgery, and offer guidance on the decisions a practitioner must make when implementing these estimators.
title	Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data
topic	Machine Learning
url	https://arxiv.org/abs/2412.19711

Similar Items