Saved in:
Bibliographic Details
Main Authors: Zhang, Xinyi, Wang, Linbo, Volgushev, Stanislav, Kong, Dehan
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2203.09330
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910615311745024
author Zhang, Xinyi
Wang, Linbo
Volgushev, Stanislav
Kong, Dehan
author_facet Zhang, Xinyi
Wang, Linbo
Volgushev, Stanislav
Kong, Dehan
contents Instrumental variable methods provide useful tools for inferring causal effects in the presence of unmeasured confounding. To apply these methods with large-scale data sets, a major challenge is to find valid instruments from a possibly large candidate set. In practice, most of the candidate instruments are often not relevant for studying a particular exposure of interest. Moreover, not all relevant candidate instruments are valid as they may directly influence the outcome of interest. In this article, we propose a data-driven method for causal inference with many candidate instruments that addresses these two challenges simultaneously. A key component of our proposal involves using pseudo variables, known to be irrelevant, to remove variables from the original set that exhibit spurious correlations with the exposure. Synthetic data analyses show that the proposed method performs favourably compared to existing methods. We apply our method to a Mendelian randomization study estimating the effect of obesity on health-related quality of life.
format Preprint
id arxiv_https___arxiv_org_abs_2203_09330
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Fighting Noise with Noise: Causal Inference with Many Candidate Instruments
Zhang, Xinyi
Wang, Linbo
Volgushev, Stanislav
Kong, Dehan
Methodology
Instrumental variable methods provide useful tools for inferring causal effects in the presence of unmeasured confounding. To apply these methods with large-scale data sets, a major challenge is to find valid instruments from a possibly large candidate set. In practice, most of the candidate instruments are often not relevant for studying a particular exposure of interest. Moreover, not all relevant candidate instruments are valid as they may directly influence the outcome of interest. In this article, we propose a data-driven method for causal inference with many candidate instruments that addresses these two challenges simultaneously. A key component of our proposal involves using pseudo variables, known to be irrelevant, to remove variables from the original set that exhibit spurious correlations with the exposure. Synthetic data analyses show that the proposed method performs favourably compared to existing methods. We apply our method to a Mendelian randomization study estimating the effect of obesity on health-related quality of life.
title Fighting Noise with Noise: Causal Inference with Many Candidate Instruments
topic Methodology
url https://arxiv.org/abs/2203.09330