Saved in:
Bibliographic Details
Main Authors: Khan, Ahmad Saeed, Schaffernicht, Erik, Stork, Johannes Andreas
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.20003
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914923428184064
author Khan, Ahmad Saeed
Schaffernicht, Erik
Stork, Johannes Andreas
author_facet Khan, Ahmad Saeed
Schaffernicht, Erik
Stork, Johannes Andreas
contents Estimating treatment effects from observational data is paramount in healthcare, education, and economics, but current deep disentanglement-based methods to address selection bias are insufficiently handling irrelevant variables. We demonstrate in experiments that this leads to prediction errors. We disentangle pre-treatment variables with a deep embedding method and explicitly identify and represent irrelevant variables, additionally to instrumental, confounding and adjustment latent factors. To this end, we introduce a reconstruction objective and create an embedding space for irrelevant variables using an attached autoencoder. Instead of relying on serendipitous suppression of irrelevant variables as in previous deep disentanglement approaches, we explicitly force irrelevant variables into this embedding space and employ orthogonalization to prevent irrelevant information from leaking into the latent space representations of the other factors. Our experiments with synthetic and real-world benchmark datasets show that we can better identify irrelevant variables and more precisely predict treatment effects than previous methods, while prediction quality degrades less when additional irrelevant variables are introduced.
format Preprint
id arxiv_https___arxiv_org_abs_2407_20003
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle On the Effects of Irrelevant Variables in Treatment Effect Estimation with Deep Disentanglement
Khan, Ahmad Saeed
Schaffernicht, Erik
Stork, Johannes Andreas
Machine Learning
Estimating treatment effects from observational data is paramount in healthcare, education, and economics, but current deep disentanglement-based methods to address selection bias are insufficiently handling irrelevant variables. We demonstrate in experiments that this leads to prediction errors. We disentangle pre-treatment variables with a deep embedding method and explicitly identify and represent irrelevant variables, additionally to instrumental, confounding and adjustment latent factors. To this end, we introduce a reconstruction objective and create an embedding space for irrelevant variables using an attached autoencoder. Instead of relying on serendipitous suppression of irrelevant variables as in previous deep disentanglement approaches, we explicitly force irrelevant variables into this embedding space and employ orthogonalization to prevent irrelevant information from leaking into the latent space representations of the other factors. Our experiments with synthetic and real-world benchmark datasets show that we can better identify irrelevant variables and more precisely predict treatment effects than previous methods, while prediction quality degrades less when additional irrelevant variables are introduced.
title On the Effects of Irrelevant Variables in Treatment Effect Estimation with Deep Disentanglement
topic Machine Learning
url https://arxiv.org/abs/2407.20003