Bibliographic Details
Main Authors: Wu, Yinjun; Keoliya, Mayank; Chen, Kan; Velingker, Neelay; Li, Ziyang; Getzen, Emily J; Long, Qi; Naik, Mayur; Parikh, Ravi B; Wong, Eric
Format: Preprint
Published: 2024
Subjects: Machine Learning; Methodology
Online Access: https://arxiv.org/abs/2406.00611
author Wu, Yinjun
Keoliya, Mayank
Chen, Kan
Velingker, Neelay
Li, Ziyang
Getzen, Emily J
Long, Qi
Naik, Mayur
Parikh, Ravi B
Wong, Eric
contents Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect (ITE) estimation. ITE prediction models deployed in critical settings such as healthcare should ideally (i) be accurate and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can serve dually as database queries to identify similar subgroups of samples. We provide a novel RL algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024.
format Preprint
id arxiv_https___arxiv_org_abs_2406_00611
institution arXiv
publishDate 2024
record_format arxiv
title DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
topic Machine Learning
Methodology
url https://arxiv.org/abs/2406.00611