Staff View: DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation :: Library Catalog

Bibliographic Details
Main Authors:	Wu, Yinjun; Keoliya, Mayank; Chen, Kan; Velingker, Neelay; Li, Ziyang; Getzen, Emily J; Long, Qi; Naik, Mayur; Parikh, Ravi B; Wong, Eric
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Methodology
Online Access:	https://arxiv.org/abs/2406.00611

_version_	1866929369705873408
author	Wu, Yinjun; Keoliya, Mayank; Chen, Kan; Velingker, Neelay; Li, Ziyang; Getzen, Emily J; Long, Qi; Naik, Mayur; Parikh, Ravi B; Wong, Eric
author_facet	Wu, Yinjun; Keoliya, Mayank; Chen, Kan; Velingker, Neelay; Li, Ziyang; Getzen, Emily J; Long, Qi; Naik, Mayur; Parikh, Ravi B; Wong, Eric
contents	Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect (ITE) estimation. ITE prediction models deployed in critical settings such as healthcare should ideally (i) be accurate and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can double as database queries that identify similar subgroups of samples. We provide a novel reinforcement learning (RL) algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_00611
institution	arXiv
publishDate	2024
record_format	arxiv
title	DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
topic	Machine Learning Methodology
url	https://arxiv.org/abs/2406.00611

Similar Items