Saved in:
Bibliographic Details
Main Authors: Anwar, Saif, Griffiths, Nathan, Bhalerao, Abhir, Popham, Thomas
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.07521
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909249641119744
author Anwar, Saif
Griffiths, Nathan
Bhalerao, Abhir
Popham, Thomas
author_facet Anwar, Saif
Griffiths, Nathan
Bhalerao, Abhir
Popham, Thomas
contents The trustworthiness of Machine Learning (ML) models can be difficult to assess, but is critical in high-risk or ethically sensitive applications. Many models are treated as a `black-box' where the reasoning or criteria for a final decision is opaque to the user. To address this, some existing Explainable AI (XAI) approaches approximate model behaviour using perturbed data. However, such methods have been criticised for ignoring feature dependencies, with explanations being based on potentially unrealistic data. We propose a novel framework, CHILLI, for incorporating data context into XAI by generating contextually aware perturbations, which are faithful to the training data of the base model being explained. This is shown to improve both the soundness and accuracy of the explanations.
format Preprint
id arxiv_https___arxiv_org_abs_2407_07521
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle CHILLI: A data context-aware perturbation method for XAI
Anwar, Saif
Griffiths, Nathan
Bhalerao, Abhir
Popham, Thomas
Machine Learning
Artificial Intelligence
The trustworthiness of Machine Learning (ML) models can be difficult to assess, but is critical in high-risk or ethically sensitive applications. Many models are treated as a `black-box' where the reasoning or criteria for a final decision is opaque to the user. To address this, some existing Explainable AI (XAI) approaches approximate model behaviour using perturbed data. However, such methods have been criticised for ignoring feature dependencies, with explanations being based on potentially unrealistic data. We propose a novel framework, CHILLI, for incorporating data context into XAI by generating contextually aware perturbations, which are faithful to the training data of the base model being explained. This is shown to improve both the soundness and accuracy of the explanations.
title CHILLI: A data context-aware perturbation method for XAI
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2407.07521