Staff View: :: Library Catalog

Bibliographic Details
Main Authors:	Burger, Christopher; Walter, Charles; Le, Thai
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Cryptography and Security
Online Access:	https://arxiv.org/abs/2406.15839

_version_	1866915107777282048
author	Burger, Christopher; Walter, Charles; Le, Thai
author_facet	Burger, Christopher; Walter, Charles; Le, Thai
contents	Recent work has investigated the vulnerability of local surrogate methods to adversarial perturbations of a machine learning (ML) model's inputs, in which the explanation is manipulated while the meaning and structure of the original input remain similar under the complex model. Although weaknesses have been shown to exist across many methods, the reasons for them remain little explored. Central to the concept of adversarial attacks on explainable AI (XAI) is the similarity measure used to calculate how one explanation differs from another. A poor choice of similarity measure can lead to erroneous conclusions about the efficacy of an XAI method: too sensitive a measure exaggerates vulnerability, while too coarse a measure understates it. We investigate a variety of similarity measures designed for text-based ranked lists, including Kendall's Tau, Spearman's Footrule, and Rank-biased Overlap, to determine how substantial changes in the type of measure or threshold of success affect the conclusions generated from common adversarial attack processes. Certain measures are found to be overly sensitive, resulting in erroneous estimates of stability.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_15839
institution	arXiv
publishDate	2024
record_format	arxiv
title	The Effect of Similarity Measures on Accurate Stability Estimates for Local Surrogate Models in Text-based Explainable AI
topic	Machine Learning; Cryptography and Security
url	https://arxiv.org/abs/2406.15839
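
The abstract names three similarity measures for ranked lists: Kendall's Tau, Spearman's Footrule, and Rank-biased Overlap (RBO). The following is a minimal Python sketch of how such measures can disagree about the same perturbation. It is not the authors' implementation and does not reproduce the paper's results. The function names, the toy word rankings, the persistence parameter p = 0.9, and the choice to normalise a truncated RBO by its weight sum are all assumptions made for illustration.

```python
from itertools import combinations


def kendall_tau(a, b):
    """Kendall's tau correlation for two rankings of the same items (no ties)."""
    pos_b = {item: i for i, item in enumerate(b)}
    concordant = discordant = 0
    for x, y in combinations(a, 2):  # x is ranked above y in `a`
        if pos_b[x] < pos_b[y]:
            concordant += 1
        else:
            discordant += 1
    return (concordant - discordant) / (concordant + discordant)


def spearman_footrule(a, b):
    """Normalised footrule distance: 0 for identical lists, 1 for maximal displacement."""
    pos_b = {item: i for i, item in enumerate(b)}
    total = sum(abs(i - pos_b[item]) for i, item in enumerate(a))
    n = len(a)
    return total / ((n * n) // 2)  # floor(n^2 / 2) is the largest possible total


def rbo_normalized(a, b, p=0.9):
    """Truncated RBO, divided by its weight sum so identical lists score 1.

    Assumption: this is a simplification, not the extrapolated RBO variant.
    """
    seen_a, seen_b = set(), set()
    score = weight_sum = 0.0
    for depth in range(1, min(len(a), len(b)) + 1):
        seen_a.add(a[depth - 1])
        seen_b.add(b[depth - 1])
        weight = p ** (depth - 1)  # shallower depths receive more weight
        score += weight * len(seen_a & seen_b) / depth
        weight_sum += weight
    return score / weight_sum


# Hypothetical explanation rankings (most important word first).
original = ["great", "movie", "plot", "acting", "boring"]
top_swap = ["movie", "great", "plot", "acting", "boring"]     # top two swapped
bottom_swap = ["great", "movie", "plot", "boring", "acting"]  # bottom two swapped

for name, perturbed in [("top swap", top_swap), ("bottom swap", bottom_swap)]:
    print(
        name,
        round(kendall_tau(original, perturbed), 4),
        round(spearman_footrule(original, perturbed), 4),
        round(rbo_normalized(original, perturbed), 4),
    )
```

In this toy case Kendall's Tau and the footrule assign the same value to both perturbations, because each involves a single adjacent swap. The top-weighted RBO variant scores the top swap as less similar than the bottom swap. An attack judged against a fixed similarity threshold could therefore count as a success under one measure and a failure under another.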