Staff View: Enhancing Visual Feature Attribution via Weighted Integrated Gradients :: Library Catalog

Bibliographic Details
Main Authors:	Tuan, Kien Tran Duc; Trong, Tam Nguyen; Hoang, Son Nguyen; Than, Khoat; Duc, Anh Nguyen
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.03201

_version_	1866912719847817216
author	Tuan, Kien Tran Duc; Trong, Tam Nguyen; Hoang, Son Nguyen; Than, Khoat; Duc, Anh Nguyen
author_facet	Tuan, Kien Tran Duc; Trong, Tam Nguyen; Hoang, Son Nguyen; Than, Khoat; Duc, Anh Nguyen
contents	Integrated Gradients (IG) is a widely used attribution method in explainable AI, particularly in computer vision applications where reliable feature attribution is essential. A key limitation of IG is its sensitivity to the choice of baseline (reference) images. Multi-baseline extensions such as Expected Gradients (EG) assume uniform weighting over baselines, implicitly treating baseline images as equally informative. In high-dimensional vision models, this assumption often leads to noisy or unstable explanations. This paper proposes Weighted Integrated Gradients (WG), a principled approach that evaluates and weights baselines to enhance attribution reliability. WG introduces an unsupervised criterion for baseline suitability, enabling adaptive selection and weighting of baselines on a per-input basis. The method not only preserves core axiomatic properties of IG but also provides improved theoretical guarantees on the quality of explanation over EG. Experiments on commonly used image datasets and models show that WG consistently outperforms EG, yielding 10 to 35 percent improvements in attribution fidelity. WG further identifies informative baseline subsets, reducing unnecessary variability while maintaining high attribution accuracy. By moving beyond the idea that all baselines matter equally, Weighted Integrated Gradients offers a clearer and more reliable way to explain computer-vision models, improving both understanding and practical usability in explainable AI.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_03201
institution	arXiv
publishDate	2025
record_format	arxiv
title	Enhancing Visual Feature Attribution via Weighted Integrated Gradients
topic	Machine Learning
url	https://arxiv.org/abs/2505.03201
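
Illustrative note: the abstract in the contents field contrasts uniform averaging over baselines with per-baseline weighting. The sketch below is a minimal illustration of that contrast only, not the authors' implementation. The toy model, its weight vector W, the function names, and the example weights are all assumptions made for illustration; the record does not state the paper's baseline-suitability criterion, so the weights are simply passed in by the caller.

```python
import numpy as np

# Toy differentiable model (hypothetical): f(x) = tanh(W . x)
W = np.array([0.5, -0.25, 1.0])


def model(x):
    """Scalar output of the toy model for a 1-D input."""
    return np.tanh(x @ W)


def model_grad(x):
    """Analytic gradient of the toy model with respect to a 1-D input."""
    return (1.0 - np.tanh(x @ W) ** 2) * W


def integrated_gradients(x, baseline, steps=256):
    """Integrated Gradients for one baseline, using a midpoint Riemann sum."""
    alphas = (np.arange(steps) + 0.5) / steps            # midpoints in (0, 1)
    path = baseline + alphas[:, None] * (x - baseline)   # straight-line path
    grads = np.stack([model_grad(p) for p in path])      # gradient at each point
    return (x - baseline) * grads.mean(axis=0)


def weighted_ig(x, baselines, weights, steps=256):
    """Weighted average of per-baseline IG attributions.

    The weights are normalized to sum to 1. How to choose them is left to
    the caller; this sketch does not implement any selection criterion.
    """
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()
    per_baseline = np.stack(
        [integrated_gradients(x, b, steps) for b in baselines]
    )
    return weights @ per_baseline


x = np.array([1.0, 2.0, 0.5])
baselines = np.array([[0.0, 0.0, 0.0],
                      [0.5, 0.5, 0.5],
                      [1.0, 0.0, 0.0]])

# Uniform weights: every baseline counts equally.
uniform_attr = weighted_ig(x, baselines, np.ones(len(baselines)))

# Non-uniform weights (arbitrary illustrative values).
custom_weights = np.array([0.6, 0.3, 0.1])
custom_attr = weighted_ig(x, baselines, custom_weights)
```

Because each per-baseline attribution sums (approximately) to the model output at the input minus the output at that baseline, the weighted attributions in this sketch sum to the output at the input minus the weighted average of the baseline outputs, for any normalized weights.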

Similar Items