Bibliographic Details
Main Authors: Tuan, Kien Tran Duc; Trong, Tam Nguyen; Hoang, Son Nguyen; Than, Khoat; Duc, Anh Nguyen
Format: Preprint
Published: 2025
Subjects: Machine Learning
Online Access: https://arxiv.org/abs/2505.03201
author Tuan, Kien Tran Duc
Trong, Tam Nguyen
Hoang, Son Nguyen
Than, Khoat
Duc, Anh Nguyen
contents Integrated Gradients (IG) is a widely used attribution method in explainable AI, particularly in computer vision applications where reliable feature attribution is essential. A key limitation of IG is its sensitivity to the choice of baseline (reference) images. Multi-baseline extensions such as Expected Gradients (EG) assume uniform weighting over baselines, implicitly treating baseline images as equally informative. In high-dimensional vision models, this assumption often leads to noisy or unstable explanations. This paper proposes Weighted Integrated Gradients (WG), a principled approach that evaluates and weights baselines to enhance attribution reliability. WG introduces an unsupervised criterion for baseline suitability, enabling adaptive selection and weighting of baselines on a per-input basis. The method not only preserves core axiomatic properties of IG but also provides improved theoretical guarantees on the quality of explanation over EG. Experiments on commonly used image datasets and models show that WG consistently outperforms EG, yielding 10 to 35 percent improvements in attribution fidelity. WG further identifies informative baseline subsets, reducing unnecessary variability while maintaining high attribution accuracy. By moving beyond the idea that all baselines matter equally, Weighted Integrated Gradients offers a clearer and more reliable way to explain computer-vision models, improving both understanding and practical usability in explainable AI.
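The relationship between single-baseline IG, uniform averaging over baselines (as in EG), and a weighted combination can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the toy model f(x) = sum of x_i^2, the helper names `integrated_gradients` and `weighted_attribution`, and above all the weights, which are arbitrary placeholders. The abstract does not specify the baseline-suitability criterion, so this sketch does not attempt to reproduce it.

```python
from typing import Callable, List, Optional, Sequence

Vector = List[float]


def integrated_gradients(
    grad_f: Callable[[Sequence[float]], Vector],
    x: Sequence[float],
    baseline: Sequence[float],
    steps: int = 50,
) -> Vector:
    """Midpoint Riemann-sum approximation of IG for a single baseline."""
    avg_grad = [0.0] * len(x)
    for s in range(steps):
        alpha = (s + 0.5) / steps  # midpoint of the s-th sub-interval of [0, 1]
        point = [b + alpha * (xi - b) for xi, b in zip(x, baseline)]
        grad = grad_f(point)
        for i, g in enumerate(grad):
            avg_grad[i] += g / steps
    # Scale the path-averaged gradient by the input-minus-baseline difference.
    return [(xi - b) * g for xi, b, g in zip(x, baseline, avg_grad)]


def weighted_attribution(
    grad_f: Callable[[Sequence[float]], Vector],
    x: Sequence[float],
    baselines: Sequence[Sequence[float]],
    weights: Optional[Sequence[float]] = None,
    steps: int = 50,
) -> Vector:
    """Combine per-baseline IG maps with normalized weights.

    weights=None gives uniform averaging over the baselines (EG-style).
    Any other weights are hypothetical placeholders, not the paper's criterion.
    """
    if weights is None:
        weights = [1.0] * len(baselines)
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    result = [0.0] * len(x)
    for baseline, w in zip(baselines, weights):
        ig = integrated_gradients(grad_f, x, baseline, steps)
        for i, value in enumerate(ig):
            result[i] += (w / total) * value
    return result


# Toy differentiable model (an assumption): f(x) = sum_i x_i^2, gradient 2x.
def f(x: Sequence[float]) -> float:
    return sum(v * v for v in x)


def grad_f(x: Sequence[float]) -> Vector:
    return [2.0 * v for v in x]


x = [1.0, 2.0]
baselines = [[0.0, 0.0], [1.0, 1.0]]

uniform = weighted_attribution(grad_f, x, baselines)
weighted = weighted_attribution(grad_f, x, baselines, weights=[3.0, 1.0])
```

For any normalized weights, the combined attributions sum to f(x) minus the weighted mean of f over the baselines, which is the multi-baseline form of IG's completeness property. How the weights should be chosen per input is the part the paper contributes and is left open in this sketch.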
format Preprint
id arxiv_https___arxiv_org_abs_2505_03201
institution arXiv
publishDate 2025
record_format arxiv
title Enhancing Visual Feature Attribution via Weighted Integrated Gradients
topic Machine Learning
url https://arxiv.org/abs/2505.03201