Bewaard in:
Bibliografische gegevens
Hoofdauteurs: Tan, Andong, Zhou, Fengtao, Chen, Hao
Formaat: Preprint
Gepubliceerd in: 2024
Onderwerpen:
Online toegang:https://arxiv.org/abs/2406.03421
Tags: Voeg label toe
Geen labels, Wees de eerste die dit record labelt!
_version_ 1866909217622851584
author Tan, Andong
Zhou, Fengtao
Chen, Hao
author_facet Tan, Andong
Zhou, Fengtao
Chen, Hao
contents Post-hoc explainability methods such as Grad-CAM are popular because they do not influence the performance of a trained model. However, they mainly reveal "where" a model looks at for a given input, fail to explain "what" the model looks for (e.g., what is important to classify a bird image to a Scott Oriole?). Existing part-prototype networks leverage part-prototypes (e.g., characteristic Scott Oriole's wing and head) to answer both "where" and "what", but often under-perform their black box counterparts in the accuracy. Therefore, a natural question is: can one construct a network that answers both "where" and "what" in a post-hoc manner to guarantee the model's performance? To this end, we propose the first post-hoc part-prototype network via decomposing the classification head of a trained model into a set of interpretable part-prototypes. Concretely, we propose an unsupervised prototype discovery and refining strategy to obtain prototypes that can precisely reconstruct the classification head, yet being interpretable. Besides guaranteeing the performance, we show that our network offers more faithful explanations qualitatively and yields even better part-prototypes quantitatively than prior part-prototype networks.
format Preprint
id arxiv_https___arxiv_org_abs_2406_03421
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Post-hoc Part-prototype Networks
Tan, Andong
Zhou, Fengtao
Chen, Hao
Computer Vision and Pattern Recognition
Post-hoc explainability methods such as Grad-CAM are popular because they do not influence the performance of a trained model. However, they mainly reveal "where" a model looks at for a given input, fail to explain "what" the model looks for (e.g., what is important to classify a bird image to a Scott Oriole?). Existing part-prototype networks leverage part-prototypes (e.g., characteristic Scott Oriole's wing and head) to answer both "where" and "what", but often under-perform their black box counterparts in the accuracy. Therefore, a natural question is: can one construct a network that answers both "where" and "what" in a post-hoc manner to guarantee the model's performance? To this end, we propose the first post-hoc part-prototype network via decomposing the classification head of a trained model into a set of interpretable part-prototypes. Concretely, we propose an unsupervised prototype discovery and refining strategy to obtain prototypes that can precisely reconstruct the classification head, yet being interpretable. Besides guaranteeing the performance, we show that our network offers more faithful explanations qualitatively and yields even better part-prototypes quantitatively than prior part-prototype networks.
title Post-hoc Part-prototype Networks
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2406.03421