Saved in:
Bibliographic Details
Main Authors: Ma, Baoqiang, Madzia-Madzou, Djennifer K., Kraaijveld, Rosa C. J., Ouyang, Jin
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.16034
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908973992509440
author Ma, Baoqiang
Madzia-Madzou, Djennifer K.
Kraaijveld, Rosa C. J.
Ouyang, Jin
author_facet Ma, Baoqiang
Madzia-Madzou, Djennifer K.
Kraaijveld, Rosa C. J.
Ouyang, Jin
contents For head and neck cancer (HNC) patients, prognostic outcome prediction can support personalized treatment strategy selection. Improving prediction performance of HNC outcomes has been extensively explored by using advanced artificial intelligence (AI) techniques on PET/CT data. However, the interpretability of AI remains a critical obstacle for its clinical adoption. Unlike previous HNC studies that empirically selected explainable AI (XAI) techniques, we are the first to comprehensively evaluate and rank 13 XAI methods across 24 metrics, covering faithfulness, robustness, complexity and plausibility. Experimental results on the multi-center HECKTOR challenge dataset show large variations across evaluation aspects among different XAI methods, with Integrated Gradients (IG) and DeepLIFT (DL) consistently obtained high rankings for faithfulness, complexity and plausibility. This work highlights the importance of comprehensive XAI method evaluation and can be extended to other medical imaging tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2604_16034
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Ranking XAI Methods for Head and Neck Cancer Outcome Prediction
Ma, Baoqiang
Madzia-Madzou, Djennifer K.
Kraaijveld, Rosa C. J.
Ouyang, Jin
Computer Vision and Pattern Recognition
Data Analysis, Statistics and Probability
For head and neck cancer (HNC) patients, prognostic outcome prediction can support personalized treatment strategy selection. Improving prediction performance of HNC outcomes has been extensively explored by using advanced artificial intelligence (AI) techniques on PET/CT data. However, the interpretability of AI remains a critical obstacle for its clinical adoption. Unlike previous HNC studies that empirically selected explainable AI (XAI) techniques, we are the first to comprehensively evaluate and rank 13 XAI methods across 24 metrics, covering faithfulness, robustness, complexity and plausibility. Experimental results on the multi-center HECKTOR challenge dataset show large variations across evaluation aspects among different XAI methods, with Integrated Gradients (IG) and DeepLIFT (DL) consistently obtained high rankings for faithfulness, complexity and plausibility. This work highlights the importance of comprehensive XAI method evaluation and can be extended to other medical imaging tasks.
title Ranking XAI Methods for Head and Neck Cancer Outcome Prediction
topic Computer Vision and Pattern Recognition
Data Analysis, Statistics and Probability
url https://arxiv.org/abs/2604.16034