Saved in:
Bibliographic Details
Main Authors: Najafi, Mohammad Hossein, Morsali, Mohammad, Pashanejad, Mohammadreza, Roudi, Saman Soleimani, Norouzi, Mohammad, Shouraki, Saeed Bagheri
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.05483
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Deep neural networks for medical image classification often fail to generalize consistently in clinical practice due to violations of the i.i.d. assumption and opaque decision-making. This paper examines interpretability in deep neural networks fine-tuned for fracture detection by evaluating model performance against adversarial attack and comparing interpretability methods to fracture regions annotated by an orthopedic surgeon. Our findings prove that robust models yield explanations more aligned with clinically meaningful areas, indicating that robustness encourages anatomically relevant feature prioritization. We emphasize the value of interpretability for facilitating human-AI collaboration, in which models serve as assistants under a human-in-the-loop paradigm: clinically plausible explanations foster trust, enable error correction, and discourage reliance on AI for high-stakes decisions. This paper investigates robustness and interpretability as complementary benchmarks for bridging the gap between benchmark performance and safe, actionable clinical deployment.