Saved in:
Bibliographic Details
Main Authors: Abbasi, Ali, Taghipour, Mehdi, Beheshti, Rahmatollah
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.12498
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915796197834752
author Abbasi, Ali
Taghipour, Mehdi
Beheshti, Rahmatollah
author_facet Abbasi, Ali
Taghipour, Mehdi
Beheshti, Rahmatollah
contents Negation is a fundamental linguistic operation in clinical reporting, yet vision-language models (VLMs) frequently fail to distinguish affirmative from negated medical statements. To systematically characterize this limitation, we introduce a radiology-specific diagnostic benchmark that evaluates polarity sensitivity under controlled clinical conditions, revealing that common medical VLMs consistently confuse negated and non-negated findings. To enable learning beyond simple condition absence, we further construct a contextual clinical negation dataset that encodes structured claims and supports attribute-level negations involving location and severity. Building on these resources, we propose Negation-Aware Selective Training (NAST), an interpretability-guided adaptation method that uses causal tracing effects (CTEs) to modulate layer-wise gradient updates during fine-tuning. Rather than applying uniform learning rates, NAST scales each layer's update according to its causal contribution to negation processing, transforming mechanistic interpretability signals into a principled optimization rule. Experiments demonstrate improved discrimination of affirmative and negated clinical statements without degrading general vision-language alignment, highlighting the value of causal interpretability for targeted model adaptation in safety-critical medical settings. Code and resources are available at https://github.com/healthylaife/NAST.
format Preprint
id arxiv_https___arxiv_org_abs_2602_12498
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Layer-Specific Fine-Tuning for Improved Negation Handling in Medical Vision-Language Models
Abbasi, Ali
Taghipour, Mehdi
Beheshti, Rahmatollah
Computer Vision and Pattern Recognition
Negation is a fundamental linguistic operation in clinical reporting, yet vision-language models (VLMs) frequently fail to distinguish affirmative from negated medical statements. To systematically characterize this limitation, we introduce a radiology-specific diagnostic benchmark that evaluates polarity sensitivity under controlled clinical conditions, revealing that common medical VLMs consistently confuse negated and non-negated findings. To enable learning beyond simple condition absence, we further construct a contextual clinical negation dataset that encodes structured claims and supports attribute-level negations involving location and severity. Building on these resources, we propose Negation-Aware Selective Training (NAST), an interpretability-guided adaptation method that uses causal tracing effects (CTEs) to modulate layer-wise gradient updates during fine-tuning. Rather than applying uniform learning rates, NAST scales each layer's update according to its causal contribution to negation processing, transforming mechanistic interpretability signals into a principled optimization rule. Experiments demonstrate improved discrimination of affirmative and negated clinical statements without degrading general vision-language alignment, highlighting the value of causal interpretability for targeted model adaptation in safety-critical medical settings. Code and resources are available at https://github.com/healthylaife/NAST.
title Layer-Specific Fine-Tuning for Improved Negation Handling in Medical Vision-Language Models
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.12498