Staff View: Layer-Specific Fine-Tuning for Improved Negation Handling in Medical Vision-Language Models :: Library Catalog

Bibliographic Details
Main Authors:	Abbasi, Ali; Taghipour, Mehdi; Beheshti, Rahmatollah
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.12498

_version_	1866915796197834752
author	Abbasi, Ali; Taghipour, Mehdi; Beheshti, Rahmatollah
author_facet	Abbasi, Ali; Taghipour, Mehdi; Beheshti, Rahmatollah
contents	Negation is a fundamental linguistic operation in clinical reporting, yet vision-language models (VLMs) frequently fail to distinguish affirmative from negated medical statements. To systematically characterize this limitation, we introduce a radiology-specific diagnostic benchmark that evaluates polarity sensitivity under controlled clinical conditions, revealing that common medical VLMs consistently confuse negated and non-negated findings. To enable learning beyond simple condition absence, we further construct a contextual clinical negation dataset that encodes structured claims and supports attribute-level negations involving location and severity. Building on these resources, we propose Negation-Aware Selective Training (NAST), an interpretability-guided adaptation method that uses causal tracing effects (CTEs) to modulate layer-wise gradient updates during fine-tuning. Rather than applying uniform learning rates, NAST scales each layer's update according to its causal contribution to negation processing, transforming mechanistic interpretability signals into a principled optimization rule. Experiments demonstrate improved discrimination of affirmative and negated clinical statements without degrading general vision-language alignment, highlighting the value of causal interpretability for targeted model adaptation in safety-critical medical settings. Code and resources are available at https://github.com/healthylaife/NAST.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_12498
institution	arXiv
publishDate	2026
record_format	arxiv
title	Layer-Specific Fine-Tuning for Improved Negation Handling in Medical Vision-Language Models
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.12498

Similar Items