Saved in:
Bibliographic Details
Main Authors: Nguyen, Tuan Dung, Ho, Minh Khoi, Chen, Qi, Xie, Yutong, Cam-Tu, Nguyen, Nguyen, Minh Khoi, Nguyen, Dang Huy Pham, Hengel, Anton van den, Verjans, Johan W., Nguyen, Phi Le, Phan, Vu Minh Hieu
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.04863
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Large vision-language models (LVLMs) achieve strong performance on visual reasoning tasks but remain highly susceptible to hallucination. Existing detection methods predominantly rely on coarse, whole-image measures of how an object token relates to the input image. This global strategy is limited: hallucinated tokens may exhibit weak but widely scattered correlations across many local regions, which aggregate into deceptively high overall relevance, thus evading the current global hallucination detectors. We begin with a simple yet critical observation: a faithful object token must be strongly grounded in a specific image region. Building on this insight, we introduce a patch-level hallucination detection framework that examines fine-grained token-level interactions across model layers. Our analysis uncovers two characteristic signatures of hallucinated tokens: (i) they yield diffuse, non-localized attention patterns, in contrast to the compact, well-focused attention seen in faithful tokens; and (ii) they fail to exhibit meaningful semantic alignment with any visual region. Guided by these findings, we develop a lightweight and interpretable detection method that leverages patch-level statistical features, combined with hidden-layer representations. Our approach achieves up to 90% accuracy in token-level hallucination detection, demonstrating the superiority of fine-grained structural analysis for detecting hallucinations.