Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cai, Yuanqing, Huang, Ziyi, Liu, Minhao, Duan, Lixin, Li, Wen, Zhang, Yanru
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2605.20128
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916028756262912
author	Cai, Yuanqing Huang, Ziyi Liu, Minhao Duan, Lixin Li, Wen Zhang, Yanru
author_facet	Cai, Yuanqing Huang, Ziyi Liu, Minhao Duan, Lixin Li, Wen Zhang, Yanru
contents	Large language models (LLMs) are increasingly integrated into high-stakes decision-making. Inspired by the theory of \emph{inattentional blindness} in human cognition, we investigate whether LLMs, trained on human-preferred corpora that embed attentional biases, exhibit a similar limitation: \emph{failing to attend to subtle yet important contextual cues under explicit task instructions}. To evaluate this, we introduce the task of \textbf{explicit-implicit reasoning} and present \textbf{MixRea}, a benchmark of 2,246 multiple-choice questions across 9 reasoning types with varying distributions of explicit and implicit information. Evaluation of 21 advanced LLMs shows that even the best-performing reasoning model (Gemini 2.5 Pro) achieves only 42.8\% consistency, revealing widespread inattentional blindness. To mitigate this, we propose \textbf{Potential Relation Completion Prompting (PRCP)}, a prompting method that improves reasoning by recovering overlooked causal relations. Further analysis shows that this limitation persists across diverse multi-source reasoning tasks, highlighting the need for more cognitively aligned models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_20128
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models Cai, Yuanqing Huang, Ziyi Liu, Minhao Duan, Lixin Li, Wen Zhang, Yanru Computation and Language Large language models (LLMs) are increasingly integrated into high-stakes decision-making. Inspired by the theory of \emph{inattentional blindness} in human cognition, we investigate whether LLMs, trained on human-preferred corpora that embed attentional biases, exhibit a similar limitation: \emph{failing to attend to subtle yet important contextual cues under explicit task instructions}. To evaluate this, we introduce the task of \textbf{explicit-implicit reasoning} and present \textbf{MixRea}, a benchmark of 2,246 multiple-choice questions across 9 reasoning types with varying distributions of explicit and implicit information. Evaluation of 21 advanced LLMs shows that even the best-performing reasoning model (Gemini 2.5 Pro) achieves only 42.8\% consistency, revealing widespread inattentional blindness. To mitigate this, we propose \textbf{Potential Relation Completion Prompting (PRCP)}, a prompting method that improves reasoning by recovering overlooked causal relations. Further analysis shows that this limitation persists across diverse multi-source reasoning tasks, highlighting the need for more cognitively aligned models.
title	MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models
topic	Computation and Language
url	https://arxiv.org/abs/2605.20128

Similar Items