Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Whitecross, Kyle, Rahimi, Negin
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence Information Retrieval Machine Learning I.2.7
Online Access:	https://arxiv.org/abs/2604.09494
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918438926024704
author	Whitecross, Kyle Rahimi, Negin
author_facet	Whitecross, Kyle Rahimi, Negin
contents	We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval supports reasoning, while reasoning often determines what must be retrieved. However, their interaction remains largely underexplored. In preliminary experiments on several open-source LLMs, we observe that in-context retrieval performance substantially degrades even after a short reasoning span, revealing a key bottleneck for test-time scaling that we refer to as lost-in-thought: reasoning steps that improve performance also make subsequent in-context retrieval more challenging. To address this limitation, RecaLLM interleaves reasoning with explicit in-context retrieval, alternating between reasoning and retrieving context information needed to solve intermediate subproblems. We introduce a negligible-overhead constrained decoding mechanism that enables verbatim copying of evidence spans, improving the grounding of subsequent generation. Trained on diverse lexical and semantic retrieval tasks, RecaLLM achieves strong performance on two long-context benchmarks, RULER and HELMET, significantly outperforming baselines. Notably, we observe consistent gains at context windows of up to 128K tokens using training samples of at most 10K tokens, far shorter than those used by existing long-context approaches, highlighting a promising path toward improving long-context performance without expensive long-context training data.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_09494
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval Whitecross, Kyle Rahimi, Negin Computation and Language Artificial Intelligence Information Retrieval Machine Learning I.2.7 We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval supports reasoning, while reasoning often determines what must be retrieved. However, their interaction remains largely underexplored. In preliminary experiments on several open-source LLMs, we observe that in-context retrieval performance substantially degrades even after a short reasoning span, revealing a key bottleneck for test-time scaling that we refer to as lost-in-thought: reasoning steps that improve performance also make subsequent in-context retrieval more challenging. To address this limitation, RecaLLM interleaves reasoning with explicit in-context retrieval, alternating between reasoning and retrieving context information needed to solve intermediate subproblems. We introduce a negligible-overhead constrained decoding mechanism that enables verbatim copying of evidence spans, improving the grounding of subsequent generation. Trained on diverse lexical and semantic retrieval tasks, RecaLLM achieves strong performance on two long-context benchmarks, RULER and HELMET, significantly outperforming baselines. Notably, we observe consistent gains at context windows of up to 128K tokens using training samples of at most 10K tokens, far shorter than those used by existing long-context approaches, highlighting a promising path toward improving long-context performance without expensive long-context training data.
title	RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval
topic	Computation and Language Artificial Intelligence Information Retrieval Machine Learning I.2.7
url	https://arxiv.org/abs/2604.09494

Similar Items