Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ali, Mohammed, Abdallah, Abdelrahman, Agarwal, Amit, Patel, Hitesh Laxmichand, Jatowt, Adam
Format:	Preprint
Published:	2026
Subjects:	Information Retrieval
Online Access:	https://arxiv.org/abs/2601.05461
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911362780758016
author	Ali, Mohammed Abdallah, Abdelrahman Agarwal, Amit Patel, Hitesh Laxmichand Jatowt, Adam
author_facet	Ali, Mohammed Abdallah, Abdelrahman Agarwal, Amit Patel, Hitesh Laxmichand Jatowt, Adam
contents	Existing benchmarks treat multi-turn conversation and reasoning-intensive retrieval separately, yet real-world information seeking requires both. To bridge this gap, we present a benchmark for reasoning-based conversational information retrieval comprising 707 conversations (2,971 turns) across eleven domains. To ensure quality, our Decomposition-and-Verification framework transforms complex queries into fact-grounded multi-turn dialogues through multi-level validation, where atomic facts are verified against sources and explicit retrieval reasoning is generated for each turn. Comprehensive evaluation reveals that combining conversation history with reasoning doubles retrieval performance (Baseline .236 $\rightarrow$ History+Reasoning .479 nDCG@10), while reasoning-specialized models substantially outperform dense encoders. Despite these gains, further analysis highlights that implicit reasoning remains challenging, particularly when logical connections are not explicitly stated in the text.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_05461
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	RECOR: Reasoning-focused Multi-turn Conversational Retrieval Benchmark Ali, Mohammed Abdallah, Abdelrahman Agarwal, Amit Patel, Hitesh Laxmichand Jatowt, Adam Information Retrieval Existing benchmarks treat multi-turn conversation and reasoning-intensive retrieval separately, yet real-world information seeking requires both. To bridge this gap, we present a benchmark for reasoning-based conversational information retrieval comprising 707 conversations (2,971 turns) across eleven domains. To ensure quality, our Decomposition-and-Verification framework transforms complex queries into fact-grounded multi-turn dialogues through multi-level validation, where atomic facts are verified against sources and explicit retrieval reasoning is generated for each turn. Comprehensive evaluation reveals that combining conversation history with reasoning doubles retrieval performance (Baseline .236 $\rightarrow$ History+Reasoning .479 nDCG@10), while reasoning-specialized models substantially outperform dense encoders. Despite these gains, further analysis highlights that implicit reasoning remains challenging, particularly when logical connections are not explicitly stated in the text.
title	RECOR: Reasoning-focused Multi-turn Conversational Retrieval Benchmark
topic	Information Retrieval
url	https://arxiv.org/abs/2601.05461

Similar Items