Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Du, Mingxuan, Xu, Benfeng, Zhu, Chiwei, Wang, Shaohan, Wang, Pengyu, Wang, Xiaorui, Mao, Zhendong
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.03442
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915770585317376
author	Du, Mingxuan Xu, Benfeng Zhu, Chiwei Wang, Shaohan Wang, Pengyu Wang, Xiaorui Mao, Zhendong
author_facet	Du, Mingxuan Xu, Benfeng Zhu, Chiwei Wang, Shaohan Wang, Pengyu Wang, Xiaorui Mao, Zhendong
contents	Frontier language models have demonstrated strong reasoning and long-horizon tool-use capabilities. However, existing RAG systems fail to leverage these capabilities. They still rely on two paradigms: (1) designing an algorithm that retrieves passages in a single shot and concatenates them into the model's input, or (2) predefining a workflow and prompting the model to execute it step-by-step. Neither paradigm allows the model to participate in retrieval decisions, preventing efficient scaling with model improvements. In this paper, we introduce A-RAG, an Agentic RAG framework that exposes hierarchical retrieval interfaces directly to the model. A-RAG provides three retrieval tools: keyword search, semantic search, and chunk read, enabling the agent to adaptively search and retrieve information across multiple granularities. Experiments on multiple open-domain QA benchmarks show that A-RAG consistently outperforms existing approaches with comparable or lower retrieved tokens, demonstrating that A-RAG effectively leverages model capabilities and dynamically adapts to different RAG tasks. We further systematically study how A-RAG scales with model size and test-time compute. We will release our code and evaluation suite to facilitate future research. Code and evaluation suite are available at https://github.com/Ayanami0730/arag.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_03442
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Du, Mingxuan Xu, Benfeng Zhu, Chiwei Wang, Shaohan Wang, Pengyu Wang, Xiaorui Mao, Zhendong Computation and Language Frontier language models have demonstrated strong reasoning and long-horizon tool-use capabilities. However, existing RAG systems fail to leverage these capabilities. They still rely on two paradigms: (1) designing an algorithm that retrieves passages in a single shot and concatenates them into the model's input, or (2) predefining a workflow and prompting the model to execute it step-by-step. Neither paradigm allows the model to participate in retrieval decisions, preventing efficient scaling with model improvements. In this paper, we introduce A-RAG, an Agentic RAG framework that exposes hierarchical retrieval interfaces directly to the model. A-RAG provides three retrieval tools: keyword search, semantic search, and chunk read, enabling the agent to adaptively search and retrieve information across multiple granularities. Experiments on multiple open-domain QA benchmarks show that A-RAG consistently outperforms existing approaches with comparable or lower retrieved tokens, demonstrating that A-RAG effectively leverages model capabilities and dynamically adapts to different RAG tasks. We further systematically study how A-RAG scales with model size and test-time compute. We will release our code and evaluation suite to facilitate future research. Code and evaluation suite are available at https://github.com/Ayanami0730/arag.
title	A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
topic	Computation and Language
url	https://arxiv.org/abs/2602.03442

Similar Items