MARC21 :: Library Catalog

Bibliographic Details
Main Authors:	Lin, Huifeng; Su, Gang; Liang, Jintao; Wu, You; Zhao, Rui; Li, Ziyue
Format:	Preprint
Published:	2025
Subjects:	Information Retrieval
Online Access:	https://arxiv.org/abs/2509.04820

_version_	1866911138992619520
author	Lin, Huifeng; Su, Gang; Liang, Jintao; Wu, You; Zhao, Rui; Li, Ziyue
author_facet	Lin, Huifeng; Su, Gang; Liang, Jintao; Wu, You; Zhao, Rui; Li, Ziyue
contents	Retrieval-Augmented Generation (RAG) based on Large Language Models (LLMs) is a powerful solution to understand and query the industry's closed-source documents. However, basic RAG often struggles with complex QA tasks in legal and regulatory domains, particularly when dealing with numerous government documents. The top-$k$ strategy frequently misses golden chunks, leading to incomplete or inaccurate answers. To address these retrieval bottlenecks, we explore two strategies to improve evidence coverage and answer quality. The first is a One-SHOT retrieval method that adaptively selects chunks based on a token budget, allowing as much relevant content as possible to be included within the model's context window. Additionally, we design modules to further filter and refine the chunks. The second is an iterative retrieval strategy built on a Reasoning Agentic RAG framework, where a reasoning LLM dynamically issues search queries, evaluates retrieved results, and progressively refines the context over multiple turns. We identify query drift and retrieval laziness issues and further design two modules to tackle them. Through extensive experiments on a dataset of government documents, we aim to offer practical insights and guidance for real-world applications in legal and regulatory domains.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_04820
institution	arXiv
publishDate	2025
record_format	arxiv
title	Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation
topic	Information Retrieval
url	https://arxiv.org/abs/2509.04820
