Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhu, Haojia, Xu, Qinyuan, Li, Haoyu, Liu, Yuxi, Qiu, Hanchen, Chen, Jiaoyan, Jin, Jiahui
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01355
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917243564064768
author	Zhu, Haojia Xu, Qinyuan Li, Haoyu Liu, Yuxi Qiu, Hanchen Chen, Jiaoyan Jin, Jiahui
author_facet	Zhu, Haojia Xu, Qinyuan Li, Haoyu Liu, Yuxi Qiu, Hanchen Chen, Jiaoyan Jin, Jiahui
contents	Aggregation query over free text is a long-standing yet underexplored problem. Unlike ordinary question answering, aggregate queries require exhaustive evidence collection and systems are required to "find all," not merely "find one." Existing paradigms such as Text-to-SQL and Retrieval-Augmented Generation fail to achieve this completeness. In this work, we formalize entity-level aggregation querying over text in a corpus-bounded setting with strict completeness requirement. To enable principled evaluation, we introduce AGGBench, a benchmark designed to evaluate completeness-oriented aggregation under realistic large-scale corpus. To accompany the benchmark, we propose DFA (Disambiguation--Filtering--Aggregation), a modular agentic baseline that decomposes aggregation querying into interpretable stages and exposes key failure modes related to ambiguity, filtering, and aggregation. Empirical results show that DFA consistently improves aggregation evidence coverage over strong RAG and agentic baselines. The data and code are available in \href{https://anonymous.4open.science/r/DFA-A4C1}.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_01355
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Aggregation Queries over Unstructured Text: Benchmark and Agentic Method Zhu, Haojia Xu, Qinyuan Li, Haoyu Liu, Yuxi Qiu, Hanchen Chen, Jiaoyan Jin, Jiahui Artificial Intelligence Aggregation query over free text is a long-standing yet underexplored problem. Unlike ordinary question answering, aggregate queries require exhaustive evidence collection and systems are required to "find all," not merely "find one." Existing paradigms such as Text-to-SQL and Retrieval-Augmented Generation fail to achieve this completeness. In this work, we formalize entity-level aggregation querying over text in a corpus-bounded setting with strict completeness requirement. To enable principled evaluation, we introduce AGGBench, a benchmark designed to evaluate completeness-oriented aggregation under realistic large-scale corpus. To accompany the benchmark, we propose DFA (Disambiguation--Filtering--Aggregation), a modular agentic baseline that decomposes aggregation querying into interpretable stages and exposes key failure modes related to ambiguity, filtering, and aggregation. Empirical results show that DFA consistently improves aggregation evidence coverage over strong RAG and agentic baselines. The data and code are available in \href{https://anonymous.4open.science/r/DFA-A4C1}.
title	Aggregation Queries over Unstructured Text: Benchmark and Agentic Method
topic	Artificial Intelligence
url	https://arxiv.org/abs/2602.01355

Similar Items