Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhu, Chiwei, Xu, Benfeng, Du, Mingxuan, Wang, Shaohan, Wang, Xiaorui, Mao, Zhendong, Zhang, Yongdong
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.01566
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910137533333504
author	Zhu, Chiwei Xu, Benfeng Du, Mingxuan Wang, Shaohan Wang, Xiaorui Mao, Zhendong Zhang, Yongdong
author_facet	Zhu, Chiwei Xu, Benfeng Du, Mingxuan Wang, Shaohan Wang, Xiaorui Mao, Zhendong Zhang, Yongdong
contents	Deep research is emerging as a representative long-horizon task for large language model (LLM) agents. However, long trajectories in deep research often exceed model context limits, compressing token budgets for both evidence collection and report writing, and preventing effective test-time scaling. We introduce FS-Researcher, a file-system-based, dual-agent framework that scales deep research beyond the context window via a persistent workspace. Specifically, a Context Builder agent acts as a librarian which browses the internet, writes structured notes, and archives raw sources into a hierarchical knowledge base that can grow far beyond context length. A Report Writer agent then composes the final report section by section, treating the knowledge base as the source of facts. In this framework, the file system serves as a durable external memory and a shared coordination medium across agents and sessions, enabling iterative refinement beyond the context window. Experiments on two open-ended benchmarks (DeepResearch Bench and DeepConsult) show that FS-Researcher achieves state-of-the-art report quality across different backbone models. Further analyses demonstrate a positive correlation between final report quality and the computation allocated to the Context Builder, validating effective test-time scaling under the file-system paradigm. The code and data are open-sourced at https://github.com/Ignoramus0817/FS-Researcher.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_01566
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents Zhu, Chiwei Xu, Benfeng Du, Mingxuan Wang, Shaohan Wang, Xiaorui Mao, Zhendong Zhang, Yongdong Computation and Language Deep research is emerging as a representative long-horizon task for large language model (LLM) agents. However, long trajectories in deep research often exceed model context limits, compressing token budgets for both evidence collection and report writing, and preventing effective test-time scaling. We introduce FS-Researcher, a file-system-based, dual-agent framework that scales deep research beyond the context window via a persistent workspace. Specifically, a Context Builder agent acts as a librarian which browses the internet, writes structured notes, and archives raw sources into a hierarchical knowledge base that can grow far beyond context length. A Report Writer agent then composes the final report section by section, treating the knowledge base as the source of facts. In this framework, the file system serves as a durable external memory and a shared coordination medium across agents and sessions, enabling iterative refinement beyond the context window. Experiments on two open-ended benchmarks (DeepResearch Bench and DeepConsult) show that FS-Researcher achieves state-of-the-art report quality across different backbone models. Further analyses demonstrate a positive correlation between final report quality and the computation allocated to the Context Builder, validating effective test-time scaling under the file-system paradigm. The code and data are open-sourced at https://github.com/Ignoramus0817/FS-Researcher.
title	FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents
topic	Computation and Language
url	https://arxiv.org/abs/2602.01566

Similar Items