Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yang, Tong, Wang, Yemin, Zhang, Chaoning, Wu, Aming
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.02206
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911430317441024
author	Yang, Tong Wang, Yemin Zhang, Chaoning Wu, Aming
author_facet	Yang, Tong Wang, Yemin Zhang, Chaoning Wu, Aming
contents	The effectiveness of LLM-based agents is often limited not by model capacity alone, but by how efficiently contextual information is utilized at runtime. Existing agent frameworks rely on rigid, syntax-heavy state representations such as nested JSON, which require models to devote a substantial portion of their limited attention to syntactic processing rather than semantic reasoning. In this paper, we propose Fat-Cat, a document-driven agent architecture that improves the signal-to-noise ratio of state management. By integrating three key components: (1) a Semantic File System that represents agent state as Markdown documents aligned with common pre-training corpora, (2) a Textual Strategy Evolution module that accumulates task-solving knowledge without parameter updates, and (3) a Closed-Loop Watcher that monitors reasoning trajectories to reduce hallucinations. Extensive reasoning, retrieval, and coding benchmarks, Fat-Cat consistently improves agent performance. It enables the Kimi-k2 model to outperform the proprietary GPT-4o baseline on HotPotQA. Replacing the document-based state with JSON leads to performance drop, while empirically validating the critical necessity of document-driven state modeling over rigid syntax. The code is available at https://github.com/answeryt/Fat-Cat.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_02206
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning Yang, Tong Wang, Yemin Zhang, Chaoning Wu, Aming Machine Learning The effectiveness of LLM-based agents is often limited not by model capacity alone, but by how efficiently contextual information is utilized at runtime. Existing agent frameworks rely on rigid, syntax-heavy state representations such as nested JSON, which require models to devote a substantial portion of their limited attention to syntactic processing rather than semantic reasoning. In this paper, we propose Fat-Cat, a document-driven agent architecture that improves the signal-to-noise ratio of state management. By integrating three key components: (1) a Semantic File System that represents agent state as Markdown documents aligned with common pre-training corpora, (2) a Textual Strategy Evolution module that accumulates task-solving knowledge without parameter updates, and (3) a Closed-Loop Watcher that monitors reasoning trajectories to reduce hallucinations. Extensive reasoning, retrieval, and coding benchmarks, Fat-Cat consistently improves agent performance. It enables the Kimi-k2 model to outperform the proprietary GPT-4o baseline on HotPotQA. Replacing the document-based state with JSON leads to performance drop, while empirically validating the critical necessity of document-driven state modeling over rigid syntax. The code is available at https://github.com/answeryt/Fat-Cat.
title	Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning
topic	Machine Learning
url	https://arxiv.org/abs/2602.02206

Similar Items