Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chang, Ge, Su, Jinbo, Liu, Jiacheng, Yang, Pengfei, Shang, Yuhao, Zheng, Huiwen, Ma, Hongli, Liang, Yan, Li, Yuanchun, Liu, Yunxin
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.03323
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910157185744896
author	Chang, Ge Su, Jinbo Liu, Jiacheng Yang, Pengfei Shang, Yuhao Zheng, Huiwen Ma, Hongli Liang, Yan Li, Yuanchun Liu, Yunxin
author_facet	Chang, Ge Su, Jinbo Liu, Jiacheng Yang, Pengfei Shang, Yuhao Zheng, Huiwen Ma, Hongli Liang, Yan Li, Yuanchun Liu, Yunxin
contents	Integrating textual graphs into Large Language Models (LLMs) is promising for complex graph-based QA. However, a key bottleneck is retrieving informative yet compact subgraphs that fit the LLM context. Existing retrievers often struggle, relying either on shallow embedding similarity or costly interactive policies that require excessive supervision. To address these challenges, we introduce an agentic textual graph reasoning framework featuring an LLM-based retriever trained with synthetic stepwise supervision. Rather than relying on final answer rewards which often yield sparse and unstable signals, we optimize the retriever by evaluating each step against offline-extracted golden subgraphs. Our approach distills golden subgraphs via a specialized data synthesis pipeline to formulate dense rewards, facilitating a two-stage training scheme that effectively learns the interactive graph exploration policy. Based on extensive experiments on three common datasets in comparison with seven strong baselines, our approach achieves an average improvement of 15.6% in accuracy and 17.2% in F1 score. The advantage is even higher in more complicated multi-hop reasoning tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_03323
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Enhancing Agentic Textual Graph Retrieval with Synthetic Stepwise Supervision Chang, Ge Su, Jinbo Liu, Jiacheng Yang, Pengfei Shang, Yuhao Zheng, Huiwen Ma, Hongli Liang, Yan Li, Yuanchun Liu, Yunxin Computation and Language Integrating textual graphs into Large Language Models (LLMs) is promising for complex graph-based QA. However, a key bottleneck is retrieving informative yet compact subgraphs that fit the LLM context. Existing retrievers often struggle, relying either on shallow embedding similarity or costly interactive policies that require excessive supervision. To address these challenges, we introduce an agentic textual graph reasoning framework featuring an LLM-based retriever trained with synthetic stepwise supervision. Rather than relying on final answer rewards which often yield sparse and unstable signals, we optimize the retriever by evaluating each step against offline-extracted golden subgraphs. Our approach distills golden subgraphs via a specialized data synthesis pipeline to formulate dense rewards, facilitating a two-stage training scheme that effectively learns the interactive graph exploration policy. Based on extensive experiments on three common datasets in comparison with seven strong baselines, our approach achieves an average improvement of 15.6% in accuracy and 17.2% in F1 score. The advantage is even higher in more complicated multi-hop reasoning tasks.
title	Enhancing Agentic Textual Graph Retrieval with Synthetic Stepwise Supervision
topic	Computation and Language
url	https://arxiv.org/abs/2510.03323

Similar Items