Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Lehui, Wang, Ruining, Song, Haochen, Mao, Yaoxin, Zhang, Tong, Wang, Yuyao, Fan, Jiayi, Zhang, Yitong, Ye, Jieping, Zhang, Chengqi, Gong, Yongshun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.01801
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918366375051264
author	Li, Lehui Wang, Ruining Song, Haochen Mao, Yaoxin Zhang, Tong Wang, Yuyao Fan, Jiayi Zhang, Yitong Ye, Jieping Zhang, Chengqi Gong, Yongshun
author_facet	Li, Lehui Wang, Ruining Song, Haochen Mao, Yaoxin Zhang, Tong Wang, Yuyao Fan, Jiayi Zhang, Yitong Ye, Jieping Zhang, Chengqi Gong, Yongshun
contents	Automated paper reproduction -- generating executable code from academic papers -- is bottlenecked not by information retrieval but by the tacit knowledge that papers inevitably leave implicit. We formalize this challenge as the progressive recovery of three types of tacit knowledge -- relational, somatic, and collective -- and propose \method, a graph-based agent framework with a dedicated mechanism for each: node-level relation-aware aggregation recovers relational knowledge by analyzing implementation-unit-level reuse and adaptation relationships between the target paper and its citation neighbors; execution-feedback refinement recovers somatic knowledge through iterative debugging driven by runtime signals; and graph-level knowledge induction distills collective knowledge from clusters of papers sharing similar implementations. On an extended ReproduceBench spanning 3 domains, 10 tasks, and 40 recent papers, \method{} achieves an average performance gap of 10.04\% against official implementations, improving over the strongest baseline by 24.68\%. The code will be publicly released upon acceptance; the repository link will be provided in the final version.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_01801
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction Li, Lehui Wang, Ruining Song, Haochen Mao, Yaoxin Zhang, Tong Wang, Yuyao Fan, Jiayi Zhang, Yitong Ye, Jieping Zhang, Chengqi Gong, Yongshun Artificial Intelligence Automated paper reproduction -- generating executable code from academic papers -- is bottlenecked not by information retrieval but by the tacit knowledge that papers inevitably leave implicit. We formalize this challenge as the progressive recovery of three types of tacit knowledge -- relational, somatic, and collective -- and propose \method, a graph-based agent framework with a dedicated mechanism for each: node-level relation-aware aggregation recovers relational knowledge by analyzing implementation-unit-level reuse and adaptation relationships between the target paper and its citation neighbors; execution-feedback refinement recovers somatic knowledge through iterative debugging driven by runtime signals; and graph-level knowledge induction distills collective knowledge from clusters of papers sharing similar implementations. On an extended ReproduceBench spanning 3 domains, 10 tasks, and 40 recent papers, \method{} achieves an average performance gap of 10.04\% against official implementations, improving over the strongest baseline by 24.68\%. The code will be publicly released upon acceptance; the repository link will be provided in the final version.
title	What Papers Don't Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction
topic	Artificial Intelligence
url	https://arxiv.org/abs/2603.01801

Similar Items