Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Xiao, Yanan, Tang, Yixiang, Feng, Zechen, Jiang, Lu, Yin, Minghao, Wang, Pengyang
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.09419
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909030177308672
author	Xiao, Yanan Tang, Yixiang Feng, Zechen Jiang, Lu Yin, Minghao Wang, Pengyang
author_facet	Xiao, Yanan Tang, Yixiang Feng, Zechen Jiang, Lu Yin, Minghao Wang, Pengyang
contents	While experience replay is essential for data efficiency in reinforcement learning (RL), standard methods treat the replay buffer as a passive memory system, prioritizing samples based on numerical prediction errors rather than their semantic significance. This approach stands in contrast to human learning, which accelerates mastery by actively abstracting fragmented experiences into behavioral rules. To bridge this gap, we propose Neuro-Symbolic Experience Replay (NSER), a framework that transforms experience replay from a passive sample reuse mechanism into an active engine for knowledge construction. Specifically, NSER addresses the incompatibility between linguistic reasoning and numerical optimization through a novel neuro-symbolic grounding pipeline. It leverages Large Language Models (LLMs) in a zero-shot manner to induce candidate behavioral rules from accumulated trajectories, grounds these insights into differentiable first-order logic representations, and utilizes the resulting symbolic structures to dynamically reweight the replay distribution. By allowing abstract knowledge to directly shape policy optimization, NSER achieves consistent superior sample efficiency and convergence speed across reactive, rule-based, and procedural benchmarks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_09419
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	From Passive Reuse to Active Reasoning: Grounding Large Language Models for Neuro-Symbolic Experience Replay Xiao, Yanan Tang, Yixiang Feng, Zechen Jiang, Lu Yin, Minghao Wang, Pengyang Artificial Intelligence While experience replay is essential for data efficiency in reinforcement learning (RL), standard methods treat the replay buffer as a passive memory system, prioritizing samples based on numerical prediction errors rather than their semantic significance. This approach stands in contrast to human learning, which accelerates mastery by actively abstracting fragmented experiences into behavioral rules. To bridge this gap, we propose Neuro-Symbolic Experience Replay (NSER), a framework that transforms experience replay from a passive sample reuse mechanism into an active engine for knowledge construction. Specifically, NSER addresses the incompatibility between linguistic reasoning and numerical optimization through a novel neuro-symbolic grounding pipeline. It leverages Large Language Models (LLMs) in a zero-shot manner to induce candidate behavioral rules from accumulated trajectories, grounds these insights into differentiable first-order logic representations, and utilizes the resulting symbolic structures to dynamically reweight the replay distribution. By allowing abstract knowledge to directly shape policy optimization, NSER achieves consistent superior sample efficiency and convergence speed across reactive, rule-based, and procedural benchmarks.
title	From Passive Reuse to Active Reasoning: Grounding Large Language Models for Neuro-Symbolic Experience Replay
topic	Artificial Intelligence
url	https://arxiv.org/abs/2605.09419

Similar Items