Saved in:
Bibliographic Details
Main Authors: Xiao, Yanan, Tang, Yixiang, Feng, Zechen, Jiang, Lu, Yin, Minghao, Wang, Pengyang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.09419
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909030177308672
author Xiao, Yanan
Tang, Yixiang
Feng, Zechen
Jiang, Lu
Yin, Minghao
Wang, Pengyang
author_facet Xiao, Yanan
Tang, Yixiang
Feng, Zechen
Jiang, Lu
Yin, Minghao
Wang, Pengyang
contents While experience replay is essential for data efficiency in reinforcement learning (RL), standard methods treat the replay buffer as a passive memory system, prioritizing samples based on numerical prediction errors rather than their semantic significance. This approach stands in contrast to human learning, which accelerates mastery by actively abstracting fragmented experiences into behavioral rules. To bridge this gap, we propose Neuro-Symbolic Experience Replay (NSER), a framework that transforms experience replay from a passive sample reuse mechanism into an active engine for knowledge construction. Specifically, NSER addresses the incompatibility between linguistic reasoning and numerical optimization through a novel neuro-symbolic grounding pipeline. It leverages Large Language Models (LLMs) in a zero-shot manner to induce candidate behavioral rules from accumulated trajectories, grounds these insights into differentiable first-order logic representations, and utilizes the resulting symbolic structures to dynamically reweight the replay distribution. By allowing abstract knowledge to directly shape policy optimization, NSER achieves consistent superior sample efficiency and convergence speed across reactive, rule-based, and procedural benchmarks.
format Preprint
id arxiv_https___arxiv_org_abs_2605_09419
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle From Passive Reuse to Active Reasoning: Grounding Large Language Models for Neuro-Symbolic Experience Replay
Xiao, Yanan
Tang, Yixiang
Feng, Zechen
Jiang, Lu
Yin, Minghao
Wang, Pengyang
Artificial Intelligence
While experience replay is essential for data efficiency in reinforcement learning (RL), standard methods treat the replay buffer as a passive memory system, prioritizing samples based on numerical prediction errors rather than their semantic significance. This approach stands in contrast to human learning, which accelerates mastery by actively abstracting fragmented experiences into behavioral rules. To bridge this gap, we propose Neuro-Symbolic Experience Replay (NSER), a framework that transforms experience replay from a passive sample reuse mechanism into an active engine for knowledge construction. Specifically, NSER addresses the incompatibility between linguistic reasoning and numerical optimization through a novel neuro-symbolic grounding pipeline. It leverages Large Language Models (LLMs) in a zero-shot manner to induce candidate behavioral rules from accumulated trajectories, grounds these insights into differentiable first-order logic representations, and utilizes the resulting symbolic structures to dynamically reweight the replay distribution. By allowing abstract knowledge to directly shape policy optimization, NSER achieves consistent superior sample efficiency and convergence speed across reactive, rule-based, and procedural benchmarks.
title From Passive Reuse to Active Reasoning: Grounding Large Language Models for Neuro-Symbolic Experience Replay
topic Artificial Intelligence
url https://arxiv.org/abs/2605.09419