Saved in:
Bibliographic Details
Main Authors: Kang, Zilin, Hu, Chenyuan, Luo, Yu, Yuan, Zhecheng, Zheng, Ruijie, Xu, Huazhe
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.02712
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909674925719552
author Kang, Zilin
Hu, Chenyuan
Luo, Yu
Yuan, Zhecheng
Zheng, Ruijie
Xu, Huazhe
author_facet Kang, Zilin
Hu, Chenyuan
Luo, Yu
Yuan, Zhecheng
Zheng, Ruijie
Xu, Huazhe
contents Deep reinforcement learning for continuous control has recently achieved impressive progress. However, existing methods often suffer from primacy bias, a tendency to overfit early experiences stored in the replay buffer, which limits an RL agent's sample efficiency and generalizability. In contrast, humans are less susceptible to such bias, partly due to infantile amnesia, where the formation of new neurons disrupts early memory traces, leading to the forgetting of initial experiences. Inspired by this dual processes of forgetting and growing in neuroscience, in this paper, we propose Forget and Grow (FoG), a new deep RL algorithm with two mechanisms introduced. First, Experience Replay Decay (ER Decay) "forgetting early experience", which balances memory by gradually reducing the influence of early experiences. Second, Network Expansion, "growing neural capacity", which enhances agents' capability to exploit the patterns of existing data by dynamically adding new parameters during training. Empirical results on four major continuous control benchmarks with more than 40 tasks demonstrate the superior performance of FoG against SoTA existing deep RL algorithms, including BRO, SimBa, and TD-MPC2.
format Preprint
id arxiv_https___arxiv_org_abs_2507_02712
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control
Kang, Zilin
Hu, Chenyuan
Luo, Yu
Yuan, Zhecheng
Zheng, Ruijie
Xu, Huazhe
Machine Learning
Deep reinforcement learning for continuous control has recently achieved impressive progress. However, existing methods often suffer from primacy bias, a tendency to overfit early experiences stored in the replay buffer, which limits an RL agent's sample efficiency and generalizability. In contrast, humans are less susceptible to such bias, partly due to infantile amnesia, where the formation of new neurons disrupts early memory traces, leading to the forgetting of initial experiences. Inspired by this dual processes of forgetting and growing in neuroscience, in this paper, we propose Forget and Grow (FoG), a new deep RL algorithm with two mechanisms introduced. First, Experience Replay Decay (ER Decay) "forgetting early experience", which balances memory by gradually reducing the influence of early experiences. Second, Network Expansion, "growing neural capacity", which enhances agents' capability to exploit the patterns of existing data by dynamically adding new parameters during training. Empirical results on four major continuous control benchmarks with more than 40 tasks demonstrate the superior performance of FoG against SoTA existing deep RL algorithms, including BRO, SimBa, and TD-MPC2.
title A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control
topic Machine Learning
url https://arxiv.org/abs/2507.02712