Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Wenzhang, Jin, Lianjun, Ren, Lu, Mu, Chaoxu, Sun, Changyin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2501.14543
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916581841305600
author	Liu, Wenzhang Jin, Lianjun Ren, Lu Mu, Chaoxu Sun, Changyin
author_facet	Liu, Wenzhang Jin, Lianjun Ren, Lu Mu, Chaoxu Sun, Changyin
contents	Intelligent decision-making within large and redundant action spaces remains challenging in deep reinforcement learning. Considering similar but ineffective actions at each step can lead to repetitive and unproductive trials. Existing methods attempt to improve agent exploration by reducing or penalizing redundant actions, yet they fail to provide quantitative and reliable evidence to determine redundancy. In this paper, we propose a method to improve exploration efficiency by estimating the causal effects of actions. Unlike prior methods, our approach offers quantitative results regarding the causality of actions for one-step transitions. We first pre-train an inverse dynamics model to serve as prior knowledge of the environment. Subsequently, we classify actions across the entire action space at each time step and estimate the causal effect of each action to suppress redundant actions during exploration. We provide a theoretical analysis to demonstrate the effectiveness of our method and present empirical results from simulations in environments with redundant actions to evaluate its performance. Our implementation is available at https://github.com/agi-brain/cee.git.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_14543
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation Liu, Wenzhang Jin, Lianjun Ren, Lu Mu, Chaoxu Sun, Changyin Machine Learning Intelligent decision-making within large and redundant action spaces remains challenging in deep reinforcement learning. Considering similar but ineffective actions at each step can lead to repetitive and unproductive trials. Existing methods attempt to improve agent exploration by reducing or penalizing redundant actions, yet they fail to provide quantitative and reliable evidence to determine redundancy. In this paper, we propose a method to improve exploration efficiency by estimating the causal effects of actions. Unlike prior methods, our approach offers quantitative results regarding the causality of actions for one-step transitions. We first pre-train an inverse dynamics model to serve as prior knowledge of the environment. Subsequently, we classify actions across the entire action space at each time step and estimate the causal effect of each action to suppress redundant actions during exploration. We provide a theoretical analysis to demonstrate the effectiveness of our method and present empirical results from simulations in environments with redundant actions to evaluate its performance. Our implementation is available at https://github.com/agi-brain/cee.git.
title	Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
topic	Machine Learning
url	https://arxiv.org/abs/2501.14543

Similar Items