Saved in:
Bibliographic Details
Main Authors: Liu, Wenzhang, Jin, Lianjun, Ren, Lu, Mu, Chaoxu, Sun, Changyin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.14543
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916581841305600
author Liu, Wenzhang
Jin, Lianjun
Ren, Lu
Mu, Chaoxu
Sun, Changyin
author_facet Liu, Wenzhang
Jin, Lianjun
Ren, Lu
Mu, Chaoxu
Sun, Changyin
contents Intelligent decision-making within large and redundant action spaces remains challenging in deep reinforcement learning. Considering similar but ineffective actions at each step can lead to repetitive and unproductive trials. Existing methods attempt to improve agent exploration by reducing or penalizing redundant actions, yet they fail to provide quantitative and reliable evidence to determine redundancy. In this paper, we propose a method to improve exploration efficiency by estimating the causal effects of actions. Unlike prior methods, our approach offers quantitative results regarding the causality of actions for one-step transitions. We first pre-train an inverse dynamics model to serve as prior knowledge of the environment. Subsequently, we classify actions across the entire action space at each time step and estimate the causal effect of each action to suppress redundant actions during exploration. We provide a theoretical analysis to demonstrate the effectiveness of our method and present empirical results from simulations in environments with redundant actions to evaluate its performance. Our implementation is available at https://github.com/agi-brain/cee.git.
format Preprint
id arxiv_https___arxiv_org_abs_2501_14543
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Liu, Wenzhang
Jin, Lianjun
Ren, Lu
Mu, Chaoxu
Sun, Changyin
Machine Learning
Intelligent decision-making within large and redundant action spaces remains challenging in deep reinforcement learning. Considering similar but ineffective actions at each step can lead to repetitive and unproductive trials. Existing methods attempt to improve agent exploration by reducing or penalizing redundant actions, yet they fail to provide quantitative and reliable evidence to determine redundancy. In this paper, we propose a method to improve exploration efficiency by estimating the causal effects of actions. Unlike prior methods, our approach offers quantitative results regarding the causality of actions for one-step transitions. We first pre-train an inverse dynamics model to serve as prior knowledge of the environment. Subsequently, we classify actions across the entire action space at each time step and estimate the causal effect of each action to suppress redundant actions during exploration. We provide a theoretical analysis to demonstrate the effectiveness of our method and present empirical results from simulations in environments with redundant actions to evaluate its performance. Our implementation is available at https://github.com/agi-brain/cee.git.
title Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
topic Machine Learning
url https://arxiv.org/abs/2501.14543