Saved in:
Bibliographic Details
Main Authors: Deng, Jiechao, Tan, Ning
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.03145
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909525416607744
author Deng, Jiechao
Tan, Ning
author_facet Deng, Jiechao
Tan, Ning
contents Deep reinforcement learning has made significant strides in various robotic tasks. However, employing deep reinforcement learning methods to tackle multi-stage tasks still a challenge. Reinforcement learning algorithms often encounter issues such as redundant exploration, getting stuck in dead ends, and progress reversal in multi-stage tasks. To address this, we propose a method that integrates causal relationships with reinforcement learning for multi-stage tasks. Our approach enables robots to automatically discover the causal relationships between their actions and the rewards of the tasks and constructs the action space using only causal actions, thereby reducing redundant exploration and progress reversal. By integrating correct causal relationships using the causal policy gradient method into the learning process, our approach can enhance the performance of reinforcement learning algorithms in multi-stage robotic tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2503_03145
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Causality-Based Reinforcement Learning Method for Multi-Stage Robotic Tasks
Deng, Jiechao
Tan, Ning
Robotics
Deep reinforcement learning has made significant strides in various robotic tasks. However, employing deep reinforcement learning methods to tackle multi-stage tasks still a challenge. Reinforcement learning algorithms often encounter issues such as redundant exploration, getting stuck in dead ends, and progress reversal in multi-stage tasks. To address this, we propose a method that integrates causal relationships with reinforcement learning for multi-stage tasks. Our approach enables robots to automatically discover the causal relationships between their actions and the rewards of the tasks and constructs the action space using only causal actions, thereby reducing redundant exploration and progress reversal. By integrating correct causal relationships using the causal policy gradient method into the learning process, our approach can enhance the performance of reinforcement learning algorithms in multi-stage robotic tasks.
title Causality-Based Reinforcement Learning Method for Multi-Stage Robotic Tasks
topic Robotics
url https://arxiv.org/abs/2503.03145