Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Deng, Kaize, Yang, Zewen
Format:	Preprint
Publié:	2026
Sujets:	Robotics Artificial Intelligence
Accès en ligne:	https://arxiv.org/abs/2605.15480
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866914568757837824
author	Deng, Kaize Yang, Zewen
author_facet	Deng, Kaize Yang, Zewen
contents	Stochastic communication delays in teleoperation introduce signal discontinuities that undermine control stability and degrade control performance. Consequently, the conventional reinforcement learning (RL) methods struggle with the delayed observations due to the delay-induced observations, leading to high-frequency chattering. To address this, we propose a hybrid control framework, delay-resilient RL, integrating a state estimator utilizing Long Short-Term Memory (LSTM) with a residual RL policy, which is resilient to stochastic delays. The LSTM reconstructs smooth, continuous state estimates from delayed observations, enabling the RL agent to learn a residual torque compensation policy that balances tracking accuracy with velocity smoothness. Experimental validation on Franka Panda robots demonstrates that our approach significantly outperforms the state-of-the-art baselines, ensuring robust and stable teleoperation even under high-variance stochastic delays.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_15480
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays Deng, Kaize Yang, Zewen Robotics Artificial Intelligence Stochastic communication delays in teleoperation introduce signal discontinuities that undermine control stability and degrade control performance. Consequently, the conventional reinforcement learning (RL) methods struggle with the delayed observations due to the delay-induced observations, leading to high-frequency chattering. To address this, we propose a hybrid control framework, delay-resilient RL, integrating a state estimator utilizing Long Short-Term Memory (LSTM) with a residual RL policy, which is resilient to stochastic delays. The LSTM reconstructs smooth, continuous state estimates from delayed observations, enabling the RL agent to learn a residual torque compensation policy that balances tracking accuracy with velocity smoothness. Experimental validation on Franka Panda robots demonstrates that our approach significantly outperforms the state-of-the-art baselines, ensuring robust and stable teleoperation even under high-variance stochastic delays.
title	Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays
topic	Robotics Artificial Intelligence
url	https://arxiv.org/abs/2605.15480

Documents similaires