Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	He, Zewen, Chen, Chenyuan, Azizov, Dilshod, Nakamura, Yoshihiko
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2509.25443
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911194860748800
author	He, Zewen Chen, Chenyuan Azizov, Dilshod Nakamura, Yoshihiko
author_facet	He, Zewen Chen, Chenyuan Azizov, Dilshod Nakamura, Yoshihiko
contents	Humanoid whole-body locomotion control is a critical approach for humanoid robots to leverage their inherent advantages. Learning-based control methods derived from retargeted human motion data provide an effective means of addressing this issue. However, because most current human datasets lack measured force data, and learning-based robot control is largely position-based, achieving appropriate compliance during interaction with real environments remains challenging. This paper presents Compliant Task Pipeline (CoTaP): a pipeline that leverages compliance information in the learning-based structure of humanoid robots. A two-stage dual-agent reinforcement learning framework combined with model-based compliance control for humanoid robots is proposed. In the training process, first a base policy with a position-based controller is trained; then in the distillation, the upper-body policy is combined with model-based compliance control, and the lower-body agent is guided by the base policy. In the upper-body control, adjustable task-space compliance can be specified and integrated with other controllers through compliance modulation on the symmetric positive definite (SPD) manifold, ensuring system stability. We validated the feasibility of the proposed strategy in simulation, primarily comparing the responses to external disturbances under different compliance settings.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_25443
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	CoTaP: Compliant Task Pipeline and Reinforcement Learning of Its Controller with Compliance Modulation He, Zewen Chen, Chenyuan Azizov, Dilshod Nakamura, Yoshihiko Robotics Humanoid whole-body locomotion control is a critical approach for humanoid robots to leverage their inherent advantages. Learning-based control methods derived from retargeted human motion data provide an effective means of addressing this issue. However, because most current human datasets lack measured force data, and learning-based robot control is largely position-based, achieving appropriate compliance during interaction with real environments remains challenging. This paper presents Compliant Task Pipeline (CoTaP): a pipeline that leverages compliance information in the learning-based structure of humanoid robots. A two-stage dual-agent reinforcement learning framework combined with model-based compliance control for humanoid robots is proposed. In the training process, first a base policy with a position-based controller is trained; then in the distillation, the upper-body policy is combined with model-based compliance control, and the lower-body agent is guided by the base policy. In the upper-body control, adjustable task-space compliance can be specified and integrated with other controllers through compliance modulation on the symmetric positive definite (SPD) manifold, ensuring system stability. We validated the feasibility of the proposed strategy in simulation, primarily comparing the responses to external disturbances under different compliance settings.
title	CoTaP: Compliant Task Pipeline and Reinforcement Learning of Its Controller with Compliance Modulation
topic	Robotics
url	https://arxiv.org/abs/2509.25443

Similar Items