Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yuan, Mingqi, Yu, Tao, Song, Haolin, Li, Bo, Jin, Xin, Chen, Hua, Zeng, Wenjun
Format:	Preprint
Published:	2025
Subjects:	Robotics Machine Learning
Online Access:	https://arxiv.org/abs/2512.13093
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915851523850240
author	Yuan, Mingqi Yu, Tao Song, Haolin Li, Bo Jin, Xin Chen, Hua Zeng, Wenjun
author_facet	Yuan, Mingqi Yu, Tao Song, Haolin Li, Bo Jin, Xin Chen, Hua Zeng, Wenjun
contents	Achieving efficient and robust whole-body control (WBC) is essential for enabling humanoid robots to perform complex tasks in dynamic environments. Despite the success of reinforcement learning (RL) in this domain, its sample inefficiency remains a significant challenge due to the intricate dynamics and partial observability of humanoid robots. To address this limitation, we propose PvP, a Proprioceptive-Privileged contrastive learning framework that leverages the intrinsic complementarity between proprioceptive and privileged states. PvP learns compact and task-relevant latent representations without requiring hand-crafted data augmentations, enabling faster and more stable policy learning. To support systematic evaluation, we develop SRL4Humanoid, the first unified and modular framework that provides high-quality implementations of representative state representation learning (SRL) methods for humanoid robot learning. Extensive experiments on the LimX Oli robot across velocity tracking and motion imitation tasks demonstrate that PvP significantly improves sample efficiency and final performance compared to baseline SRL methods. Our study further provides practical insights into integrating SRL with RL for humanoid WBC, offering valuable guidance for data-efficient humanoid robot learning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_13093
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations Yuan, Mingqi Yu, Tao Song, Haolin Li, Bo Jin, Xin Chen, Hua Zeng, Wenjun Robotics Machine Learning Achieving efficient and robust whole-body control (WBC) is essential for enabling humanoid robots to perform complex tasks in dynamic environments. Despite the success of reinforcement learning (RL) in this domain, its sample inefficiency remains a significant challenge due to the intricate dynamics and partial observability of humanoid robots. To address this limitation, we propose PvP, a Proprioceptive-Privileged contrastive learning framework that leverages the intrinsic complementarity between proprioceptive and privileged states. PvP learns compact and task-relevant latent representations without requiring hand-crafted data augmentations, enabling faster and more stable policy learning. To support systematic evaluation, we develop SRL4Humanoid, the first unified and modular framework that provides high-quality implementations of representative state representation learning (SRL) methods for humanoid robot learning. Extensive experiments on the LimX Oli robot across velocity tracking and motion imitation tasks demonstrate that PvP significantly improves sample efficiency and final performance compared to baseline SRL methods. Our study further provides practical insights into integrating SRL with RL for humanoid WBC, offering valuable guidance for data-efficient humanoid robot learning.
title	PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations
topic	Robotics Machine Learning
url	https://arxiv.org/abs/2512.13093

Similar Items