Saved in:
Bibliographic Details
Main Authors: Kamimura, Tomoya, Washiyama, Haruka, Sano, Akihito
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.14565
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914479009169408
author Kamimura, Tomoya
Washiyama, Haruka
Sano, Akihito
author_facet Kamimura, Tomoya
Washiyama, Haruka
Sano, Akihito
contents Embodiment is a significant keyword in recent machine learning fields. This study focused on the passive nature of the body of a biped robot to generate walking and running locomotion using model-based deep reinforcement learning. We constructed two models in a simulator, one with passive elements (e.g., springs) and the other, which is similar to general humanoids, without passive elements. The training of the model with passive elements was highly affected by the attractor of the system. This lead that although the trajectories quickly converged to limit cycles, it took a long time to obtain large rewards. However, thanks to the attractor-driven learning, the acquired locomotion was robust and energy-efficient. The results revealed that robots with passive elements could efficiently acquire high-performance locomotion by utilizing stable limit cycles generated through dynamic interaction between the body and ground. This study demonstrates the importance of implementing passive properties in the body for future embodied AI.
format Preprint
id arxiv_https___arxiv_org_abs_2604_14565
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Model-Based Reinforcement Learning Exploits Passive Body Dynamics for High-Performance Biped Robot Locomotion
Kamimura, Tomoya
Washiyama, Haruka
Sano, Akihito
Robotics
Systems and Control
Embodiment is a significant keyword in recent machine learning fields. This study focused on the passive nature of the body of a biped robot to generate walking and running locomotion using model-based deep reinforcement learning. We constructed two models in a simulator, one with passive elements (e.g., springs) and the other, which is similar to general humanoids, without passive elements. The training of the model with passive elements was highly affected by the attractor of the system. This lead that although the trajectories quickly converged to limit cycles, it took a long time to obtain large rewards. However, thanks to the attractor-driven learning, the acquired locomotion was robust and energy-efficient. The results revealed that robots with passive elements could efficiently acquire high-performance locomotion by utilizing stable limit cycles generated through dynamic interaction between the body and ground. This study demonstrates the importance of implementing passive properties in the body for future embodied AI.
title Model-Based Reinforcement Learning Exploits Passive Body Dynamics for High-Performance Biped Robot Locomotion
topic Robotics
Systems and Control
url https://arxiv.org/abs/2604.14565