Saved in:
Bibliographic Details
Main Authors: Zheng, Yixin, Lyu, Jiangran, Zhang, Yifan, Chen, Jiayi, Yan, Mi, Deng, Yuntian, Shi, Xuesong, Zhao, Xiaoguang, Wang, Yizhou, Zhang, Zhizheng, Wang, He
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.09882
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911722368925696
author Zheng, Yixin
Lyu, Jiangran
Zhang, Yifan
Chen, Jiayi
Yan, Mi
Deng, Yuntian
Shi, Xuesong
Zhao, Xiaoguang
Wang, Yizhou
Zhang, Zhizheng
Wang, He
author_facet Zheng, Yixin
Lyu, Jiangran
Zhang, Yifan
Chen, Jiayi
Yan, Mi
Deng, Yuntian
Shi, Xuesong
Zhao, Xiaoguang
Wang, Yizhou
Zhang, Zhizheng
Wang, He
contents Extrinsic dexterity leverages environmental contact to overcome the limitations of prehensile manipulation. However, achieving such dexterity in cluttered scenes remains challenging and underexplored, as it requires selectively exploiting contact among multiple interacting objects with inherently coupled dynamics. Existing approaches lack explicit modeling of such complex dynamics and therefore fall short in non-prehensile manipulation in cluttered environments, which in turn limits their practical applicability in real-world environments. In this paper, we introduce a Dynamics-Aware Policy Learning (DAPL) framework that can facilitate policy learning with a learned representation of contact-induced object dynamics in cluttered environments. This representation is learned through explicit world modeling and used to condition reinforcement learning, enabling extrinsic dexterity to emerge without hand-crafted contact heuristics or complex reward shaping. We evaluate our approach in both simulation and the real world. Our method outperforms prehensile manipulation, human teleoperation, and prior representation-based policies by over 25% in success rate on unseen simulated cluttered scenes with varying densities. The real-world success rate reaches around 50% across 10 cluttered scenes, while a practical grocery deployment further demonstrates robust sim-to-real transfer and applicability.
format Preprint
id arxiv_https___arxiv_org_abs_2603_09882
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning
Zheng, Yixin
Lyu, Jiangran
Zhang, Yifan
Chen, Jiayi
Yan, Mi
Deng, Yuntian
Shi, Xuesong
Zhao, Xiaoguang
Wang, Yizhou
Zhang, Zhizheng
Wang, He
Robotics
Artificial Intelligence
Extrinsic dexterity leverages environmental contact to overcome the limitations of prehensile manipulation. However, achieving such dexterity in cluttered scenes remains challenging and underexplored, as it requires selectively exploiting contact among multiple interacting objects with inherently coupled dynamics. Existing approaches lack explicit modeling of such complex dynamics and therefore fall short in non-prehensile manipulation in cluttered environments, which in turn limits their practical applicability in real-world environments. In this paper, we introduce a Dynamics-Aware Policy Learning (DAPL) framework that can facilitate policy learning with a learned representation of contact-induced object dynamics in cluttered environments. This representation is learned through explicit world modeling and used to condition reinforcement learning, enabling extrinsic dexterity to emerge without hand-crafted contact heuristics or complex reward shaping. We evaluate our approach in both simulation and the real world. Our method outperforms prehensile manipulation, human teleoperation, and prior representation-based policies by over 25% in success rate on unseen simulated cluttered scenes with varying densities. The real-world success rate reaches around 50% across 10 cluttered scenes, while a practical grocery deployment further demonstrates robust sim-to-real transfer and applicability.
title Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning
topic Robotics
Artificial Intelligence
url https://arxiv.org/abs/2603.09882