Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yan, Yuxiang, Zhou, Zhiyuan, Gao, Xin, Li, Guanghao, Li, Shenglin, Chen, Jiaqi, Pu, Qunyan, Pu, Jian
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2510.25138
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914226518360064
author	Yan, Yuxiang Zhou, Zhiyuan Gao, Xin Li, Guanghao Li, Shenglin Chen, Jiaqi Pu, Qunyan Pu, Jian
author_facet	Yan, Yuxiang Zhou, Zhiyuan Gao, Xin Li, Guanghao Li, Shenglin Chen, Jiaqi Pu, Qunyan Pu, Jian
contents	Manipulation in cluttered environments is challenging due to spatial dependencies among objects, where an improper manipulation order can cause collisions or blocked access. Existing approaches often overlook these spatial relationships, limiting their flexibility and scalability. To address these limitations, we propose OrderMind, a unified spatial-aware manipulation ordering framework that directly learns object manipulation priorities based on spatial context. Our architecture integrates a spatial context encoder with a temporal priority structuring module. We construct a spatial graph using k-Nearest Neighbors to aggregate geometric information from the local layout and encode both object-object and object-manipulator interactions to support accurate manipulation ordering in real-time. To generate physically and semantically plausible supervision signals, we introduce a spatial prior labeling method that guides a vision-language model to produce reasonable manipulation orders for distillation. We evaluate OrderMind on our Manipulation Ordering Benchmark, comprising 163,222 samples of varying difficulty. Extensive experiments in both simulation and real-world environments demonstrate that our method significantly outperforms prior approaches in effectiveness and efficiency, enabling robust manipulation in cluttered scenes.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_25138
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Learning Spatial-Aware Manipulation Ordering Yan, Yuxiang Zhou, Zhiyuan Gao, Xin Li, Guanghao Li, Shenglin Chen, Jiaqi Pu, Qunyan Pu, Jian Robotics Manipulation in cluttered environments is challenging due to spatial dependencies among objects, where an improper manipulation order can cause collisions or blocked access. Existing approaches often overlook these spatial relationships, limiting their flexibility and scalability. To address these limitations, we propose OrderMind, a unified spatial-aware manipulation ordering framework that directly learns object manipulation priorities based on spatial context. Our architecture integrates a spatial context encoder with a temporal priority structuring module. We construct a spatial graph using k-Nearest Neighbors to aggregate geometric information from the local layout and encode both object-object and object-manipulator interactions to support accurate manipulation ordering in real-time. To generate physically and semantically plausible supervision signals, we introduce a spatial prior labeling method that guides a vision-language model to produce reasonable manipulation orders for distillation. We evaluate OrderMind on our Manipulation Ordering Benchmark, comprising 163,222 samples of varying difficulty. Extensive experiments in both simulation and real-world environments demonstrate that our method significantly outperforms prior approaches in effectiveness and efficiency, enabling robust manipulation in cluttered scenes.
title	Learning Spatial-Aware Manipulation Ordering
topic	Robotics
url	https://arxiv.org/abs/2510.25138

Similar Items