Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yan, Muyang, Mengdibayev, Miras, Floros, Ardon, Guo, Weihang, Kavraki, Lydia E., Kingston, Zachary
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2510.25548
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908884885569536
author	Yan, Muyang Mengdibayev, Miras Floros, Ardon Guo, Weihang Kavraki, Lydia E. Kingston, Zachary
author_facet	Yan, Muyang Mengdibayev, Miras Floros, Ardon Guo, Weihang Kavraki, Lydia E. Kingston, Zachary
contents	In task and motion planning, high-level task planning is done over an abstraction of the world to enable efficient search in long-horizon robotics problems. However, the feasibility of these task-level plans relies on the downward refinability of the abstraction into continuous motion. When a domain's refinability is poor, task-level plans that appear valid may ultimately fail during motion planning, requiring replanning and resulting in slower overall performance. Prior works mitigate this by encoding refinement issues as constraints to prune infeasible task plans. However, these approaches only add constraints upon refinement failure, expending significant search effort on infeasible branches. We propose VIZ-COAST, a method of leveraging the common-sense spatial reasoning of large pretrained Vision-Language Models to identify issues with downward refinement a priori, bypassing the need to fix these failures during planning. Experiments on three challenging TAMP domains show that our approach is able to extract plausible constraints from images and domain descriptions, drastically reducing planning times and, in some cases, eliminating downward refinement failures altogether, generalizing to a diverse range of instances from the broader domain.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_25548
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Using VLM Reasoning to Constrain Task and Motion Planning Yan, Muyang Mengdibayev, Miras Floros, Ardon Guo, Weihang Kavraki, Lydia E. Kingston, Zachary Robotics In task and motion planning, high-level task planning is done over an abstraction of the world to enable efficient search in long-horizon robotics problems. However, the feasibility of these task-level plans relies on the downward refinability of the abstraction into continuous motion. When a domain's refinability is poor, task-level plans that appear valid may ultimately fail during motion planning, requiring replanning and resulting in slower overall performance. Prior works mitigate this by encoding refinement issues as constraints to prune infeasible task plans. However, these approaches only add constraints upon refinement failure, expending significant search effort on infeasible branches. We propose VIZ-COAST, a method of leveraging the common-sense spatial reasoning of large pretrained Vision-Language Models to identify issues with downward refinement a priori, bypassing the need to fix these failures during planning. Experiments on three challenging TAMP domains show that our approach is able to extract plausible constraints from images and domain descriptions, drastically reducing planning times and, in some cases, eliminating downward refinement failures altogether, generalizing to a diverse range of instances from the broader domain.
title	Using VLM Reasoning to Constrain Task and Motion Planning
topic	Robotics
url	https://arxiv.org/abs/2510.25548

Similar Items