Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yin, Xu, Yoon, Min-Sung, Huo, Yuchi, Zhang, Kang, Yoon, Sung-Eui
Format:	Preprint
Published:	2025
Subjects:	Robotics Artificial Intelligence
Online Access:	https://arxiv.org/abs/2504.09893
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912325592678400
author	Yin, Xu Yoon, Min-Sung Huo, Yuchi Zhang, Kang Yoon, Sung-Eui
author_facet	Yin, Xu Yoon, Min-Sung Huo, Yuchi Zhang, Kang Yoon, Sung-Eui
contents	Task execution for object rearrangement could be challenged by Task-Level Perturbations (TLP), i.e., unexpected object additions, removals, and displacements that can disrupt underlying visual policies and fundamentally compromise task feasibility and progress. To address these challenges, we present LangPert, a language-based framework designed to detect and mitigate TLP situations in tabletop rearrangement tasks. LangPert integrates a Visual Language Model (VLM) to comprehensively monitor policy's skill execution and environmental TLP, while leveraging the Hierarchical Chain-of-Thought (HCoT) reasoning mechanism to enhance the Large Language Model (LLM)'s contextual understanding and generate adaptive, corrective skill-execution plans. Our experimental results demonstrate that LangPert handles diverse TLP situations more effectively than baseline methods, achieving higher task completion rates, improved execution efficiency, and potential generalization to unseen scenarios.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_09893
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement Yin, Xu Yoon, Min-Sung Huo, Yuchi Zhang, Kang Yoon, Sung-Eui Robotics Artificial Intelligence Task execution for object rearrangement could be challenged by Task-Level Perturbations (TLP), i.e., unexpected object additions, removals, and displacements that can disrupt underlying visual policies and fundamentally compromise task feasibility and progress. To address these challenges, we present LangPert, a language-based framework designed to detect and mitigate TLP situations in tabletop rearrangement tasks. LangPert integrates a Visual Language Model (VLM) to comprehensively monitor policy's skill execution and environmental TLP, while leveraging the Hierarchical Chain-of-Thought (HCoT) reasoning mechanism to enhance the Large Language Model (LLM)'s contextual understanding and generate adaptive, corrective skill-execution plans. Our experimental results demonstrate that LangPert handles diverse TLP situations more effectively than baseline methods, achieving higher task completion rates, improved execution efficiency, and potential generalization to unseen scenarios.
title	LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement
topic	Robotics Artificial Intelligence
url	https://arxiv.org/abs/2504.09893

Similar Items