Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Zhihong, Li, Yang, Huang, Rengming, Lu, Cewu, Cai, Panpan
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2602.12244
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915795056984064
author	Liu, Zhihong Li, Yang Huang, Rengming Lu, Cewu Cai, Panpan
author_facet	Liu, Zhihong Li, Yang Huang, Rengming Lu, Cewu Cai, Panpan
contents	Open world language conditioned task planning is crucial for robots operating in large-scale household environments. While many recent works attempt to address this problem using Large Language Models (LLMs) via prompting or training, a key challenge remains scalability. Performance often degrades rapidly with increasing environment size, plan length, instruction ambiguity, and constraint complexity. In this work, we propose Any House Any Task (AHAT), a household task planner optimized for long-horizon planning in large environments given ambiguous human instructions. At its core, AHAT utilizes an LLM trained to map task instructions and textual scene graphs into grounded subgoals defined in the Planning Domain Definition Language (PDDL). These subgoals are subsequently solved to generate feasible and optimal long-horizon plans through explicit symbolic reasoning. To enhance the model's ability to decompose complex and ambiguous intentions, we introduce TGPO, a novel reinforcement learning algorithm that integrates external correction of intermediate reasoning traces into Group Relative Policy Optimization (GRPO). Experiments demonstrate that AHAT achieves significant performance gains over state-of-the-art prompting, planning, and learning methods, particularly in human-style household tasks characterized by brief instructions but requiring complex execution plans.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_12244
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks Liu, Zhihong Li, Yang Huang, Rengming Lu, Cewu Cai, Panpan Robotics Open world language conditioned task planning is crucial for robots operating in large-scale household environments. While many recent works attempt to address this problem using Large Language Models (LLMs) via prompting or training, a key challenge remains scalability. Performance often degrades rapidly with increasing environment size, plan length, instruction ambiguity, and constraint complexity. In this work, we propose Any House Any Task (AHAT), a household task planner optimized for long-horizon planning in large environments given ambiguous human instructions. At its core, AHAT utilizes an LLM trained to map task instructions and textual scene graphs into grounded subgoals defined in the Planning Domain Definition Language (PDDL). These subgoals are subsequently solved to generate feasible and optimal long-horizon plans through explicit symbolic reasoning. To enhance the model's ability to decompose complex and ambiguous intentions, we introduce TGPO, a novel reinforcement learning algorithm that integrates external correction of intermediate reasoning traces into Group Relative Policy Optimization (GRPO). Experiments demonstrate that AHAT achieves significant performance gains over state-of-the-art prompting, planning, and learning methods, particularly in human-style household tasks characterized by brief instructions but requiring complex execution plans.
title	Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks
topic	Robotics
url	https://arxiv.org/abs/2602.12244

Similar Items