Saved in:
Bibliographic Details
Main Authors: Liu, Zhihong, Li, Yang, Huang, Rengming, Lu, Cewu, Cai, Panpan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.12244
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915795056984064
author Liu, Zhihong
Li, Yang
Huang, Rengming
Lu, Cewu
Cai, Panpan
author_facet Liu, Zhihong
Li, Yang
Huang, Rengming
Lu, Cewu
Cai, Panpan
contents Open world language conditioned task planning is crucial for robots operating in large-scale household environments. While many recent works attempt to address this problem using Large Language Models (LLMs) via prompting or training, a key challenge remains scalability. Performance often degrades rapidly with increasing environment size, plan length, instruction ambiguity, and constraint complexity. In this work, we propose Any House Any Task (AHAT), a household task planner optimized for long-horizon planning in large environments given ambiguous human instructions. At its core, AHAT utilizes an LLM trained to map task instructions and textual scene graphs into grounded subgoals defined in the Planning Domain Definition Language (PDDL). These subgoals are subsequently solved to generate feasible and optimal long-horizon plans through explicit symbolic reasoning. To enhance the model's ability to decompose complex and ambiguous intentions, we introduce TGPO, a novel reinforcement learning algorithm that integrates external correction of intermediate reasoning traces into Group Relative Policy Optimization (GRPO). Experiments demonstrate that AHAT achieves significant performance gains over state-of-the-art prompting, planning, and learning methods, particularly in human-style household tasks characterized by brief instructions but requiring complex execution plans.
format Preprint
id arxiv_https___arxiv_org_abs_2602_12244
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks
Liu, Zhihong
Li, Yang
Huang, Rengming
Lu, Cewu
Cai, Panpan
Robotics
Open world language conditioned task planning is crucial for robots operating in large-scale household environments. While many recent works attempt to address this problem using Large Language Models (LLMs) via prompting or training, a key challenge remains scalability. Performance often degrades rapidly with increasing environment size, plan length, instruction ambiguity, and constraint complexity. In this work, we propose Any House Any Task (AHAT), a household task planner optimized for long-horizon planning in large environments given ambiguous human instructions. At its core, AHAT utilizes an LLM trained to map task instructions and textual scene graphs into grounded subgoals defined in the Planning Domain Definition Language (PDDL). These subgoals are subsequently solved to generate feasible and optimal long-horizon plans through explicit symbolic reasoning. To enhance the model's ability to decompose complex and ambiguous intentions, we introduce TGPO, a novel reinforcement learning algorithm that integrates external correction of intermediate reasoning traces into Group Relative Policy Optimization (GRPO). Experiments demonstrate that AHAT achieves significant performance gains over state-of-the-art prompting, planning, and learning methods, particularly in human-style household tasks characterized by brief instructions but requiring complex execution plans.
title Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks
topic Robotics
url https://arxiv.org/abs/2602.12244