Saved in:
Bibliographic Details
Main Authors: Zhang, Chunhui, Wang, Yuxuan, Qin, Aoyang, Lu, Yi-Long, Wu, Kunlun, Wang, Yizhou, Wang, Wei
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.27699
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational conflicts. We introduce \textit{ValuePlanner}, a hierarchical cognitive architecture that decouples high-level value scheduling from low-level action execution. \textit{ValuePlanner} employs an LLM-based cognitive module to generate symbolic subgoals by reasoning through abstract value trade-offs, which are then translated into executable action plans by a classical PDDL planner. This process is refined via a closed-loop feedback mechanism. Evaluating such autonomy requires methods beyond task-success rates, and we therefore propose a value-centric evaluation suite measuring cumulative value gain, preference alignment, and behavioral diversity. Experiments in the TongSim household environment demonstrate that \textit{ValuePlanner} arbitrates competing values to generate coherent, long-horizon, self-directed behavior absent from instruction-following and needs-driven baselines. Our work offers a structured approach to bridging intrinsic values and grounded behavior for autonomous agents.