Saved in:
Bibliographic Details
Main Authors: Zhang, Zihao, Wei, Hui, Jiang, Kenan, Pan, Shijia, Kai, Shu, Liu, Fei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.14656
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917195410309120
author Zhang, Zihao
Wei, Hui
Jiang, Kenan
Pan, Shijia
Kai, Shu
Liu, Fei
author_facet Zhang, Zihao
Wei, Hui
Jiang, Kenan
Pan, Shijia
Kai, Shu
Liu, Fei
contents Planning under resource constraints is central to real-world decision making, yet most large language model (LLM) planners assume uniform action costs. We systematically analyze whether tree-search LLM planners are cost-aware and whether they efficiently generate budget-feasible plans. In contrast to black-box prompting, explicit search trees expose intermediate decisions, node evaluations, and failure modes, which allows for controlled ablations of planner behavior. We study depth-first search, breadth-first search, Monte Carlo Tree Search, and bidirectional search within a unified framework. Our experiments show that existing tree-based LLM planners often struggle to find cost-optimal plans, and that additional search computation does not reliably improve optimality. Among the methods evaluated, bidirectional search achieves the best overall efficiency and success rate. MCTS achieves the highest optimality on short-horizon tasks. Tree-search planners are especially valuable for studying LLM planning because their reasoning steps are explicit, in contrast to plain LLMs that internalize planning dynamics through post-training trajectories. Our findings suggest that improving LLM planning under resource constraints will likely require new search algorithms, rather than solely scaling inference-time compute.
format Preprint
id arxiv_https___arxiv_org_abs_2505_14656
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Cost-Awareness in Tree-Search LLM Planning: A Systematic Study
Zhang, Zihao
Wei, Hui
Jiang, Kenan
Pan, Shijia
Kai, Shu
Liu, Fei
Artificial Intelligence
Planning under resource constraints is central to real-world decision making, yet most large language model (LLM) planners assume uniform action costs. We systematically analyze whether tree-search LLM planners are cost-aware and whether they efficiently generate budget-feasible plans. In contrast to black-box prompting, explicit search trees expose intermediate decisions, node evaluations, and failure modes, which allows for controlled ablations of planner behavior. We study depth-first search, breadth-first search, Monte Carlo Tree Search, and bidirectional search within a unified framework. Our experiments show that existing tree-based LLM planners often struggle to find cost-optimal plans, and that additional search computation does not reliably improve optimality. Among the methods evaluated, bidirectional search achieves the best overall efficiency and success rate. MCTS achieves the highest optimality on short-horizon tasks. Tree-search planners are especially valuable for studying LLM planning because their reasoning steps are explicit, in contrast to plain LLMs that internalize planning dynamics through post-training trajectories. Our findings suggest that improving LLM planning under resource constraints will likely require new search algorithms, rather than solely scaling inference-time compute.
title Cost-Awareness in Tree-Search LLM Planning: A Systematic Study
topic Artificial Intelligence
url https://arxiv.org/abs/2505.14656