Saved in:
Bibliographic Details
Main Authors: Ha, Minh Hieu, Ta, Khanh Ly, Phan, Hung, Doan, Tung, Dao, Tung, Tran, Dao, Binh, Huynh Thi Thanh
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.01910
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917216502415360
author Ha, Minh Hieu
Ta, Khanh Ly
Phan, Hung
Doan, Tung
Dao, Tung
Tran, Dao
Binh, Huynh Thi Thanh
author_facet Ha, Minh Hieu
Ta, Khanh Ly
Phan, Hung
Doan, Tung
Dao, Tung
Tran, Dao
Binh, Huynh Thi Thanh
contents Autonomous path planning requires a synergy between global reasoning and geometric precision, especially in complex or cluttered environments. While classical A* is valued for its optimality, it incurs prohibitive computational and memory costs in large-scale scenarios. Recent attempts to mitigate these limitations by using Large Language Models for waypoint guidance remain insufficient, as they rely only on text-based reasoning without spatial grounding. As a result, such models often produce incorrect waypoints in topologically complex environments with dead ends, and lack the perceptual capacity to interpret ambiguous physical boundaries. These inconsistencies lead to costly corrective expansions and undermine the intended computational efficiency. We introduce MMP-A*, a multimodal framework that integrates the spatial grounding capabilities of vision-language models with a novel adaptive decay mechanism. By anchoring high-level reasoning in physical geometry, the framework produces coherent waypoint guidance that addresses the limitations of text-only planners. The adaptive decay mechanism dynamically regulates the influence of uncertain waypoints within the heuristic, ensuring geometric validity while substantially reducing memory overhead. To evaluate robustness, we test the framework in challenging environments characterized by severe clutter and topological complexity. Experimental results show that MMP-A* achieves near-optimal trajectories with significantly reduced operational costs, demonstrating its potential as a perception-grounded and computationally efficient paradigm for autonomous navigation.
format Preprint
id arxiv_https___arxiv_org_abs_2601_01910
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle MMP-A*: Multimodal Perception Enhanced Incremental Heuristic Search on Path Planning
Ha, Minh Hieu
Ta, Khanh Ly
Phan, Hung
Doan, Tung
Dao, Tung
Tran, Dao
Binh, Huynh Thi Thanh
Artificial Intelligence
Autonomous path planning requires a synergy between global reasoning and geometric precision, especially in complex or cluttered environments. While classical A* is valued for its optimality, it incurs prohibitive computational and memory costs in large-scale scenarios. Recent attempts to mitigate these limitations by using Large Language Models for waypoint guidance remain insufficient, as they rely only on text-based reasoning without spatial grounding. As a result, such models often produce incorrect waypoints in topologically complex environments with dead ends, and lack the perceptual capacity to interpret ambiguous physical boundaries. These inconsistencies lead to costly corrective expansions and undermine the intended computational efficiency. We introduce MMP-A*, a multimodal framework that integrates the spatial grounding capabilities of vision-language models with a novel adaptive decay mechanism. By anchoring high-level reasoning in physical geometry, the framework produces coherent waypoint guidance that addresses the limitations of text-only planners. The adaptive decay mechanism dynamically regulates the influence of uncertain waypoints within the heuristic, ensuring geometric validity while substantially reducing memory overhead. To evaluate robustness, we test the framework in challenging environments characterized by severe clutter and topological complexity. Experimental results show that MMP-A* achieves near-optimal trajectories with significantly reduced operational costs, demonstrating its potential as a perception-grounded and computationally efficient paradigm for autonomous navigation.
title MMP-A*: Multimodal Perception Enhanced Incremental Heuristic Search on Path Planning
topic Artificial Intelligence
url https://arxiv.org/abs/2601.01910