:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Xiang, Wentao, Zhang, Haokang, Yang, Tianhang, Chu, Zedong, Chu, Ruihang, Xie, Shichao, Yuan, Yujian, Sun, Jian, Gu, Zhining, Wang, Junjie, Wu, Xiaolong, Xu, Mu, Yang, Yujiu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.02400
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
by: Liu, Fei, et al.
Published: (2025)

MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation
by: Qi, Dekang, et al.
Published: (2026)

CE-Nav: Flow-Guided Reinforcement Refinement for Cross-Embodiment Local Navigation
by: Yang, Kai, et al.
Published: (2025)

SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025)

VideoZoomer: Reinforcement-Learned Temporal Focusing for Long Video Reasoning
by: Ding, Yang, et al.
Published: (2025)

OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation
by: Xue, Xinda, et al.
Published: (2025)

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
by: Chen, Yuqing, et al.
Published: (2025)

Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents
by: Chen, Xu, et al.
Published: (2026)

DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
by: Ortega-Peimbert, Jesús, et al.
Published: (2025)

AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation
by: Yang, Kai, et al.
Published: (2026)

Velocity-Space 3D Asset Editing
by: Liu, Hao, et al.
Published: (2026)

AstraNav-World: World Model for Foresight Control and Consistency
by: Chen, Jintao, et al.
Published: (2025)

Generative Universal Verifier as Multimodal Meta-Reasoner
by: Zhang, Xinchen, et al.
Published: (2025)

DRIVE-Nav: Directional Reasoning, Inspection, and Verification for Efficient Open-Vocabulary Navigation
by: Gao, Maoguo, et al.
Published: (2026)

OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
by: Rahman, Muhammad Rameez ur, et al.
Published: (2024)

FOM-Nav: Frontier-Object Maps for Object Goal Navigation
by: Chabal, Thomas, et al.
Published: (2025)

POINav: Benchmarking and Enhancing Final-Meters Arrival in Real-World Vision-Language Navigation
by: Gong, Ruiyan, et al.
Published: (2026)

EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
by: Yang, Zebin, et al.
Published: (2025)

OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
by: Zemskova, Tatiana, et al.
Published: (2025)

Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation
by: Bajpai, Utkarsh, et al.
Published: (2025)

Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
by: Wang, Zixuan, et al.
Published: (2026)

DSCD-Nav: Dual-Stance Cooperative Debate for Object Navigation
by: An, Weitao, et al.
Published: (2026)

OVAL: Open-Vocabulary Augmented Memory Model for Lifelong Object Goal Navigation
by: Pei, Jiahua, et al.
Published: (2026)

GoalSwarm: Multi-UAV Semantic Coordination for Open-Vocabulary Object Navigation
by: James, MoniJesu Wonders, et al.
Published: (2026)

Video-Zero: Self-Evolution Video Understanding
by: Zhang, Ruixu, et al.
Published: (2026)

CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
by: Cao, Yihan, et al.
Published: (2024)

TreeFedDG: Alleviating Global Drift in Federated Domain Generalization for Medical Image Segmentation
by: Song, Yucheng, et al.
Published: (2025)

FGML-DG: Feynman-Inspired Cognitive Science Paradigm for Cross-Domain Medical Image Segmentation
by: Song, Yucheng, et al.
Published: (2026)

LOG-Nav: Efficient Layout-Aware Object-Goal Navigation with Hierarchical Planning
by: Hou, Jiawei, et al.
Published: (2025)

SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation
by: Fang, Leyuan, et al.
Published: (2026)

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
by: Ma, Ji, et al.
Published: (2024)

HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation
by: Yokoyama, Naoki, et al.
Published: (2024)

Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation
by: Ren, Yiming, et al.
Published: (2026)

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
by: Wan, Jiansong, et al.
Published: (2025)

LOVON: Legged Open-Vocabulary Object Navigator
by: Peng, Daojie, et al.
Published: (2025)

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning
by: Luo, Ruilin, et al.
Published: (2026)

Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding
by: Zhang, Yuhang, et al.
Published: (2025)

LangMap: A Human-Verified Benchmark for Hierarchical Open-Vocabulary Goal Navigation
by: Miao, Bo, et al.
Published: (2026)

Open-Vocabulary Object Detection in UAV Imagery: A Review and Future Perspectives
by: Zhou, Yang, et al.
Published: (2025)

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning
by: Ren, Yiming, et al.
Published: (2025)