Saved in:
| Main Authors: | Xiang, Wentao, Zhang, Haokang, Yang, Tianhang, Chu, Zedong, Chu, Ruihang, Xie, Shichao, Yuan, Yujian, Sun, Jian, Gu, Zhining, Wang, Junjie, Wu, Xiaolong, Xu, Mu, Yang, Yujiu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.02400 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
by: Liu, Fei, et al.
Published: (2025)
by: Liu, Fei, et al.
Published: (2025)
MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation
by: Qi, Dekang, et al.
Published: (2026)
by: Qi, Dekang, et al.
Published: (2026)
CE-Nav: Flow-Guided Reinforcement Refinement for Cross-Embodiment Local Navigation
by: Yang, Kai, et al.
Published: (2025)
by: Yang, Kai, et al.
Published: (2025)
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025)
by: Chen, Ziyi, et al.
Published: (2025)
VideoZoomer: Reinforcement-Learned Temporal Focusing for Long Video Reasoning
by: Ding, Yang, et al.
Published: (2025)
by: Ding, Yang, et al.
Published: (2025)
OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation
by: Xue, Xinda, et al.
Published: (2025)
by: Xue, Xinda, et al.
Published: (2025)
O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
by: Chen, Yuqing, et al.
Published: (2025)
by: Chen, Yuqing, et al.
Published: (2025)
Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents
by: Chen, Xu, et al.
Published: (2026)
by: Chen, Xu, et al.
Published: (2026)
DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
by: Ortega-Peimbert, Jesús, et al.
Published: (2025)
by: Ortega-Peimbert, Jesús, et al.
Published: (2025)
AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation
by: Yang, Kai, et al.
Published: (2026)
by: Yang, Kai, et al.
Published: (2026)
Velocity-Space 3D Asset Editing
by: Liu, Hao, et al.
Published: (2026)
by: Liu, Hao, et al.
Published: (2026)
AstraNav-World: World Model for Foresight Control and Consistency
by: Chen, Jintao, et al.
Published: (2025)
by: Chen, Jintao, et al.
Published: (2025)
Generative Universal Verifier as Multimodal Meta-Reasoner
by: Zhang, Xinchen, et al.
Published: (2025)
by: Zhang, Xinchen, et al.
Published: (2025)
DRIVE-Nav: Directional Reasoning, Inspection, and Verification for Efficient Open-Vocabulary Navigation
by: Gao, Maoguo, et al.
Published: (2026)
by: Gao, Maoguo, et al.
Published: (2026)
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation
by: Rahman, Muhammad Rameez ur, et al.
Published: (2024)
by: Rahman, Muhammad Rameez ur, et al.
Published: (2024)
FOM-Nav: Frontier-Object Maps for Object Goal Navigation
by: Chabal, Thomas, et al.
Published: (2025)
by: Chabal, Thomas, et al.
Published: (2025)
POINav: Benchmarking and Enhancing Final-Meters Arrival in Real-World Vision-Language Navigation
by: Gong, Ruiyan, et al.
Published: (2026)
by: Gong, Ruiyan, et al.
Published: (2026)
EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
by: Yang, Zebin, et al.
Published: (2025)
by: Yang, Zebin, et al.
Published: (2025)
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
by: Zemskova, Tatiana, et al.
Published: (2025)
by: Zemskova, Tatiana, et al.
Published: (2025)
Uncertainty-Informed Active Perception for Open Vocabulary Object Goal Navigation
by: Bajpai, Utkarsh, et al.
Published: (2025)
by: Bajpai, Utkarsh, et al.
Published: (2025)
Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
by: Wang, Zixuan, et al.
Published: (2026)
by: Wang, Zixuan, et al.
Published: (2026)
DSCD-Nav: Dual-Stance Cooperative Debate for Object Navigation
by: An, Weitao, et al.
Published: (2026)
by: An, Weitao, et al.
Published: (2026)
OVAL: Open-Vocabulary Augmented Memory Model for Lifelong Object Goal Navigation
by: Pei, Jiahua, et al.
Published: (2026)
by: Pei, Jiahua, et al.
Published: (2026)
GoalSwarm: Multi-UAV Semantic Coordination for Open-Vocabulary Object Navigation
by: James, MoniJesu Wonders, et al.
Published: (2026)
by: James, MoniJesu Wonders, et al.
Published: (2026)
Video-Zero: Self-Evolution Video Understanding
by: Zhang, Ruixu, et al.
Published: (2026)
by: Zhang, Ruixu, et al.
Published: (2026)
CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs
by: Cao, Yihan, et al.
Published: (2024)
by: Cao, Yihan, et al.
Published: (2024)
TreeFedDG: Alleviating Global Drift in Federated Domain Generalization for Medical Image Segmentation
by: Song, Yucheng, et al.
Published: (2025)
by: Song, Yucheng, et al.
Published: (2025)
FGML-DG: Feynman-Inspired Cognitive Science Paradigm for Cross-Domain Medical Image Segmentation
by: Song, Yucheng, et al.
Published: (2026)
by: Song, Yucheng, et al.
Published: (2026)
LOG-Nav: Efficient Layout-Aware Object-Goal Navigation with Hierarchical Planning
by: Hou, Jiawei, et al.
Published: (2025)
by: Hou, Jiawei, et al.
Published: (2025)
SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation
by: Fang, Leyuan, et al.
Published: (2026)
by: Fang, Leyuan, et al.
Published: (2026)
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
by: Ma, Ji, et al.
Published: (2024)
by: Ma, Ji, et al.
Published: (2024)
HM3D-OVON: A Dataset and Benchmark for Open-Vocabulary Object Goal Navigation
by: Yokoyama, Naoki, et al.
Published: (2024)
by: Yokoyama, Naoki, et al.
Published: (2024)
Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation
by: Ren, Yiming, et al.
Published: (2026)
by: Ren, Yiming, et al.
Published: (2026)
PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
by: Wan, Jiansong, et al.
Published: (2025)
by: Wan, Jiansong, et al.
Published: (2025)
LOVON: Legged Open-Vocabulary Object Navigator
by: Peng, Daojie, et al.
Published: (2025)
by: Peng, Daojie, et al.
Published: (2025)
From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning
by: Luo, Ruilin, et al.
Published: (2026)
by: Luo, Ruilin, et al.
Published: (2026)
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding
by: Zhang, Yuhang, et al.
Published: (2025)
by: Zhang, Yuhang, et al.
Published: (2025)
LangMap: A Human-Verified Benchmark for Hierarchical Open-Vocabulary Goal Navigation
by: Miao, Bo, et al.
Published: (2026)
by: Miao, Bo, et al.
Published: (2026)
Open-Vocabulary Object Detection in UAV Imagery: A Review and Future Perspectives
by: Zhou, Yang, et al.
Published: (2025)
by: Zhou, Yang, et al.
Published: (2025)
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning
by: Ren, Yiming, et al.
Published: (2025)
by: Ren, Yiming, et al.
Published: (2025)
Similar Items
-
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
by: Liu, Fei, et al.
Published: (2025) -
MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation
by: Qi, Dekang, et al.
Published: (2026) -
CE-Nav: Flow-Guided Reinforcement Refinement for Cross-Embodiment Local Navigation
by: Yang, Kai, et al.
Published: (2025) -
SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation
by: Chen, Ziyi, et al.
Published: (2025) -
VideoZoomer: Reinforcement-Learned Temporal Focusing for Long Video Reasoning
by: Ding, Yang, et al.
Published: (2025)