| Main Authors: | Wang, Liuyi, He, Zongtao, Li, Jinlong, Xia, Ruihao, Hu, Mengxian, Yao, Chenpeng, Liu, Chengju, Tang, Yang, Chen, Qijun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.10360 |
Similar Items
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
by: He, Zongtao, et al.
Published: (2023)
Vision-and-Language Navigation via Causal Learning
by: Wang, Liuyi, et al.
Published: (2024)
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
by: Wang, Liuyi, et al.
Published: (2024)
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
by: Wang, Liuyi, et al.
Published: (2024)
NavComposer: Composing Language Instructions for Navigation Trajectories through Action-Scene-Object Modularization
by: He, Zongtao, et al.
Published: (2025)
P2DNav: Panorama-to-Downview Reasoning for Zero-shot Vision-and-Language Navigation
by: Sheng, Kai, et al.
Published: (2026)
A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
by: Wang, Liuyi, et al.
Published: (2023)
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
by: Wang, Liuyi, et al.
Published: (2023)
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
by: Wang, Liuyi, et al.
Published: (2025)
Realizing Text-Driven Motion Generation on NAO Robot: A Reinforcement Learning-Optimized Control Pipeline
by: Xu, Zihan, et al.
Published: (2025)
Dynamics Are Learned, Not Told: Semi-Supervised Discovery of Latent Dynamics Geometries For Zero-Shot Policy Adaptation
by: Xu, Zhiming, et al.
Published: (2026)
Vision-Language Navigation for Aerial Robots: Towards the Era of Large Language Models
by: Xia, Xingyu, et al.
Published: (2026)
Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces
by: Du, Jiayuan, et al.
Published: (2026)
Trajectory Planning and Tracking of Hybrid Flying-Crawling Quadrotors
by: Hu, Dongnan, et al.
Published: (2023)
Adaptive Denoising-Enhanced LiDAR Odometry for Degeneration Resilience in Diverse Terrains
by: Ji, Mazeyu, et al.
Published: (2023)
Kinematics-Aware Multi-Policy Reinforcement Learning for Force-Capable Humanoid Loco-Manipulation
by: Xiao, Kaiyan, et al.
Published: (2025)
Temporal-Guided Visual Foundation Models for Event-Based Vision
by: Xia, Ruihao, et al.
Published: (2025)
ECHO: Continuous Hierarchical Memory for Vision-Language-Action Models
by: Hu, Yanbin, et al.
Published: (2026)
Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments
by: Li, Zhiyuan, et al.
Published: (2024)
CLASH: Collision Learning via Augmented Sim-to-real Hybridization to Bridge the Reality Gap
by: He, Haotian, et al.
Published: (2026)
Vision-Language Navigation with Continual Learning
by: Li, Zhiyuan, et al.
Published: (2024)
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
by: Wang, Zihan, et al.
Published: (2024)
VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation
by: Lin, Sihao, et al.
Published: (2025)
A Deployable Embodied Vision-Language Navigation System with Hierarchical Cognition and Context-Aware Exploration
by: Xu, Kuan, et al.
Published: (2026)
LaViRA: Language-Vision-Robot Actions Translation for Zero-Shot Vision Language Navigation in Continuous Environments
by: Ding, Hongyu, et al.
Published: (2025)
SeqWalker: Sequential-Horizon Vision-and-Language Navigation with Hierarchical Planning
by: Han, Zebin, et al.
Published: (2026)
Does Peer Observation Help? Vision-Sharing Collaboration for Vision-Language Navigation
by: Jin, Qunchao, et al.
Published: (2026)
DeCoNav: Dialog enhanced Long-Horizon Collaborative Vision-Language Navigation
by: Zhou, Sunyao, et al.
Published: (2026)
IndoorUAV: Benchmarking Vision-Language UAV Navigation in Continuous Indoor Environments
by: Liu, Xu, et al.
Published: (2025)
CL-CoTNav: Closed-Loop Hierarchical Chain-of-Thought for Zero-Shot Object-Goal Navigation with Vision-Language Models
by: Cai, Yuxin, et al.
Published: (2025)
ImagineUAV: Aerial Vision-Language Navigation via World-Action Modeling and Kinodynamic Planning
by: Liu, Xuchen, et al.
Published: (2026)
Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation
by: Fang, Xiang, et al.
Published: (2026)
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
by: An, Dong, et al.
Published: (2023)
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)
FSR-VLN: Fast and Slow Reasoning for Vision-Language Navigation with Hierarchical Multi-modal Scene Graph
by: Zhou, Xiaolin, et al.
Published: (2025)
SFCo-Nav: Efficient Zero-Shot Visual Language Navigation via Collaboration of Slow LLM and Fast Attributed Graph Alignment
by: Xiong, Chaoran, et al.
Published: (2026)
HTNav: A Hybrid Navigation Framework with Tiered Structure for Urban Aerial Vision-and-Language Navigation
by: Fan, Chengjie, et al.
Published: (2026)
NavForesee: A Unified Vision-Language World Model for Hierarchical Planning and Dual-Horizon Navigation Prediction
by: Liu, Fei, et al.
Published: (2025)
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
by: Zhu, Minghao, et al.
Published: (2024)
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory
by: Zhang, Weichen, et al.
Published: (2025)