Saved in:
| Main Authors: | Zhu, Haokun, Li, Zongtai, Liu, Zhixuan, Wang, Wenshan, Zhang, Ji, Francis, Jonathan, Oh, Jean |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.06729 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SysNav: Multi-Level Systematic Cooperation Enables Real-World, Cross-Embodiment Object Navigation
by: Zhu, Haokun, et al.
Published: (2026)
by: Zhu, Haokun, et al.
Published: (2026)
Goal2Pixel: Grounding Goals to Pixels for Vision-Language Navigation
by: Bao, Muyi, et al.
Published: (2026)
by: Bao, Muyi, et al.
Published: (2026)
RoPotter: Toward Robotic Pottery and Deformable Object Manipulation with Structural Priors
by: Yoo, Uksang, et al.
Published: (2024)
by: Yoo, Uksang, et al.
Published: (2024)
AirHunt: Bridging VLM Semantics and Continuous Planning for Efficient Aerial Object Navigation
by: Chen, Xuecheng, et al.
Published: (2026)
by: Chen, Xuecheng, et al.
Published: (2026)
Think, Remember, Navigate: Zero-Shot Object-Goal Navigation with VLM-Powered Reasoning
by: Habibpour, Mobin, et al.
Published: (2025)
by: Habibpour, Mobin, et al.
Published: (2025)
GoalVLM: VLM-driven Object Goal Navigation for Multi-Agent System
by: James, MoniJesu, et al.
Published: (2026)
by: James, MoniJesu, et al.
Published: (2026)
PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
by: Peng, Cheng, et al.
Published: (2025)
by: Peng, Cheng, et al.
Published: (2025)
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
by: Zhang, Haochen, et al.
Published: (2024)
by: Zhang, Haochen, et al.
Published: (2024)
LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction
by: Stoler, Benjamin, et al.
Published: (2025)
by: Stoler, Benjamin, et al.
Published: (2025)
TagaVLM: Topology-Aware Global Action Reasoning for Vision-Language Navigation
by: Liu, Jiaxing, et al.
Published: (2026)
by: Liu, Jiaxing, et al.
Published: (2026)
Interactive-FAR:Interactive, Fast and Adaptable Routing for Navigation Among Movable Obstacles in Complex Unknown Environments
by: He, Botao, et al.
Published: (2024)
by: He, Botao, et al.
Published: (2024)
Language as Cost: Proactive Hazard Mapping using VLM for Robot Navigation
by: Oh, Mintaek, et al.
Published: (2025)
by: Oh, Mintaek, et al.
Published: (2025)
Reasoning about the Unseen for Efficient Outdoor Object Navigation
by: Xie, Quanting, et al.
Published: (2023)
by: Xie, Quanting, et al.
Published: (2023)
MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
by: Liu, Zhixuan, et al.
Published: (2025)
by: Liu, Zhixuan, et al.
Published: (2025)
RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding
by: Stoler, Benjamin, et al.
Published: (2025)
by: Stoler, Benjamin, et al.
Published: (2025)
KineSoft: Learning Proprioceptive Manipulation Policies with Soft Robot Hands
by: Yoo, Uksang, et al.
Published: (2025)
by: Yoo, Uksang, et al.
Published: (2025)
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
by: Zantout, Nader, et al.
Published: (2025)
by: Zantout, Nader, et al.
Published: (2025)
ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation
by: Bansal, Shruti, et al.
Published: (2025)
by: Bansal, Shruti, et al.
Published: (2025)
User-Centric Object Navigation: A Benchmark with Integrated User Habits for Personalized Embodied Object Search
by: Wang, Hongcheng, et al.
Published: (2026)
by: Wang, Hongcheng, et al.
Published: (2026)
Contact-Aware Motion Planning Among Movable Objects
by: Wang, Haokun, et al.
Published: (2025)
by: Wang, Haokun, et al.
Published: (2025)
VLM-Empowered Multi-Mode System for Efficient and Safe Planetary Navigation
by: Cheng, Sinuo, et al.
Published: (2025)
by: Cheng, Sinuo, et al.
Published: (2025)
Rethinking Intermediate Representation for VLM-based Robot Manipulation
by: Tang, Weiliang, et al.
Published: (2025)
by: Tang, Weiliang, et al.
Published: (2025)
SoraNav: Adaptive UAV Task-Centric Navigation via Zeroshot VLM Reasoning
by: Song, Hongyu, et al.
Published: (2025)
by: Song, Hongyu, et al.
Published: (2025)
FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning
by: Yokoyama, Naoki, et al.
Published: (2025)
by: Yokoyama, Naoki, et al.
Published: (2025)
Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
by: Wang, Zixuan, et al.
Published: (2026)
by: Wang, Zixuan, et al.
Published: (2026)
GRAPPA: Generalizing and Adapting Robot Policies via Online Agentic Guidance
by: Bucker, Arthur, et al.
Published: (2024)
by: Bucker, Arthur, et al.
Published: (2024)
SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation
by: Stoler, Benjamin, et al.
Published: (2024)
by: Stoler, Benjamin, et al.
Published: (2024)
T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping
by: Fei, Xin, et al.
Published: (2025)
by: Fei, Xin, et al.
Published: (2025)
Bridging VLM and KMP: Enabling Fine-grained robotic manipulation via Semantic Keypoints Representation
by: Zhu, Junjie, et al.
Published: (2025)
by: Zhu, Junjie, et al.
Published: (2025)
COVLM-RL: Critical Object-Oriented Reasoning for Autonomous Driving Using VLM-Guided Reinforcement Learning
by: Li, Lin, et al.
Published: (2025)
by: Li, Lin, et al.
Published: (2025)
STRIVE: Structured Reasoning for Self-Improvement in Claim Verification
by: Gong, Haisong, et al.
Published: (2025)
by: Gong, Haisong, et al.
Published: (2025)
MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception
by: Tatiya, Gyan, et al.
Published: (2023)
by: Tatiya, Gyan, et al.
Published: (2023)
Semantic Environment Atlas for Object-Goal Navigation
by: Kim, Nuri, et al.
Published: (2024)
by: Kim, Nuri, et al.
Published: (2024)
EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
by: Yang, Zebin, et al.
Published: (2025)
by: Yang, Zebin, et al.
Published: (2025)
UniDiffGrasp: A Unified Framework Integrating VLM Reasoning and VLM-Guided Part Diffusion for Open-Vocabulary Constrained Grasping with Dual Arms
by: Guo, Xueyang, et al.
Published: (2025)
by: Guo, Xueyang, et al.
Published: (2025)
SwarmVLM: VLM-Guided Impedance Control for Autonomous Navigation of Heterogeneous Robots in Dynamic Warehousing
by: Zafar, Malaika, et al.
Published: (2025)
by: Zafar, Malaika, et al.
Published: (2025)
CoINS: Counterfactual Interactive Navigation via Skill-Aware VLM
by: Zhou, Kangjie, et al.
Published: (2026)
by: Zhou, Kangjie, et al.
Published: (2026)
TPS-Drive: Task-Guided Representation Purification for VLM-based Autonomous Driving
by: Li, Jiaxiang, et al.
Published: (2026)
by: Li, Jiaxiang, et al.
Published: (2026)
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
by: Lin, Mengying, et al.
Published: (2024)
by: Lin, Mengying, et al.
Published: (2024)
DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory
by: Ji, Zihe, et al.
Published: (2025)
by: Ji, Zihe, et al.
Published: (2025)
Similar Items
-
SysNav: Multi-Level Systematic Cooperation Enables Real-World, Cross-Embodiment Object Navigation
by: Zhu, Haokun, et al.
Published: (2026) -
Goal2Pixel: Grounding Goals to Pixels for Vision-Language Navigation
by: Bao, Muyi, et al.
Published: (2026) -
RoPotter: Toward Robotic Pottery and Deformable Object Manipulation with Structural Priors
by: Yoo, Uksang, et al.
Published: (2024) -
AirHunt: Bridging VLM Semantics and Continuous Planning for Efficient Aerial Object Navigation
by: Chen, Xuecheng, et al.
Published: (2026) -
Think, Remember, Navigate: Zero-Shot Object-Goal Navigation with VLM-Powered Reasoning
by: Habibpour, Mobin, et al.
Published: (2025)