:: Library Catalog

Bibliographic Details
Main Authors:	Zhu, Haokun; Li, Zongtai; Liu, Zhixuan; Wang, Wenshan; Zhang, Ji; Francis, Jonathan; Oh, Jean
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2505.06729

Similar Items

SysNav: Multi-Level Systematic Cooperation Enables Real-World, Cross-Embodiment Object Navigation
by: Zhu, Haokun, et al.
Published: (2026)

Goal2Pixel: Grounding Goals to Pixels for Vision-Language Navigation
by: Bao, Muyi, et al.
Published: (2026)

RoPotter: Toward Robotic Pottery and Deformable Object Manipulation with Structural Priors
by: Yoo, Uksang, et al.
Published: (2024)

AirHunt: Bridging VLM Semantics and Continuous Planning for Efficient Aerial Object Navigation
by: Chen, Xuecheng, et al.
Published: (2026)

Think, Remember, Navigate: Zero-Shot Object-Goal Navigation with VLM-Powered Reasoning
by: Habibpour, Mobin, et al.
Published: (2025)

GoalVLM: VLM-driven Object Goal Navigation for Multi-Agent System
by: James, MoniJesu, et al.
Published: (2026)

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection
by: Peng, Cheng, et al.
Published: (2025)

VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
by: Zhang, Haochen, et al.
Published: (2024)

LongComp: Long-Tail Compositional Zero-Shot Generalization for Robust Trajectory Prediction
by: Stoler, Benjamin, et al.
Published: (2025)

TagaVLM: Topology-Aware Global Action Reasoning for Vision-Language Navigation
by: Liu, Jiaxing, et al.
Published: (2026)

Interactive-FAR: Interactive, Fast and Adaptable Routing for Navigation Among Movable Obstacles in Complex Unknown Environments
by: He, Botao, et al.
Published: (2024)

Language as Cost: Proactive Hazard Mapping using VLM for Robot Navigation
by: Oh, Mintaek, et al.
Published: (2025)

Reasoning about the Unseen for Efficient Outdoor Object Navigation
by: Xie, Quanting, et al.
Published: (2023)

MOSAIC: Generating Consistent, Privacy-Preserving Scenes from Multiple Depth Views in Multi-Room Environments
by: Liu, Zhixuan, et al.
Published: (2025)

RCG: Safety-Critical Scenario Generation for Robust Autonomous Driving via Real-World Crash Grounding
by: Stoler, Benjamin, et al.
Published: (2025)

KineSoft: Learning Proprioceptive Manipulation Policies with Soft Robot Hands
by: Yoo, Uksang, et al.
Published: (2025)

SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
by: Zantout, Nader, et al.
Published: (2025)

ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation
by: Bansal, Shruti, et al.
Published: (2025)

User-Centric Object Navigation: A Benchmark with Integrated User Habits for Personalized Embodied Object Search
by: Wang, Hongcheng, et al.
Published: (2026)

Contact-Aware Motion Planning Among Movable Objects
by: Wang, Haokun, et al.
Published: (2025)

VLM-Empowered Multi-Mode System for Efficient and Safe Planetary Navigation
by: Cheng, Sinuo, et al.
Published: (2025)

Rethinking Intermediate Representation for VLM-based Robot Manipulation
by: Tang, Weiliang, et al.
Published: (2025)

SoraNav: Adaptive UAV Task-Centric Navigation via Zeroshot VLM Reasoning
by: Song, Hongyu, et al.
Published: (2025)

FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning
by: Yokoyama, Naoki, et al.
Published: (2025)

Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning
by: Wang, Zixuan, et al.
Published: (2026)

GRAPPA: Generalizing and Adapting Robot Policies via Online Agentic Guidance
by: Bucker, Arthur, et al.
Published: (2024)

SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation
by: Stoler, Benjamin, et al.
Published: (2024)

T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping
by: Fei, Xin, et al.
Published: (2025)

Bridging VLM and KMP: Enabling Fine-grained robotic manipulation via Semantic Keypoints Representation
by: Zhu, Junjie, et al.
Published: (2025)

COVLM-RL: Critical Object-Oriented Reasoning for Autonomous Driving Using VLM-Guided Reinforcement Learning
by: Li, Lin, et al.
Published: (2025)

STRIVE: Structured Reasoning for Self-Improvement in Claim Verification
by: Gong, Haisong, et al.
Published: (2025)

MOSAIC: Learning Unified Multi-Sensory Object Property Representations for Robot Learning via Interactive Perception
by: Tatiya, Gyan, et al.
Published: (2023)

Semantic Environment Atlas for Object-Goal Navigation
by: Kim, Nuri, et al.
Published: (2024)

EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
by: Yang, Zebin, et al.
Published: (2025)

UniDiffGrasp: A Unified Framework Integrating VLM Reasoning and VLM-Guided Part Diffusion for Open-Vocabulary Constrained Grasping with Dual Arms
by: Guo, Xueyang, et al.
Published: (2025)

SwarmVLM: VLM-Guided Impedance Control for Autonomous Navigation of Heterogeneous Robots in Dynamic Warehousing
by: Zafar, Malaika, et al.
Published: (2025)

CoINS: Counterfactual Interactive Navigation via Skill-Aware VLM
by: Zhou, Kangjie, et al.
Published: (2026)

TPS-Drive: Task-Guided Representation Purification for VLM-based Autonomous Driving
by: Li, Jiaxiang, et al.
Published: (2026)

Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
by: Lin, Mengying, et al.
Published: (2024)

DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory
by: Ji, Zihe, et al.
Published: (2025)