:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hu, Ning, Cao, Senhao, Li, Maochen
Format:	Preprint
Published:	2026
Subjects:	Robotics Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.08466
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

System-Level Error Propagation and Tail-Risk Amplification in Reference-Based Robotic Navigation
by: Hu, Ning, et al.
Published: (2026)

Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
by: Lee, Seungjae, et al.
Published: (2025)

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)

RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks
by: Mao, Shouren, et al.
Published: (2025)

Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment
by: Liu, Kangcheng, et al.
Published: (2023)

Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)

From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation
by: Li, Yajie, et al.
Published: (2026)

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment
by: Nie, Dujun, et al.
Published: (2026)

SpikePingpong: Spike Vision-based Fast-Slow Pingpong Robot System
by: Wang, Hao, et al.
Published: (2025)

Uncertainty-aware Semantic Mapping in Off-road Environments with Dempster-Shafer Theory of Evidence
by: Kim, Junyoung, et al.
Published: (2024)

Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference
by: Kim, Junyoung, et al.
Published: (2024)

TwinAligner: Visual-Dynamic Alignment Empowers Physics-aware Real2Sim2Real for Robotic Manipulation
by: Fan, Hongwei, et al.
Published: (2025)

Learning High-Fidelity Robot Self-Model with Articulated 3D Gaussian Splatting
by: Hu, Kejun, et al.
Published: (2025)

ROSA: Harnessing Robot States for Vision-Language and Action Alignment
by: Wen, Yuqing, et al.
Published: (2025)

AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
by: Guo, Wenxuan, et al.
Published: (2026)

UFO: Uncertainty-aware LiDAR-image Fusion for Off-road Semantic Terrain Map Estimation
by: Kim, Ohn, et al.
Published: (2024)

UAOR: Uncertainty-aware Observation Reinjection for Vision-Language-Action Models
by: Yang, Jiabing, et al.
Published: (2026)

Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)

From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
by: Fang, Irving, et al.
Published: (2025)

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)

Semantic-Aware Particle Filter for Reliable Vineyard Robot Localisation
by: de Silva, Rajitha, et al.
Published: (2025)

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
by: Han, Xiaofeng, et al.
Published: (2025)

Ensuring Force Safety in Vision-Guided Robotic Manipulation via Implicit Tactile Calibration
by: Wei, Lai, et al.
Published: (2024)

Vibration-Based Energy Metric for Restoring Needle Alignment in Autonomous Robotic Ultrasound
by: Chen, Zhongyu, et al.
Published: (2025)

Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments
by: Simons, Cody, et al.
Published: (2024)

Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
by: Ma, Teli, et al.
Published: (2024)

UAV-VLN: End-to-End Vision Language guided Navigation for UAVs
by: Saxena, Pranav, et al.
Published: (2025)

RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World
by: Mao, Weixin, et al.
Published: (2024)

A Touch, Vision, and Language Dataset for Multimodal Alignment
by: Fu, Letian, et al.
Published: (2024)

Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array
by: Chen, Yitong, et al.
Published: (2025)

MineInsight: A Multi-sensor Dataset for Humanitarian Demining Robotics in Off-Road Environments
by: Malizia, Mario, et al.
Published: (2025)

Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning
by: Qi, Xiuxiu, et al.
Published: (2025)

RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph
by: Liu, Yifan, et al.
Published: (2025)

ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics
by: Yu, Qiaojun, et al.
Published: (2024)

Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation
by: Jiang, Zebin, et al.
Published: (2025)

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
by: Lin, Tao, et al.
Published: (2025)

RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
by: Huang, Haifeng, et al.
Published: (2025)

Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)
by: Boros, Emanuela
Published: (2025)

What Matters in Building Vision-Language-Action Models for Generalist Robots
by: Li, Xinghang, et al.
Published: (2024)

Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation
by: Wang, Guokang, et al.
Published: (2024)