| Main Authors: | Hu, Ning, Cao, Senhao, Li, Maochen |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08466 |
Similar Items
System-Level Error Propagation and Tail-Risk Amplification in Reference-Based Robotic Navigation
by: Hu, Ning, et al.
Published: (2026)
Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
by: Lee, Seungjae, et al.
Published: (2025)
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)
RAM-NAS: Resource-aware Multiobjective Neural Architecture Search Method for Robot Vision Tasks
by: Mao, Shouren, et al.
Published: (2025)
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment
by: Liu, Kangcheng, et al.
Published: (2023)
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)
From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation
by: Li, Yajie, et al.
Published: (2026)
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment
by: Nie, Dujun, et al.
Published: (2026)
SpikePingpong: Spike Vision-based Fast-Slow Pingpong Robot System
by: Wang, Hao, et al.
Published: (2025)
Uncertainty-aware Semantic Mapping in Off-road Environments with Dempster-Shafer Theory of Evidence
by: Kim, Junyoung, et al.
Published: (2024)
Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference
by: Kim, Junyoung, et al.
Published: (2024)
TwinAligner: Visual-Dynamic Alignment Empowers Physics-aware Real2Sim2Real for Robotic Manipulation
by: Fan, Hongwei, et al.
Published: (2025)
Learning High-Fidelity Robot Self-Model with Articulated 3D Gaussian Splatting
by: Hu, Kejun, et al.
Published: (2025)
ROSA: Harnessing Robot States for Vision-Language and Action Alignment
by: Wen, Yuqing, et al.
Published: (2025)
AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
by: Guo, Wenxuan, et al.
Published: (2026)
UFO: Uncertainty-aware LiDAR-image Fusion for Off-road Semantic Terrain Map Estimation
by: Kim, Ohn, et al.
Published: (2024)
UAOR: Uncertainty-aware Observation Reinjection for Vision-Language-Action Models
by: Yang, Jiabing, et al.
Published: (2026)
Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
by: Fang, Irving, et al.
Published: (2025)
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)
Semantic-Aware Particle Filter for Reliable Vineyard Robot Localisation
by: de Silva, Rajitha, et al.
Published: (2025)
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
by: Han, Xiaofeng, et al.
Published: (2025)
Ensuring Force Safety in Vision-Guided Robotic Manipulation via Implicit Tactile Calibration
by: Wei, Lai, et al.
Published: (2024)
Vibration-Based Energy Metric for Restoring Needle Alignment in Autonomous Robotic Ultrasound
by: Chen, Zhongyu, et al.
Published: (2025)
Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments
by: Simons, Cody, et al.
Published: (2024)
Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
by: Ma, Teli, et al.
Published: (2024)
UAV-VLN: End-to-End Vision Language guided Navigation for UAVs
by: Saxena, Pranav, et al.
Published: (2025)
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World
by: Mao, Weixin, et al.
Published: (2024)
A Touch, Vision, and Language Dataset for Multimodal Alignment
by: Fu, Letian, et al.
Published: (2024)
Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array
by: Chen, Yitong, et al.
Published: (2025)
MineInsight: A Multi-sensor Dataset for Humanitarian Demining Robotics in Off-Road Environments
by: Malizia, Mario, et al.
Published: (2025)
Continuous Vision-Language-Action Co-Learning with Semantic-Physical Alignment for Behavioral Cloning
by: Qi, Xiuxiu, et al.
Published: (2025)
RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph
by: Liu, Yifan, et al.
Published: (2025)
ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics
by: Yu, Qiaojun, et al.
Published: (2024)
Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation
by: Jiang, Zebin, et al.
Published: (2025)
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
by: Lin, Tao, et al.
Published: (2025)
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
by: Huang, Haifeng, et al.
Published: (2025)
Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)
by: Boros, Emanuela
Published: (2025)
What Matters in Building Vision-Language-Action Models for Generalist Robots
by: Li, Xinghang, et al.
Published: (2024)
Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation
by: Wang, Guokang, et al.
Published: (2024)