:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gu, Junwen, Wu, Zhiheng, Si, Pengxuan, Qiu, Shuang, Zhang, Zhentao, Feng, Yukai, Sun, Luoyang, Luo, Laien, Yu, Lianyi, Wang, Jian, Wu, Zhengxing
Format:	Preprint
Published:	2025
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2510.07869
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit
by: Feng, Yukai, et al.
Published: (2026)

PriorVLA: Prior-Preserving Adaptation for Vision-Language-Action Models
by: Guo, Xinyu, et al.
Published: (2026)

SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology
by: Wu, Dongli, et al.
Published: (2025)

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning
by: Yang, Yandan, et al.
Published: (2026)

LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
by: Liu, Zhuoyang, et al.
Published: (2026)

ALAM: Algebraically Consistent Latent Action Model for Vision-Language-Action Models
by: Tang, Zuojin, et al.
Published: (2026)

Continually Evolving Skill Knowledge in Vision Language Action Model
by: Wu, Yuxuan, et al.
Published: (2025)

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
by: Cai, Rui, et al.
Published: (2026)

Absence of charged pion condensation in a magnetic field with parallel rotation
by: Bai, Puyuan, et al.
Published: (2025)

Letter to “Etiologies and clinical characteristics of primary amenorrhea: A study from a quaternary care hospital in southern Thailand”
by: Lianyi Bao, et al.
Published: (2025)

UnderwaterVLA: Dual-brain Vision-Language-Action architecture for Autonomous Underwater Navigation
by: Wang, Zhangyuan, et al.
Published: (2025)

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation
by: Xiao, Junjin, et al.
Published: (2026)

Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
by: Wu, Ziheng, et al.
Published: (2025)

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models
by: Luo, Yulin, et al.
Published: (2026)

End-To-End Underwater Video Enhancement: Dataset and Model
by: Du, Dazhao, et al.
Published: (2024)

Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing
by: Liu, Yuang, et al.
Published: (2024)

Global well-posedness of the defocusing, cubic nonlinear wave equation outside of the ball with radial data
by: Xu, Guixiang, et al.
Published: (2024)

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation
by: Dai, Tingjun, et al.
Published: (2026)

Uncertainty Aware Mapping for Vision-Based Underwater Robots
by: Bhowmik, Abhimanyu, et al.
Published: (2025)

$τ_0$-WM: A Unified Video-Action World Model for Robotic Manipulation
by: Zhou, Pengfei, et al.
Published: (2026)

Diver-Robot Communication Dataset for Underwater Hand Gesture Recognition
by: Kvasić, Igor, et al.
Published: (2025)

Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines
by: Wang, Ziyao, et al.
Published: (2026)

Accuracy, Efficiency, and Patient‐ and Surgeon‐Reported Outcomes of Static Versus Robotic Computer‐Assisted Implant Surgery: A Randomized Clinical Trial
by: Zhilin Luo, et al.
Published: (2026)

ROSA: Harnessing Robot States for Vision-Language and Action Alignment
by: Wen, Yuqing, et al.
Published: (2025)

Spatiotemporal Calibration of Doppler Velocity Logs for Underwater Robots
by: Zhao, Hongxu, et al.
Published: (2025)

HiViS: Hiding Visual Tokens from the Drafter for Speculative Decoding in Vision-Language Models
by: Xie, Zhinan, et al.
Published: (2025)

Unified Description for Reentrance and Tc Enhancement in Ferromagnetic Superconductors
by: Wang, Xusheng, et al.
Published: (2025)

Zooplankton diel vertical migration enhances carbon export via distinct mechanisms in a warming North Pacific
by: Chenying Guo, et al.
Published: (2026)

Underwater Robotic Simulators Review for Autonomous System Development
by: Aldhaheri, Sara, et al.
Published: (2025)

A Sonar-Visual Dataset for Cross-Modal Underwater Robot Perception
by: Chen, Weitung, et al.
Published: (2026)

Will technological innovation uncertainty affect the distribution of benefits from low‐carbon innovation activities in industrial clusters?—A study based on gray Shapley values
by: Xi Tang, et al.
Published: (2024)

Propagating Unsafe Actions in LLM Controlled Multi-Robot Collaboration via Single Robot Compromise
by: Huang, Zhen, et al.
Published: (2026)

CKMImageNet: A Comprehensive Dataset to Enable Channel Knowledge Map Construction via Computer Vision
by: Wu, Di, et al.
Published: (2024)

Monochromatic polynomial sumset structures on $\mathbb{N}$
by: Lian, Zhengxing, et al.
Published: (2024)

Assessing Vision-Language Models for Perception in Autonomous Underwater Robotic Software
by: Yousaf, Muhammad, et al.
Published: (2026)

Performance Prediction and Optimization of Single‐Piston Free Piston Expander‐Linear Generator Based on Machine Learning and Genetic Algorithm
by: Jian Li, et al.
Published: (2024)

Conversational Disease Diagnosis via External Planner-Controlled Large Language Models
by: Sun, Zhoujian, et al.
Published: (2024)

ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge
by: Dai, Yuntao, et al.
Published: (2025)

VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
by: Si, Shengyu, et al.
Published: (2026)