| Main Authors: | Gu, Junwen, Wu, Zhiheng, Si, Pengxuan, Qiu, Shuang, Zhang, Zhentao, Feng, Yukai, Sun, Luoyang, Luo, Laien, Yu, Lianyi, Wang, Jian, Wu, Zhengxing |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.07869 |
Similar Items
M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit
by: Feng, Yukai, et al.
Published: (2026)
PriorVLA: Prior-Preserving Adaptation for Vision-Language-Action Models
by: Guo, Xinyu, et al.
Published: (2026)
SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology
by: Wu, Dongli, et al.
Published: (2025)
ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning
by: Yang, Yandan, et al.
Published: (2026)
LaST$_{0}$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
by: Liu, Zhuoyang, et al.
Published: (2026)
ALAM: Algebraically Consistent Latent Action Model for Vision-Language-Action Models
by: Tang, Zuojin, et al.
Published: (2026)
Continually Evolving Skill Knowledge in Vision Language Action Model
by: Wu, Yuxuan, et al.
Published: (2025)
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
by: Cai, Rui, et al.
Published: (2026)
Absence of charged pion condensation in a magnetic field with parallel rotation
by: Bai, Puyuan, et al.
Published: (2025)
Letter to “Etiologies and clinical characteristics of primary amenorrhea: A study from a quaternary care hospital in southern Thailand”
by: Lianyi Bao, et al.
Published: (2025)
UnderwaterVLA: Dual-brain Vision-Language-Action architecture for Autonomous Underwater Navigation
by: Wang, Zhangyuan, et al.
Published: (2025)
Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation
by: Xiao, Junjin, et al.
Published: (2026)
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
by: Wu, Ziheng, et al.
Published: (2025)
Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models
by: Luo, Yulin, et al.
Published: (2026)
End-To-End Underwater Video Enhancement: Dataset and Model
by: Du, Dazhao, et al.
Published: (2024)
Nested-TNT: Hierarchical Vision Transformers with Multi-Scale Feature Processing
by: Liu, Yuang, et al.
Published: (2024)
Global well-posedness of the defocusing, cubic nonlinear wave equation outside of the ball with radial data
by: Xu, Guixiang, et al.
Published: (2024)
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)
See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation
by: Dai, Tingjun, et al.
Published: (2026)
Uncertainty Aware Mapping for Vision-Based Underwater Robots
by: Bhowmik, Abhimanyu, et al.
Published: (2025)
$τ_0$-WM: A Unified Video-Action World Model for Robotic Manipulation
by: Zhou, Pengfei, et al.
Published: (2026)
Diver-Robot Communication Dataset for Underwater Hand Gesture Recognition
by: Kvasić, Igor, et al.
Published: (2025)
Vision-Language-Action in Robotics: A Survey of Datasets, Benchmarks, and Data Engines
by: Wang, Ziyao, et al.
Published: (2026)
Accuracy, Efficiency, and Patient‐ and Surgeon‐Reported Outcomes of Static Versus Robotic Computer‐Assisted Implant Surgery: A Randomized Clinical Trial
by: Zhilin Luo, et al.
Published: (2026)
ROSA: Harnessing Robot States for Vision-Language and Action Alignment
by: Wen, Yuqing, et al.
Published: (2025)
Spatiotemporal Calibration of Doppler Velocity Logs for Underwater Robots
by: Zhao, Hongxu, et al.
Published: (2025)
HiViS: Hiding Visual Tokens from the Drafter for Speculative Decoding in Vision-Language Models
by: Xie, Zhinan, et al.
Published: (2025)
Unified Description for Reentrance and Tc Enhancement in Ferromagnetic Superconductors
by: Wang, Xusheng, et al.
Published: (2025)
Zooplankton diel vertical migration enhances carbon export via distinct mechanisms in a warming North Pacific
by: Chenying Guo, et al.
Published: (2026)
Underwater Robotic Simulators Review for Autonomous System Development
by: Aldhaheri, Sara, et al.
Published: (2025)
A Sonar-Visual Dataset for Cross-Modal Underwater Robot Perception
by: Chen, Weitung, et al.
Published: (2026)
Will technological innovation uncertainty affect the distribution of benefits from low‐carbon innovation activities in industrial clusters?—A study based on gray Shapley values
by: Xi Tang, et al.
Published: (2024)
Propagating Unsafe Actions in LLM Controlled Multi-Robot Collaboration via Single Robot Compromise
by: Huang, Zhen, et al.
Published: (2026)
CKMImageNet: A Comprehensive Dataset to Enable Channel Knowledge Map Construction via Computer Vision
by: Wu, Di, et al.
Published: (2024)
Monochromatic polynomial sumset structures on $\mathbb{N}$
by: Lian, Zhengxing, et al.
Published: (2024)
Assessing Vision-Language Models for Perception in Autonomous Underwater Robotic Software
by: Yousaf, Muhammad, et al.
Published: (2026)
Performance Prediction and Optimization of Single‐Piston Free Piston Expander‐Linear Generator Based on Machine Learning and Genetic Algorithm
by: Jian Li, et al.
Published: (2024)
Conversational Disease Diagnosis via External Planner-Controlled Large Language Models
by: Sun, Zhoujian, et al.
Published: (2024)
ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge
by: Dai, Yuntao, et al.
Published: (2025)
VLA-Pro: Cross-Task Procedural Memory Transfer for Vision-Language-Action Models
by: Si, Shengyu, et al.
Published: (2026)