Saved in:
| Main Authors: | Jin, Piaopiao, Wang, Qi, Sun, Guokang, Cai, Ziwen, He, Pinjia, You, Yangwei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.13774 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
by: Yang, Yifan, et al.
Published: (2025)
by: Yang, Yifan, et al.
Published: (2025)
DyDexHandover: Human-like Bimanual Dynamic Dexterous Handover using RGB-only Perception
by: Zhou, Haoran, et al.
Published: (2025)
by: Zhou, Haoran, et al.
Published: (2025)
VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts
by: Jiang, Yuhua, et al.
Published: (2026)
by: Jiang, Yuhua, et al.
Published: (2026)
TacRefineNet: Tactile-Only Grasp Refinement Between Arbitrary In-Hand Object Poses
by: Wang, Shuaijun, et al.
Published: (2025)
by: Wang, Shuaijun, et al.
Published: (2025)
What to Ignore, What to React: Visually Robust RL Fine-Tuning of VLA Models
by: Peng, Yuanfang, et al.
Published: (2026)
by: Peng, Yuanfang, et al.
Published: (2026)
ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning
by: Van Vo, Tuan, et al.
Published: (2025)
by: Van Vo, Tuan, et al.
Published: (2025)
World-VLA-Loop: Closed-Loop Learning of Video World Model and VLA Policy
by: Liu, Xiaokang, et al.
Published: (2026)
by: Liu, Xiaokang, et al.
Published: (2026)
STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models
by: Xu, Feng, et al.
Published: (2025)
by: Xu, Feng, et al.
Published: (2025)
Towards Long-Lived Robots: Continual Learning VLA Models via Reinforcement Fine-Tuning
by: Liu, Yuan, et al.
Published: (2026)
by: Liu, Yuan, et al.
Published: (2026)
BSCAL: Gesture Recognition Network Based on Dual‐Stream Information Fusion of EMG and IMU Signals
by: Yindi Wang, et al.
Published: (2025)
by: Yindi Wang, et al.
Published: (2025)
RationalVLA: A Rational Vision-Language-Action Model with Dual System
by: Song, Wenxuan, et al.
Published: (2025)
by: Song, Wenxuan, et al.
Published: (2025)
Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA
by: Wang, Zihua, et al.
Published: (2026)
by: Wang, Zihua, et al.
Published: (2026)
BioProVLA-Agent: An Affordable, Protocol-Driven, Vision-Enhanced VLA-Enabled Embodied Multi-Agent System with Closed-Loop-Capable Reasoning for Biological Laboratory Manipulation
by: Du, Zhaohui, et al.
Published: (2026)
by: Du, Zhaohui, et al.
Published: (2026)
CRAFT: Adapting VLA Models to Contact-rich Manipulation via Force-aware Curriculum Fine-tuning
by: Zhang, Yike, et al.
Published: (2026)
by: Zhang, Yike, et al.
Published: (2026)
CollabVLA: Self-Reflective Vision-Language-Action Model Dreaming Together with Human
by: Sun, Nan, et al.
Published: (2025)
by: Sun, Nan, et al.
Published: (2025)
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
State Estimation Transformers for Agile Legged Locomotion
by: Yu, Chen, et al.
Published: (2024)
by: Yu, Chen, et al.
Published: (2024)
Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion
by: Li, Zhuo, et al.
Published: (2025)
by: Li, Zhuo, et al.
Published: (2025)
TIDAL: Temporally Interleaved Diffusion and Action Loop for High-Frequency VLA Control
by: Sun, Yuteng, et al.
Published: (2026)
by: Sun, Yuteng, et al.
Published: (2026)
Sentinel-VLA: A Metacognitive VLA Model with Active Status Monitoring for Dynamic Reasoning and Error Recovery
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
Decomposed Object Manipulation via Dual-Actor Policy
by: Fan, Bin, et al.
Published: (2025)
by: Fan, Bin, et al.
Published: (2025)
EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy
by: Ng, Chi Kit, et al.
Published: (2025)
by: Ng, Chi Kit, et al.
Published: (2025)
Towards Accessible Physical AI: LoRA-Based Fine-Tuning of VLA Models for Real-World Robot Control
by: Omaisan, Abdullah Yahya Abdullah, et al.
Published: (2025)
by: Omaisan, Abdullah Yahya Abdullah, et al.
Published: (2025)
ReFineVLA: Multimodal Reasoning-Aware Generalist Robotic Policies via Teacher-Guided Fine-Tuning
by: Van Vo, Tuan, et al.
Published: (2026)
by: Van Vo, Tuan, et al.
Published: (2026)
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
by: Sun, Jingwen, et al.
Published: (2026)
by: Sun, Jingwen, et al.
Published: (2026)
Libra-VLA: Achieving Learning Equilibrium via Asynchronous Coarse-to-Fine Dual-System
by: Wei, Yifei, et al.
Published: (2026)
by: Wei, Yifei, et al.
Published: (2026)
FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies
by: Hu, Xintong, et al.
Published: (2026)
by: Hu, Xintong, et al.
Published: (2026)
Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends
by: Xiang, Tian-Yu, et al.
Published: (2025)
by: Xiang, Tian-Yu, et al.
Published: (2025)
PlannerRFT: Reinforcing Diffusion Planners through Closed-Loop and Sample-Efficient Fine-Tuning
by: Li, Hongchen, et al.
Published: (2026)
by: Li, Hongchen, et al.
Published: (2026)
Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation
by: Wang, Guokang, et al.
Published: (2024)
by: Wang, Guokang, et al.
Published: (2024)
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
by: Zhang, Wenyao, et al.
Published: (2025)
by: Zhang, Wenyao, et al.
Published: (2025)
LoopVLA: Learning Sufficiency in Recurrent Refinement for Vision-Language-Action Models
by: Shen, Boyang, et al.
Published: (2026)
by: Shen, Boyang, et al.
Published: (2026)
Critic in the Loop: A Tri-System VLA Framework for Robust Long-Horizon Manipulation
by: Yi, Pengfei, et al.
Published: (2026)
by: Yi, Pengfei, et al.
Published: (2026)
PosA-VLA: Enhancing Action Generation via Pose-Conditioned Anchor Attention
by: Li, Ziwen, et al.
Published: (2025)
by: Li, Ziwen, et al.
Published: (2025)
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning
by: Xiang, Tian-Yu, et al.
Published: (2025)
by: Xiang, Tian-Yu, et al.
Published: (2025)
GazeVLA: Learning Human Intention for Robotic Manipulation
by: Li, Chengyang, et al.
Published: (2026)
by: Li, Chengyang, et al.
Published: (2026)
KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition
by: Han, Gaoge, et al.
Published: (2026)
by: Han, Gaoge, et al.
Published: (2026)
Can Aerial VLA Models Cooperate? Evaluating Closed-Loop Air-Ground Coordination with CARLA-Air
by: Zeng, Tianle, et al.
Published: (2026)
by: Zeng, Tianle, et al.
Published: (2026)
A Pragmatic VLA Foundation Model
by: Wu, Wei, et al.
Published: (2026)
by: Wu, Wei, et al.
Published: (2026)
TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
by: Liu, Jiahang, et al.
Published: (2025)
by: Liu, Jiahang, et al.
Published: (2025)
Similar Items
-
FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
by: Yang, Yifan, et al.
Published: (2025) -
DyDexHandover: Human-like Bimanual Dynamic Dexterous Handover using RGB-only Perception
by: Zhou, Haoran, et al.
Published: (2025) -
VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts
by: Jiang, Yuhua, et al.
Published: (2026) -
TacRefineNet: Tactile-Only Grasp Refinement Between Arbitrary In-Hand Object Poses
by: Wang, Shuaijun, et al.
Published: (2025) -
What to Ignore, What to React: Visually Robust RL Fine-Tuning of VLA Models
by: Peng, Yuanfang, et al.
Published: (2026)