Saved in:
| Main Authors: | Li, Wenhao, Su, Xiu, Niu, Dan, Cao, Yichao, Xu, Hongyan, Qu, Zhe, Fan, Lei, You, Shan, Xu, Chang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.01191 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
Decoupled Video Generation with Chain of Training-free Diffusion Model Experts
by: Li, Wenhao, et al.
Published: (2024)
by: Li, Wenhao, et al.
Published: (2024)
Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning
by: Peng, Zhenghao "Mark", et al.
Published: (2025)
by: Peng, Zhenghao "Mark", et al.
Published: (2025)
Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models
by: Li, Wenhao, et al.
Published: (2023)
by: Li, Wenhao, et al.
Published: (2023)
JEPA-VLA: Video Predictive Embedding is Needed for VLA Models
by: Miao, Shangchen, et al.
Published: (2026)
by: Miao, Shangchen, et al.
Published: (2026)
BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning
by: Wang, Ruiheng, et al.
Published: (2026)
by: Wang, Ruiheng, et al.
Published: (2026)
Environmental Monitoring Requirements for the ngVLA
by: Sridharan, T. K., et al.
Published: (2025)
by: Sridharan, T. K., et al.
Published: (2025)
BioProVLA-Agent: An Affordable, Protocol-Driven, Vision-Enhanced VLA-Enabled Embodied Multi-Agent System with Closed-Loop-Capable Reasoning for Biological Laboratory Manipulation
by: Du, Zhaohui, et al.
Published: (2026)
by: Du, Zhaohui, et al.
Published: (2026)
TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
by: Liu, Jiahang, et al.
Published: (2025)
by: Liu, Jiahang, et al.
Published: (2025)
TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches
by: Huang, Zhengxian, et al.
Published: (2026)
by: Huang, Zhengxian, et al.
Published: (2026)
DroneVLA: VLA based Aerial Manipulation
by: Mehboob, Fawad, et al.
Published: (2026)
by: Mehboob, Fawad, et al.
Published: (2026)
Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA
by: Wang, Zihua, et al.
Published: (2026)
by: Wang, Zihua, et al.
Published: (2026)
Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models
by: Bai, Shuanghao, et al.
Published: (2026)
by: Bai, Shuanghao, et al.
Published: (2026)
MWA and VLA Observations of Diffuse Radio Lobes in M 87
by: Wu, Linhui, et al.
Published: (2025)
by: Wu, Linhui, et al.
Published: (2025)
Sci-VLA: Agentic VLA Inference Plugin for Long-Horizon Tasks in Scientific Experiments
by: Pang, Yiwen, et al.
Published: (2026)
by: Pang, Yiwen, et al.
Published: (2026)
GazeVLA: Learning Human Intention for Robotic Manipulation
by: Li, Chengyang, et al.
Published: (2026)
by: Li, Chengyang, et al.
Published: (2026)
VLA-RAIL: A Real-Time Asynchronous Inference Linker for VLA Models and Robots
by: Zhao, Yongsheng, et al.
Published: (2025)
by: Zhao, Yongsheng, et al.
Published: (2025)
RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models
by: Liufu, Weijia, et al.
Published: (2026)
by: Liufu, Weijia, et al.
Published: (2026)
VLA+VLBA to ngVLA Transition Option Concepts
by: Corsi, Alessandra, et al.
Published: (2025)
by: Corsi, Alessandra, et al.
Published: (2025)
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
by: Li, Haozhan, et al.
Published: (2025)
by: Li, Haozhan, et al.
Published: (2025)
How Fast Can I Run My VLA? Demystifying VLA Inference Performance with VLA-Perf
by: Jiang, Wenqi, et al.
Published: (2026)
by: Jiang, Wenqi, et al.
Published: (2026)
DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-training
by: Wu, Yuemin, et al.
Published: (2025)
by: Wu, Yuemin, et al.
Published: (2025)
On-the-Fly VLA Adaptation via Test-Time Reinforcement Learning
by: Liu, Changyu, et al.
Published: (2026)
by: Liu, Changyu, et al.
Published: (2026)
Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds
by: Fan, Xianzhe, et al.
Published: (2026)
by: Fan, Xianzhe, et al.
Published: (2026)
SimVLA: A Simple VLA Baseline for Robotic Manipulation
by: Luo, Yuankai, et al.
Published: (2026)
by: Luo, Yuankai, et al.
Published: (2026)
AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)
by: Hirose, Noriaki, et al.
Published: (2026)
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
by: Shen, Yichao, et al.
Published: (2025)
by: Shen, Yichao, et al.
Published: (2025)
MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization
by: Zhao, Yunlong, et al.
Published: (2024)
by: Zhao, Yunlong, et al.
Published: (2024)
StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
by: Deng, Shengliang, et al.
Published: (2025)
by: Deng, Shengliang, et al.
Published: (2025)
RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training
by: Sun, Haoran, et al.
Published: (2026)
by: Sun, Haoran, et al.
Published: (2026)
VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching
by: Xu, Siyu, et al.
Published: (2025)
by: Xu, Siyu, et al.
Published: (2025)
OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning
by: Lin, Fanqi, et al.
Published: (2025)
by: Lin, Fanqi, et al.
Published: (2025)
Identify, Isolate, and Purge: Mitigating Hallucinations in LVLMs via Self-Evolving Distillation
by: Li, Wenhao, et al.
Published: (2025)
by: Li, Wenhao, et al.
Published: (2025)
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
by: Yin, Cheng, et al.
Published: (2025)
by: Yin, Cheng, et al.
Published: (2025)
SP-VLA: A Joint Model Scheduling and Token Pruning Approach for VLA Model Acceleration
by: Li, Ye, et al.
Published: (2025)
by: Li, Ye, et al.
Published: (2025)
AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
by: Xiao, Lei, et al.
Published: (2025)
by: Xiao, Lei, et al.
Published: (2025)
KV-Efficient VLA: A Method to Speed up Vision Language Models with RNN-Gated Chunked KV Cache
by: Xu, Wanshun, et al.
Published: (2025)
by: Xu, Wanshun, et al.
Published: (2025)
EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation
by: Lin, Min, et al.
Published: (2025)
by: Lin, Min, et al.
Published: (2025)
EvoDriveVLA: Evolving Driving VLA Models via Collaborative Perception-Planning Distillation
by: Cao, Jiajun, et al.
Published: (2026)
by: Cao, Jiajun, et al.
Published: (2026)
Think Proprioceptively: Embodied Visual Reasoning for VLA Manipulation
by: Wang, Fangyuan, et al.
Published: (2026)
by: Wang, Fangyuan, et al.
Published: (2026)
Similar Items
-
VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
by: Li, Wenhao, et al.
Published: (2026) -
Decoupled Video Generation with Chain of Training-free Diffusion Model Experts
by: Li, Wenhao, et al.
Published: (2024) -
Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning
by: Peng, Zhenghao "Mark", et al.
Published: (2025) -
Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models
by: Li, Wenhao, et al.
Published: (2023) -
JEPA-VLA: Video Predictive Embedding is Needed for VLA Models
by: Miao, Shangchen, et al.
Published: (2026)