:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Wenhao, Su, Xiu, Niu, Dan, Cao, Yichao, Xu, Hongyan, Qu, Zhe, Fan, Lei, You, Shan, Xu, Chang
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2605.01191
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

VLA-ATTC: Adaptive Test-Time Compute for VLA Models with Relative Action Critic Model
by: Li, Wenhao, et al.
Published: (2026)

Decoupled Video Generation with Chain of Training-free Diffusion Model Experts
by: Li, Wenhao, et al.
Published: (2024)

Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning
by: Peng, Zhenghao "Mark", et al.
Published: (2025)

Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models
by: Li, Wenhao, et al.
Published: (2023)

JEPA-VLA: Video Predictive Embedding is Needed for VLA Models
by: Miao, Shangchen, et al.
Published: (2026)

BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning
by: Wang, Ruiheng, et al.
Published: (2026)

Environmental Monitoring Requirements for the ngVLA
by: Sridharan, T. K., et al.
Published: (2025)

BioProVLA-Agent: An Affordable, Protocol-Driven, Vision-Enhanced VLA-Enabled Embodied Multi-Agent System with Closed-Loop-Capable Reasoning for Biological Laboratory Manipulation
by: Du, Zhaohui, et al.
Published: (2026)

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking
by: Liu, Jiahang, et al.
Published: (2025)

TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches
by: Huang, Zhengxian, et al.
Published: (2026)

DroneVLA: VLA based Aerial Manipulation
by: Mehboob, Fawad, et al.
Published: (2026)

Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA
by: Wang, Zihua, et al.
Published: (2026)

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models
by: Bai, Shuanghao, et al.
Published: (2026)

MWA and VLA Observations of Diffuse Radio Lobes in M 87
by: Wu, Linhui, et al.
Published: (2025)

Sci-VLA: Agentic VLA Inference Plugin for Long-Horizon Tasks in Scientific Experiments
by: Pang, Yiwen, et al.
Published: (2026)

GazeVLA: Learning Human Intention for Robotic Manipulation
by: Li, Chengyang, et al.
Published: (2026)

VLA-RAIL: A Real-Time Asynchronous Inference Linker for VLA Models and Robots
by: Zhao, Yongsheng, et al.
Published: (2025)

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models
by: Liufu, Weijia, et al.
Published: (2026)

VLA+VLBA to ngVLA Transition Option Concepts
by: Corsi, Alessandra, et al.
Published: (2025)

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
by: Li, Haozhan, et al.
Published: (2025)

How Fast Can I Run My VLA? Demystifying VLA Inference Performance with VLA-Perf
by: Jiang, Wenqi, et al.
Published: (2026)

DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-training
by: Wu, Yuemin, et al.
Published: (2025)

On-the-Fly VLA Adaptation via Test-Time Reinforcement Learning
by: Liu, Changyu, et al.
Published: (2026)

Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds
by: Fan, Xianzhe, et al.
Published: (2026)

SimVLA: A Simple VLA Baseline for Robotic Manipulation
by: Luo, Yuankai, et al.
Published: (2026)

AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)

VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
by: Shen, Yichao, et al.
Published: (2025)

MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization
by: Zhao, Yunlong, et al.
Published: (2024)

StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision
by: Deng, Shengliang, et al.
Published: (2025)

RL-VLA$^3$: A Flexible and Asynchronous Reinforcement Learning Framework for VLA Training
by: Sun, Haoran, et al.
Published: (2026)

VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching
by: Xu, Siyu, et al.
Published: (2025)

OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning
by: Lin, Fanqi, et al.
Published: (2025)

Identify, Isolate, and Purge: Mitigating Hallucinations in LVLMs via Self-Evolving Distillation
by: Li, Wenhao, et al.
Published: (2025)

DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
by: Yin, Cheng, et al.
Published: (2025)

SP-VLA: A Joint Model Scheduling and Token Pruning Approach for VLA Model Acceleration
by: Li, Ye, et al.
Published: (2025)

AVA-VLA: Improving Vision-Language-Action models with Active Visual Attention
by: Xiao, Lei, et al.
Published: (2025)

KV-Efficient VLA: A Method to Speed up Vision Language Models with RNN-Gated Chunked KV Cache
by: Xu, Wanshun, et al.
Published: (2025)

EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation
by: Lin, Min, et al.
Published: (2025)

EvoDriveVLA: Evolving Driving VLA Models via Collaborative Perception-Planning Distillation
by: Cao, Jiajun, et al.
Published: (2026)

Think Proprioceptively: Embodied Visual Reasoning for VLA Manipulation
by: Wang, Fangyuan, et al.
Published: (2026)