| Main Authors: | Pang, Yiwen, Zhou, Bo, Li, Changjin, Wang, Xuanhao, Xu, Shengxiang, Wang, Deng-Bao, Zhang, Min-Ling, Di, Shimin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.09430 |
Similar Items
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks
by: Yang, Yi, et al.
Published: (2025)
Long-Horizon Manipulation via Trace-Conditioned VLA Planning
by: Liu, Isabella, et al.
Published: (2026)
Anticipation-VLA: Solving Long-Horizon Embodied Tasks via Anticipation-based Subgoal Generation
by: Zhang, Zhilong, et al.
Published: (2026)
GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies
by: Neau, Maëlic, et al.
Published: (2025)
How Fast Can I Run My VLA? Demystifying VLA Inference Performance with VLA-Perf
by: Jiang, Wenqi, et al.
Published: (2026)
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation
by: Fan, Yiguo, et al.
Published: (2025)
SeqVLA: Sequential Task Execution for Long-Horizon Manipulation with Completion-Aware Vision-Language-Action Model
by: Yang, Ran, et al.
Published: (2025)
LongNav-R1: Horizon-Adaptive Multi-Turn RL for Long-Horizon VLA Navigation
by: Hu, Yue, et al.
Published: (2026)
Critic in the Loop: A Tri-System VLA Framework for Robust Long-Horizon Manipulation
by: Yi, Pengfei, et al.
Published: (2026)
BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning
by: Wang, Ruiheng, et al.
Published: (2026)
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation
by: Hu, Yucheng, et al.
Published: (2026)
IG-RFT: An Interaction-Guided RL Framework for VLA Models in Long-Horizon Robotic Manipulation
by: Su, Zhian, et al.
Published: (2026)
SimVLA: A Simple VLA Baseline for Robotic Manipulation
by: Luo, Yuankai, et al.
Published: (2026)
EchoVLA: Synergistic Declarative Memory for VLA-Driven Mobile Manipulation
by: Lin, Min, et al.
Published: (2025)
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
by: Li, Haozhan, et al.
Published: (2025)
Long-Term Memory for VLA-based Agents in Open-World Task Execution
by: Huang, Xu, et al.
Published: (2026)
OxyGen: Unified KV Cache Management for VLA Inference under Multi-Task Parallelism
by: Li, Xiangyu, et al.
Published: (2026)
VLA-RAIL: A Real-Time Asynchronous Inference Linker for VLA Models and Robots
by: Zhao, Yongsheng, et al.
Published: (2025)
DroneVLA: VLA based Aerial Manipulation
by: Mehboob, Fawad, et al.
Published: (2026)
Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds
by: Fan, Xianzhe, et al.
Published: (2026)
VLA-GSE: Boosting Parameter-Efficient Fine-Tuning in VLA with Generalized and Specialized Experts
by: Jiang, Yuhua, et al.
Published: (2026)
VLAgents: A Policy Server for Efficient VLA Inference
by: Jülg, Tobias, et al.
Published: (2026)
VLA Knows Its Limits
by: Wang, Haoxuan, et al.
Published: (2026)
Reflection-Based Task Adaptation for Self-Improving VLA
by: Li, Baicheng, et al.
Published: (2025)
Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance
by: Song, Wenxuan, et al.
Published: (2026)
RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation
by: Zhang, Yixue, et al.
Published: (2026)
RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks
by: Li, Ruiying, et al.
Published: (2026)
SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead
by: Ni, Chaojun, et al.
Published: (2025)
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation
by: Zhao, Han, et al.
Published: (2025)
AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)
Selective Perception for Robot: Task-Aware Attention in Multimodal VLA
by: Son, Young-Chae, et al.
Published: (2026)
Goal-VLA: Image-Generative VLMs as Object-Centric World Models Empowering Zero-shot Robot Manipulation
by: Chen, Haonan, et al.
Published: (2025)
Realtime-VLA FLASH: Speculative Inference Framework for Diffusion-based VLAs
by: Niu, Jiahui, et al.
Published: (2026)
JEPA-VLA: Video Predictive Embedding is Needed for VLA Models
by: Miao, Shangchen, et al.
Published: (2026)
PriorVLA: Prior-Preserving Adaptation for Vision-Language-Action Models
by: Guo, Xinyu, et al.
Published: (2026)
A Pragmatic VLA Foundation Model
by: Wu, Wei, et al.
Published: (2026)
Unified Noise Steering for Efficient Human-Guided VLA Adaptation
by: Lu, Junjie, et al.
Published: (2026)
Beyond Task Success: Behavioral and Representational Diagnostics for WAM and VLA
by: Mai, Hung, et al.
Published: (2026)
ST4VLA: Spatially Guided Training for Vision-Language-Action Models
by: Ye, Jinhui, et al.
Published: (2026)
RaceVLA: VLA-based Racing Drone Navigation with Human-like Behaviour
by: Serpiva, Valerii, et al.
Published: (2025)