:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Minqing, Xiang, Yujiao, Liang, Zihan, Huang, Jiajie, Wang, Jingqi, Xu, Zhi, Tan, Feiyang, Zhou, Hangning, Yang, Mu, Che, Gong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.10426
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ChainFlow-VLA: Causal Flow Planning with Vision-Language Models
by: Wang, Xiyang, et al.
Published: (2026)

DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving
by: jia, Feiyang, et al.
Published: (2026)

Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving
by: Liu, Qiqi, et al.
Published: (2026)

DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving
by: Shang, Shuyao, et al.
Published: (2026)

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
by: Zhou, Xin, et al.
Published: (2026)

ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving
by: Sheng, Zihao, et al.
Published: (2026)

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
by: Li, Yingyan, et al.
Published: (2025)

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
by: Zhou, Xin, et al.
Published: (2025)

MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving
by: Wang, Xiyang, et al.
Published: (2024)

SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving
by: You, Zihan, et al.
Published: (2026)

FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
by: Zeng, Shuang, et al.
Published: (2025)

The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey
by: Tu, Sifan, et al.
Published: (2025)

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
by: Hu, Xiaotao, et al.
Published: (2024)

DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation
by: Shen, Mu-Yi, et al.
Published: (2024)

SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
by: Zhang, Diankun, et al.
Published: (2024)

MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving
by: Huang, Yuzhou, et al.
Published: (2026)

BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents
by: Zhang, Yumeng, et al.
Published: (2024)

LaV-CoT: Language-Aware Visual CoT with Multi-Aspect Reward Optimization for Real-World Multilingual VQA
by: Huang, Jing, et al.
Published: (2025)

Vehicle Dynamics Embedded World Models for Autonomous Driving
by: Li, Huiqian, et al.
Published: (2025)

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2026)

Doe-1: Closed-Loop Autonomous Driving with Large World Model
by: Zheng, Wenzhao, et al.
Published: (2024)

Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation
by: Gui, Xingtai, et al.
Published: (2026)

WorldVLA: Towards Autoregressive Action World Model
by: Cen, Jun, et al.
Published: (2025)

Age-Energy Analysis in Multi-Source Systems with Wake-up Control and Packet Management
by: Gong, Jie, et al.
Published: (2025)

VLA-R: Vision-Language Action Retrieval toward Open-World End-to-End Autonomous Driving
by: Seong, Hyunki, et al.
Published: (2025)

CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
by: Arai, Hidehisa, et al.
Published: (2024)

Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
by: Liao, Haicheng, et al.
Published: (2025)

Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
by: Li, Qifeng, et al.
Published: (2024)

CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model
by: Zhang, Dapeng, et al.
Published: (2025)

DriveFuture: Future-Aware Latent World Models for Autonomous Driving
by: Hong, Yufeng, et al.
Published: (2026)

Epona: Autoregressive Diffusion World Model for Autonomous Driving
by: Zhang, Kaiwen, et al.
Published: (2025)

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
by: Zheng, Xiaoji, et al.
Published: (2025)

World-Env: Leveraging World Model as a Virtual Environment for VLA Post-Training
by: Xiao, Junjin, et al.
Published: (2025)

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
by: Min, Chen, et al.
Published: (2024)

SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2026)

Enhancing End-to-End Autonomous Driving with Latent World Model
by: Li, Yingyan, et al.
Published: (2024)

TrajDiff: End-to-end Autonomous Driving without Perception Annotation
by: Gui, Xingtai, et al.
Published: (2025)

VLA-REPLICA: A Low-Cost, Reproducible Benchmark for Real-World Evaluation of Vision-Language-Action Models
by: Huang, Alex S., et al.
Published: (2026)

AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2025)

Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving
by: Zhou, Lijun, et al.
Published: (2026)