| Main Authors: | Huang, Minqing, Xiang, Yujiao, Liang, Zihan, Huang, Jiajie, Wang, Jingqi, Xu, Zhi, Tan, Feiyang, Zhou, Hangning, Yang, Mu, Che, Gong |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.10426 |
Similar Items
ChainFlow-VLA: Causal Flow Planning with Vision-Language Models
by: Wang, Xiyang, et al.
Published: (2026)
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving
by: Jia, Feiyang, et al.
Published: (2026)
Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving
by: Liu, Qiqi, et al.
Published: (2026)
DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving
by: Shang, Shuyao, et al.
Published: (2026)
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
by: Zhou, Xin, et al.
Published: (2026)
ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving
by: Sheng, Zihao, et al.
Published: (2026)
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
by: Li, Yingyan, et al.
Published: (2025)
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
by: Zhou, Xin, et al.
Published: (2025)
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving
by: Wang, Xiyang, et al.
Published: (2024)
SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving
by: You, Zihan, et al.
Published: (2026)
FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
by: Zeng, Shuang, et al.
Published: (2025)
The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey
by: Tu, Sifan, et al.
Published: (2025)
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
by: Hu, Xiaotao, et al.
Published: (2024)
DriveEnv-NeRF: Exploration of A NeRF-Based Autonomous Driving Environment for Real-World Performance Validation
by: Shen, Mu-Yi, et al.
Published: (2024)
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
by: Zhang, Diankun, et al.
Published: (2024)
MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving
by: Huang, Yuzhou, et al.
Published: (2026)
BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents
by: Zhang, Yumeng, et al.
Published: (2024)
LaV-CoT: Language-Aware Visual CoT with Multi-Aspect Reward Optimization for Real-World Multilingual VQA
by: Huang, Jing, et al.
Published: (2025)
Vehicle Dynamics Embedded World Models for Autonomous Driving
by: Li, Huiqian, et al.
Published: (2025)
LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2026)
Doe-1: Closed-Loop Autonomous Driving with Large World Model
by: Zheng, Wenzhao, et al.
Published: (2024)
Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation
by: Gui, Xingtai, et al.
Published: (2026)
WorldVLA: Towards Autoregressive Action World Model
by: Cen, Jun, et al.
Published: (2025)
Age-Energy Analysis in Multi-Source Systems with Wake-up Control and Packet Management
by: Gong, Jie, et al.
Published: (2025)
VLA-R: Vision-Language Action Retrieval toward Open-World End-to-End Autonomous Driving
by: Seong, Hyunki, et al.
Published: (2025)
CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving
by: Arai, Hidehisa, et al.
Published: (2024)
Think Before You Drive: World Model-Inspired Multimodal Grounding for Autonomous Vehicles
by: Liao, Haicheng, et al.
Published: (2025)
Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)
by: Li, Qifeng, et al.
Published: (2024)
CoC-VLA: Delving into Adversarial Domain Transfer for Explainable Autonomous Driving via Chain-of-Causality Visual-Language-Action Model
by: Zhang, Dapeng, et al.
Published: (2025)
DriveFuture: Future-Aware Latent World Models for Autonomous Driving
by: Hong, Yufeng, et al.
Published: (2026)
Epona: Autoregressive Diffusion World Model for Autonomous Driving
by: Zhang, Kaiwen, et al.
Published: (2025)
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
by: Zheng, Xiaoji, et al.
Published: (2025)
World-Env: Leveraging World Model as a Virtual Environment for VLA Post-Training
by: Xiao, Junjin, et al.
Published: (2025)
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
by: Min, Chen, et al.
Published: (2024)
SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2026)
Enhancing End-to-End Autonomous Driving with Latent World Model
by: Li, Yingyan, et al.
Published: (2024)
TrajDiff: End-to-end Autonomous Driving without Perception Annotation
by: Gui, Xingtai, et al.
Published: (2025)
VLA-REPLICA: A Low-Cost, Reproducible Benchmark for Real-World Evaluation of Vision-Language-Action Models
by: Huang, Alex S., et al.
Published: (2026)
AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2025)
Xiaomi Auto World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving
by: Zhou, Lijun, et al.
Published: (2026)