| Main Authors: | Ang, Sining, Yang, Yuguang, Dang, Chenxu, Chen, Canyu, Chi, Cheng, Liu, Haiyan, Mao, Xuanyao, Bao, Jason, Xuliang, Sun, Bingchuan, Wang, Yan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10719 |
Similar Items
CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning
by: Ang, Sining, et al.
Published: (2026)
Devil is in Narrow Policy: Unleashing Exploration in Driving VLA Models
by: Chen, Canyu, et al.
Published: (2026)
SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving
by: You, Zihan, et al.
Published: (2026)
SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries
by: Dang, Chenxu, et al.
Published: (2025)
VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision
by: Xu, Yi, et al.
Published: (2024)
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic-Spatial Fusion and Latent Predictive Representation
by: Fan, Zehua, et al.
Published: (2026)
DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving
by: Dang, Chenxu, et al.
Published: (2026)
Enhancing End-to-End Autonomous Driving with Risk Semantic Distillation from VLM
by: Qin, Jack, et al.
Published: (2025)
AR Forcing: Towards Long-Horizon Robot Navigation World Model
by: Yang, Yifei, et al.
Published: (2026)
SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving
by: Zheng, Peiru, et al.
Published: (2025)
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving
by: Song, Ruiqi, et al.
Published: (2025)
Senna-2: Aligning VLM and End-to-End Driving Policy for Consistent Decision Making and Planning
by: Song, Yuehao, et al.
Published: (2026)
V2X-VLM: End-to-End V2X Cooperative Autonomous Driving Through Large Vision-Language Models
by: You, Junwei, et al.
Published: (2024)
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy
by: Zhao, Zhonghan, et al.
Published: (2025)
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
by: Sun, Wenchao, et al.
Published: (2024)
GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving
by: Wan, Chi, et al.
Published: (2025)
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
by: Liu, Pei, et al.
Published: (2025)
AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models
by: Han, Yuxuan, et al.
Published: (2026)
FROST-Drive: Scalable and Efficient End-to-End Driving with a Frozen Vision Encoder
by: Dong, Zeyu, et al.
Published: (2026)
DualAD: Disentangling the Dynamic and Static World for End-to-End Driving
by: Doll, Simon, et al.
Published: (2024)
Exploring Contextual Representation and Multi-Modality for End-to-End Autonomous Driving
by: Azam, Shoaib, et al.
Published: (2022)
Navigation-Guided Sparse Scene Representation for End-to-End Autonomous Driving
by: Li, Peidong, et al.
Published: (2024)
VLM-3D: End-to-End Vision-Language Models for Open-World 3D Perception
by: Chang, Fuhao, et al.
Published: (2025)
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving
by: Chen, Xuesong, et al.
Published: (2025)
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation
by: Min, Anna, et al.
Published: (2025)
VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers
by: Guo, Ziang, et al.
Published: (2025)
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
by: Jiang, Bo, et al.
Published: (2024)
LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving
by: Song, Nan, et al.
Published: (2025)
GMF-Drive: Gated Mamba Fusion with Spatial-Aware BEV Representation for End-to-End Autonomous Driving
by: Wang, Jian, et al.
Published: (2025)
VECTOR-Drive: Tightly Coupled Vision-Language and Trajectory Expert Routing for End-to-End Autonomous Driving
by: Zhao, Rui, et al.
Published: (2026)
DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving
by: Yang, Zhenjie, et al.
Published: (2025)
PIE: Perception and Interaction Enhanced End-to-End Motion Planning for Autonomous Driving
by: Yuan, Chengran, et al.
Published: (2025)
QASMTrans: An End-to-End QASM Compilation Framework with Pulse Generation for Near-Term Quantum Devices
by: Hoyt, Aaron, et al.
Published: (2026)
AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving
by: Huang, Wenhui, et al.
Published: (2026)
NudgeVAD: Language-Nudged End-to-End Driving via FiLM Residuals
by: Yang, Chieh-Chi, et al.
Published: (2026)
Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
by: Zhang, Wei, et al.
Published: (2024)
P-Hologen: An End-to-End Generative Framework for Phase-Only Holograms
by: Park, JooHyun, et al.
Published: (2024)
P‐Hologen: An End‐to‐End Generative Framework for Phase‐Only Holograms
by: JooHyun Park, et al.
Published: (2024)
Collision-Aware Vision-Language Learning for End-to-End Driving with Multimodal Infraction Datasets
by: Koran, Alex, et al.
Published: (2026)
WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model
by: Zhang, Songyan, et al.
Published: (2024)