Saved in:
| Main Authors: | Liu, Shuo, Shi, Lei, Liu, Haowen, Xu, Jing, Gao, Yufei, Shi, Yucheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.04475 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
by: Liu, Pei, et al.
Published: (2025)
by: Liu, Pei, et al.
Published: (2025)
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
by: Shi, Chen, et al.
Published: (2025)
by: Shi, Chen, et al.
Published: (2025)
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
by: Min, Chen, et al.
Published: (2024)
by: Min, Chen, et al.
Published: (2024)
A Neuro-Symbolic Framework Combining Inductive and Deductive Reasoning for Autonomous Driving Planning
by: Wei, Hongyan, et al.
Published: (2026)
by: Wei, Hongyan, et al.
Published: (2026)
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
by: Pei, Muleilan, et al.
Published: (2025)
by: Pei, Muleilan, et al.
Published: (2025)
Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?
by: Bai, Yifan, et al.
Published: (2024)
by: Bai, Yifan, et al.
Published: (2024)
SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding
by: Zeng, Nianbo, et al.
Published: (2025)
by: Zeng, Nianbo, et al.
Published: (2025)
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving
by: Schön, Markus, et al.
Published: (2022)
by: Schön, Markus, et al.
Published: (2022)
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
by: Sun, Wenchao, et al.
Published: (2024)
by: Sun, Wenchao, et al.
Published: (2024)
NeuroCLIP: Neuromorphic Data Understanding by CLIP and SNN
by: Guo, Yufei, et al.
Published: (2023)
by: Guo, Yufei, et al.
Published: (2023)
Towards Neuro-Symbolic Video Understanding
by: Choi, Minkyu, et al.
Published: (2024)
by: Choi, Minkyu, et al.
Published: (2024)
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving
by: Kong, Lingdong, et al.
Published: (2024)
by: Kong, Lingdong, et al.
Published: (2024)
MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
by: Zhang, Zhiyuan, et al.
Published: (2025)
by: Zhang, Zhiyuan, et al.
Published: (2025)
DriveWAM: Video Generative Priors Enable Scalable World-Action Modeling for Autonomous Driving
by: Shi, Chen, et al.
Published: (2026)
by: Shi, Chen, et al.
Published: (2026)
Real2Sim: A Physics-driven and Editable Gaussian Splatting Framework for Autonomous Driving Scenes
by: Huang, Kaicong, et al.
Published: (2026)
by: Huang, Kaicong, et al.
Published: (2026)
SparseWorld: Enhancing End-to-End Autonomous Driving via World Models with Sparse Scene Representation
by: Wang, Ruoyu, et al.
Published: (2026)
by: Wang, Ruoyu, et al.
Published: (2026)
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
by: Wei, Yuxi, et al.
Published: (2024)
by: Wei, Yuxi, et al.
Published: (2024)
FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering
by: Zhou, Jingqiu, et al.
Published: (2025)
by: Zhou, Jingqiu, et al.
Published: (2025)
RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving
by: Zunair, Hasib, et al.
Published: (2024)
by: Zunair, Hasib, et al.
Published: (2024)
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
by: Shi, Yining, et al.
Published: (2024)
by: Shi, Yining, et al.
Published: (2024)
Symbolic Graph Inference for Compound Scene Understanding
by: Aryan, FNU, et al.
Published: (2024)
by: Aryan, FNU, et al.
Published: (2024)
Neuro-Symbolic Manipulation Understanding with Enriched Semantic Event Chains
by: Ziaeetabar, Fatemeh
Published: (2026)
by: Ziaeetabar, Fatemeh
Published: (2026)
RALAD: Bridging the Real-to-Sim Domain Gap in Autonomous Driving with Retrieval-Augmented Learning
by: Zuo, Jiacheng, et al.
Published: (2025)
by: Zuo, Jiacheng, et al.
Published: (2025)
PADriver: Towards Personalized Autonomous Driving
by: Kou, Genghua, et al.
Published: (2025)
by: Kou, Genghua, et al.
Published: (2025)
CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving
by: Xiang, Qi, et al.
Published: (2025)
by: Xiang, Qi, et al.
Published: (2025)
TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving
by: Fu, Yanping, et al.
Published: (2025)
by: Fu, Yanping, et al.
Published: (2025)
Nighttime Autonomous Driving Scene Reconstruction with Physically-Based Gaussian Splatting
by: Kim, Tae-Kyeong, et al.
Published: (2026)
by: Kim, Tae-Kyeong, et al.
Published: (2026)
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving
by: Ljungbergh, William, et al.
Published: (2024)
by: Ljungbergh, William, et al.
Published: (2024)
T2SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving
by: Lv, Changsheng, et al.
Published: (2024)
by: Lv, Changsheng, et al.
Published: (2024)
ReSim: Reliable World Simulation for Autonomous Driving
by: Yang, Jiazhi, et al.
Published: (2025)
by: Yang, Jiazhi, et al.
Published: (2025)
PreGSU-A Generalized Traffic Scene Understanding Model for Autonomous Driving based on Pre-trained Graph Attention Network
by: Wang, Yuning, et al.
Published: (2024)
by: Wang, Yuning, et al.
Published: (2024)
AD-EE: Early Exiting for Fast and Reliable Vision-Language Models in Autonomous Driving
by: Huang, Lianming, et al.
Published: (2025)
by: Huang, Lianming, et al.
Published: (2025)
VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images
by: Penzkofer, Anna, et al.
Published: (2024)
by: Penzkofer, Anna, et al.
Published: (2024)
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving
by: Zheng, Lianqing, et al.
Published: (2024)
by: Zheng, Lianqing, et al.
Published: (2024)
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving
by: Gao, Haoxiang, et al.
Published: (2025)
by: Gao, Haoxiang, et al.
Published: (2025)
DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding
by: Tao, Mingzhe, et al.
Published: (2026)
by: Tao, Mingzhe, et al.
Published: (2026)
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving
by: Song, Ruiqi, et al.
Published: (2025)
by: Song, Ruiqi, et al.
Published: (2025)
FSF-Net: Enhance 4D Occupancy Forecasting with Coarse BEV Scene Flow for Autonomous Driving
by: Guo, Erxin, et al.
Published: (2024)
by: Guo, Erxin, et al.
Published: (2024)
MindDrive: An All-in-One Framework Bridging World Models and Vision-Language Model for End-to-End Autonomous Driving
by: Sun, Bin, et al.
Published: (2025)
by: Sun, Bin, et al.
Published: (2025)
Unifying Language-Action Understanding and Generation for Autonomous Driving
by: Wang, Xinyang, et al.
Published: (2026)
by: Wang, Xinyang, et al.
Published: (2026)
Similar Items
-
OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
by: Liu, Pei, et al.
Published: (2025) -
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
by: Shi, Chen, et al.
Published: (2025) -
DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving
by: Min, Chen, et al.
Published: (2024) -
A Neuro-Symbolic Framework Combining Inductive and Deductive Reasoning for Autonomous Driving Planning
by: Wei, Hongyan, et al.
Published: (2026) -
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving
by: Pei, Muleilan, et al.
Published: (2025)