Saved in:
| Main Authors: | Yang, Zeyu, Song, Nan, Li, Wei, Zhu, Xiatian, Zhang, Li, Torr, Philip H. S. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.05075 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeMo++: Motion Decoupling for Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2025)
by: Zhang, Bozhou, et al.
Published: (2025)
RealEngine: Simulating Autonomous Driving in Realistic Context
by: Jiang, Junzhe, et al.
Published: (2025)
by: Jiang, Junzhe, et al.
Published: (2025)
LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving
by: Song, Nan, et al.
Published: (2025)
by: Song, Nan, et al.
Published: (2025)
Motion Forecasting in Continuous Driving
by: Song, Nan, et al.
Published: (2024)
by: Song, Nan, et al.
Published: (2024)
See Tomorrow, Act Today: Foresight-Driven Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2026)
by: Zhang, Bozhou, et al.
Published: (2026)
Driving View Synthesis on Free-form Trajectories with Generative Prior
by: Yang, Zeyu, et al.
Published: (2024)
by: Yang, Zeyu, et al.
Published: (2024)
4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives
by: Yang, Zeyu, et al.
Published: (2024)
by: Yang, Zeyu, et al.
Published: (2024)
ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2025)
by: Li, Jingyu, et al.
Published: (2025)
Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution
by: Zhang, Bozhou, et al.
Published: (2025)
by: Zhang, Bozhou, et al.
Published: (2025)
Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video
by: Pan, Zijie, et al.
Published: (2024)
by: Pan, Zijie, et al.
Published: (2024)
SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2026)
by: Li, Jingyu, et al.
Published: (2026)
Tetrahedron Splatting for 3D Generation
by: Gu, Chun, et al.
Published: (2024)
by: Gu, Chun, et al.
Published: (2024)
FlowAD: Ego-Scene Interactive Modeling for Autonomous Driving
by: Guo, Mingzhe, et al.
Published: (2026)
by: Guo, Mingzhe, et al.
Published: (2026)
Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving
by: Liu, Qiqi, et al.
Published: (2026)
by: Liu, Qiqi, et al.
Published: (2026)
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
by: Li, Runjia, et al.
Published: (2025)
by: Li, Runjia, et al.
Published: (2025)
Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2025)
by: Zhang, Bozhou, et al.
Published: (2025)
Multi-human Interactive Talking Dataset
by: Zhu, Zeyu, et al.
Published: (2025)
by: Zhu, Zeyu, et al.
Published: (2025)
Vision Transformers: From Semantic Segmentation to Dense Prediction
by: Zhang, Li, et al.
Published: (2022)
by: Zhang, Li, et al.
Published: (2022)
GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving
by: Zhang, Yunpeng, et al.
Published: (2024)
by: Zhang, Yunpeng, et al.
Published: (2024)
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
by: Ding, Xinpeng, et al.
Published: (2024)
by: Ding, Xinpeng, et al.
Published: (2024)
DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving
by: Wang, Sheng, et al.
Published: (2024)
by: Wang, Sheng, et al.
Published: (2024)
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning
by: Zhang, Bozhou, et al.
Published: (2025)
by: Zhang, Bozhou, et al.
Published: (2025)
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
by: Cui, Erfei, et al.
Published: (2023)
by: Cui, Erfei, et al.
Published: (2023)
Autonomous Character-Scene Interaction Synthesis from Text Instruction
by: Jiang, Nan, et al.
Published: (2024)
by: Jiang, Nan, et al.
Published: (2024)
TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
by: Gu, Chun, et al.
Published: (2025)
by: Gu, Chun, et al.
Published: (2025)
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)
by: Lou, Yang, et al.
Published: (2023)
Deep Leakage with Generative Flow Matching Denoiser
by: Baglin, Isaac, et al.
Published: (2026)
by: Baglin, Isaac, et al.
Published: (2026)
Post-interactive Multimodal Trajectory Prediction for Autonomous Driving
by: Huang, Ziyi, et al.
Published: (2025)
by: Huang, Ziyi, et al.
Published: (2025)
Towards Online Multi-Modal Social Interaction Understanding
by: Li, Xinpeng, et al.
Published: (2025)
by: Li, Xinpeng, et al.
Published: (2025)
ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving
by: Dong, Yinke, et al.
Published: (2024)
by: Dong, Yinke, et al.
Published: (2024)
Unsupervised Audio-Visual Segmentation with Modality Alignment
by: Bhosale, Swapnil, et al.
Published: (2024)
by: Bhosale, Swapnil, et al.
Published: (2024)
123D: Unifying Multi-Modal Autonomous Driving Data at Scale
by: Dauner, Daniel, et al.
Published: (2026)
by: Dauner, Daniel, et al.
Published: (2026)
RealDriveSim: A Realistic Multi-Modal Multi-Task Synthetic Dataset for Autonomous Driving
by: Jadon, Arpit, et al.
Published: (2025)
by: Jadon, Arpit, et al.
Published: (2025)
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
by: Zhao, Liang, et al.
Published: (2024)
by: Zhao, Liang, et al.
Published: (2024)
Neural Radiance Field in Autonomous Driving: A Survey
by: He, Lei, et al.
Published: (2024)
by: He, Lei, et al.
Published: (2024)
MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving
by: Duan, Yiqun, et al.
Published: (2024)
by: Duan, Yiqun, et al.
Published: (2024)
VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
by: Li, Shiying, et al.
Published: (2025)
by: Li, Shiying, et al.
Published: (2025)
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
by: Zhao, Zongchuang, et al.
Published: (2025)
by: Zhao, Zongchuang, et al.
Published: (2025)
M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving
by: Xu, Dongyang, et al.
Published: (2024)
by: Xu, Dongyang, et al.
Published: (2024)
Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving
by: Sani, Depanshu, et al.
Published: (2024)
by: Sani, Depanshu, et al.
Published: (2024)
Similar Items
-
DeMo++: Motion Decoupling for Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2025) -
RealEngine: Simulating Autonomous Driving in Realistic Context
by: Jiang, Junzhe, et al.
Published: (2025) -
LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving
by: Song, Nan, et al.
Published: (2025) -
Motion Forecasting in Continuous Driving
by: Song, Nan, et al.
Published: (2024) -
See Tomorrow, Act Today: Foresight-Driven Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2026)