:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Zeyu, Song, Nan, Li, Wei, Zhu, Xiatian, Zhang, Li, Torr, Philip H. S.
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.05075
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DeMo++: Motion Decoupling for Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2025)

RealEngine: Simulating Autonomous Driving in Realistic Context
by: Jiang, Junzhe, et al.
Published: (2025)

LMAD: Integrated End-to-End Vision-Language Model for Explainable Autonomous Driving
by: Song, Nan, et al.
Published: (2025)

Motion Forecasting in Continuous Driving
by: Song, Nan, et al.
Published: (2024)

See Tomorrow, Act Today: Foresight-Driven Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2026)

Driving View Synthesis on Free-form Trajectories with Generative Prior
by: Yang, Zeyu, et al.
Published: (2024)

4D Gaussian Splatting: Modeling Dynamic Scenes with Native 4D Primitives
by: Yang, Zeyu, et al.
Published: (2024)

ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2025)

Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution
by: Zhang, Bozhou, et al.
Published: (2025)

Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video
by: Pan, Zijie, et al.
Published: (2024)

SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2026)

Tetrahedron Splatting for 3D Generation
by: Gu, Chun, et al.
Published: (2024)

FlowAD: Ego-Scene Interactive Modeling for Autonomous Driving
by: Guo, Mingzhe, et al.
Published: (2026)

Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving
by: Liu, Qiqi, et al.
Published: (2026)

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
by: Li, Runjia, et al.
Published: (2025)

Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving
by: Zhang, Bozhou, et al.
Published: (2025)

Multi-human Interactive Talking Dataset
by: Zhu, Zeyu, et al.
Published: (2025)

Vision Transformers: From Semantic Segmentation to Dense Prediction
by: Zhang, Li, et al.
Published: (2022)

GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving
by: Zhang, Yunpeng, et al.
Published: (2024)

Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
by: Ding, Xinpeng, et al.
Published: (2024)

DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving
by: Wang, Sheng, et al.
Published: (2024)

Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning
by: Zhang, Bozhou, et al.
Published: (2025)

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
by: Cui, Erfei, et al.
Published: (2023)

Autonomous Character-Scene Interaction Synthesis from Text Instruction
by: Jiang, Nan, et al.
Published: (2024)

TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
by: Gu, Chun, et al.
Published: (2025)

Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)

Deep Leakage with Generative Flow Matching Denoiser
by: Baglin, Isaac, et al.
Published: (2026)

Post-interactive Multimodal Trajectory Prediction for Autonomous Driving
by: Huang, Ziyi, et al.
Published: (2025)

Towards Online Multi-Modal Social Interaction Understanding
by: Li, Xinpeng, et al.
Published: (2025)

ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving
by: Dong, Yinke, et al.
Published: (2024)

Unsupervised Audio-Visual Segmentation with Modality Alignment
by: Bhosale, Swapnil, et al.
Published: (2024)

123D: Unifying Multi-Modal Autonomous Driving Data at Scale
by: Dauner, Daniel, et al.
Published: (2026)

RealDriveSim: A Realistic Multi-Modal Multi-Task Synthetic Dataset for Autonomous Driving
by: Jadon, Arpit, et al.
Published: (2025)

CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction
by: Zhao, Liang, et al.
Published: (2024)

Neural Radiance Field in Autonomous Driving: A Survey
by: He, Lei, et al.
Published: (2024)

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving
by: Duan, Yiqun, et al.
Published: (2024)

VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction
by: Li, Shiying, et al.
Published: (2025)

Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
by: Zhao, Zongchuang, et al.
Published: (2025)

M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving
by: Xu, Dongyang, et al.
Published: (2024)

Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving
by: Sani, Depanshu, et al.
Published: (2024)