Guardado en:
| Autores principales: | Yang, Zhenwei, Ai, Yibo, Zhang, Weidong |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.21831 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation
por: Yang, Zhenwei, et al.
Publicado: (2024)
por: Yang, Zhenwei, et al.
Publicado: (2024)
End-to-End Autonomous Driving through V2X Cooperation
por: Yu, Haibao, et al.
Publicado: (2024)
por: Yu, Haibao, et al.
Publicado: (2024)
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
por: Zhang, Jiaqing, et al.
Publicado: (2024)
por: Zhang, Jiaqing, et al.
Publicado: (2024)
Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception
por: Li, Xiaoyu, et al.
Publicado: (2025)
por: Li, Xiaoyu, et al.
Publicado: (2025)
Unified End-to-End V2X Cooperative Autonomous Driving
por: Li, Zhiwei, et al.
Publicado: (2024)
por: Li, Zhiwei, et al.
Publicado: (2024)
Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving
por: Zhang, Bozhou, et al.
Publicado: (2025)
por: Zhang, Bozhou, et al.
Publicado: (2025)
VLM-3D:End-to-End Vision-Language Models for Open-World 3D Perception
por: Chang, Fuhao, et al.
Publicado: (2025)
por: Chang, Fuhao, et al.
Publicado: (2025)
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding
por: Xiong, Haomiao, et al.
Publicado: (2025)
por: Xiong, Haomiao, et al.
Publicado: (2025)
UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving
por: Song, Ziyi, et al.
Publicado: (2025)
por: Song, Ziyi, et al.
Publicado: (2025)
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
por: Liu, Pei, et al.
Publicado: (2025)
por: Liu, Pei, et al.
Publicado: (2025)
Li-ViP3D++: Query-Gated Deformable Camera-LiDAR Fusion for End-to-End Perception and Trajectory Prediction
por: Halinkovic, Matej, et al.
Publicado: (2026)
por: Halinkovic, Matej, et al.
Publicado: (2026)
Decoupling Scene Perception and Ego Status: A Multi-Context Fusion Approach for Enhanced Generalization in End-to-End Autonomous Driving
por: Tang, Jiacheng, et al.
Publicado: (2025)
por: Tang, Jiacheng, et al.
Publicado: (2025)
E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
por: Cong, Wenyan, et al.
Publicado: (2025)
por: Cong, Wenyan, et al.
Publicado: (2025)
An Effective End-to-End Solution for Multimodal Action Recognition
por: Wang, Songping, et al.
Publicado: (2025)
por: Wang, Songping, et al.
Publicado: (2025)
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
por: Zhu, Runsong, et al.
Publicado: (2025)
por: Zhu, Runsong, et al.
Publicado: (2025)
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
por: Gong, Kehong, et al.
Publicado: (2026)
por: Gong, Kehong, et al.
Publicado: (2026)
USAD: End-to-End Human Activity Recognition via Diffusion Model with Spatiotemporal Attention
por: Xiao, Hang, et al.
Publicado: (2025)
por: Xiao, Hang, et al.
Publicado: (2025)
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
por: Yin, Junbo, et al.
Publicado: (2024)
por: Yin, Junbo, et al.
Publicado: (2024)
RAP: 3D Rasterization Augmented End-to-End Planning
por: Feng, Lan, et al.
Publicado: (2025)
por: Feng, Lan, et al.
Publicado: (2025)
Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System
por: Liu, Genjia, et al.
Publicado: (2024)
por: Liu, Genjia, et al.
Publicado: (2024)
Drive-JEPA: Video JEPA Meets Multimodal Trajectory Distillation for End-to-End Driving
por: Wang, Linhan, et al.
Publicado: (2026)
por: Wang, Linhan, et al.
Publicado: (2026)
SS3D: End2End Self-Supervised 3D from Web Videos
por: Hariat, Marwane, et al.
Publicado: (2026)
por: Hariat, Marwane, et al.
Publicado: (2026)
E2E-GMNER: End-to-End Generative Grounded Multimodal Named Entity Recognition
por: Zhang, Meng, et al.
Publicado: (2026)
por: Zhang, Meng, et al.
Publicado: (2026)
Multimodal Action Diffusion for Robust End-to-End Autonomous Driving
por: Rodríguez-Vidal, Jorge Daniel, et al.
Publicado: (2026)
por: Rodríguez-Vidal, Jorge Daniel, et al.
Publicado: (2026)
GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous Driving
por: Liu, Shuai, et al.
Publicado: (2025)
por: Liu, Shuai, et al.
Publicado: (2025)
End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction
por: Zhang, Haoyu, et al.
Publicado: (2026)
por: Zhang, Haoyu, et al.
Publicado: (2026)
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving
por: Xing, Zebin, et al.
Publicado: (2025)
por: Xing, Zebin, et al.
Publicado: (2025)
Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition
por: Hao, Ruiyang, et al.
Publicado: (2025)
por: Hao, Ruiyang, et al.
Publicado: (2025)
Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
por: Lei, Biwen, et al.
Publicado: (2025)
por: Lei, Biwen, et al.
Publicado: (2025)
CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception
por: Zhong, Jiaru, et al.
Publicado: (2025)
por: Zhong, Jiaru, et al.
Publicado: (2025)
Fose: Fusion of One-Step Diffusion and End-to-End Network for Pansharpening
por: Liu, Kai, et al.
Publicado: (2025)
por: Liu, Kai, et al.
Publicado: (2025)
LFP: Efficient and Accurate End-to-End Lane-Level Planning via Camera-LiDAR Fusion
por: You, Guoliang, et al.
Publicado: (2024)
por: You, Guoliang, et al.
Publicado: (2024)
Percept-WAM: Perception-Enhanced World-Awareness-Action Model for Robust End-to-End Autonomous Driving
por: Han, Jianhua, et al.
Publicado: (2025)
por: Han, Jianhua, et al.
Publicado: (2025)
End-to-End Autonomous Driving without Costly Modularization and 3D Manual Annotation
por: Guo, Mingzhe, et al.
Publicado: (2024)
por: Guo, Mingzhe, et al.
Publicado: (2024)
REMM:Rotation-Equivariant Framework for End-to-End Multimodal Image Matching
por: Nie, Han, et al.
Publicado: (2024)
por: Nie, Han, et al.
Publicado: (2024)
HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving
por: Xia, Zhongyu, et al.
Publicado: (2025)
por: Xia, Zhongyu, et al.
Publicado: (2025)
FusionTrack: End-to-End Multi-Object Tracking in Arbitrary Multi-View Environment
por: Li, Xiaohe, et al.
Publicado: (2025)
por: Li, Xiaohe, et al.
Publicado: (2025)
Spatially Visual Perception for End-to-End Robotic Learning
por: Davies, Travis, et al.
Publicado: (2024)
por: Davies, Travis, et al.
Publicado: (2024)
DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
por: Zou, Jialv, et al.
Publicado: (2025)
por: Zou, Jialv, et al.
Publicado: (2025)
Enhancing Synthetic CT from CBCT via Multimodal Fusion and End-To-End Registration
por: Tschuchnig, Maximilian, et al.
Publicado: (2025)
por: Tschuchnig, Maximilian, et al.
Publicado: (2025)
Ejemplares similares
-
LiDAR-based End-to-end Temporal Perception for Vehicle-Infrastructure Cooperation
por: Yang, Zhenwei, et al.
Publicado: (2024) -
End-to-End Autonomous Driving through V2X Cooperation
por: Yu, Haibao, et al.
Publicado: (2024) -
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
por: Zhang, Jiaqing, et al.
Publicado: (2024) -
Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception
por: Li, Xiaoyu, et al.
Publicado: (2025) -
Unified End-to-End V2X Cooperative Autonomous Driving
por: Li, Zhiwei, et al.
Publicado: (2024)