Saved in:
| Main Authors: | Li, Ke, Zhang, Chenyu, Ding, Yuxin, Hu, Xianbiao, Qin, Ruwen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.17101 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CrashChat: A Multimodal Large Language Model for Multitask Traffic Crash Video Analysis
by: Liang, Kaidi, et al.
Published: (2025)
by: Liang, Kaidi, et al.
Published: (2025)
HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors
by: Li, Ke, et al.
Published: (2026)
by: Li, Ke, et al.
Published: (2026)
Attention-Enhanced Co-Interactive Fusion Network (AECIF-Net) for Automated Structural Condition Assessment in Visual Inspection
by: Zhang, Chenyu, et al.
Published: (2023)
by: Zhang, Chenyu, et al.
Published: (2023)
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
by: Li, Xiangyu, et al.
Published: (2025)
by: Li, Xiangyu, et al.
Published: (2025)
Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
by: Zhang, Kaixin, et al.
Published: (2024)
by: Zhang, Kaixin, et al.
Published: (2024)
Unity in Diversity: Multi-expert Knowledge Confrontation and Collaboration for Generalizable Vehicle Re-identification
by: Kuang, Zhenyu, et al.
Published: (2024)
by: Kuang, Zhenyu, et al.
Published: (2024)
BoxMAC -- A Boxing Dataset for Multi-label Action Classification
by: Sahoo, Shashikanta
Published: (2024)
by: Sahoo, Shashikanta
Published: (2024)
Active Learning from Scene Embeddings for End-to-End Autonomous Driving
by: Jiang, Wenhao, et al.
Published: (2025)
by: Jiang, Wenhao, et al.
Published: (2025)
A Novel Perspective for Multi-modal Multi-label Skin Lesion Classification
by: Zhang, Yuan, et al.
Published: (2024)
by: Zhang, Yuan, et al.
Published: (2024)
Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning
by: He, Tao, et al.
Published: (2024)
by: He, Tao, et al.
Published: (2024)
UniFlow: Zero-Shot LiDAR Scene Flow for Autonomous Vehicles
by: Li, Siyi, et al.
Published: (2025)
by: Li, Siyi, et al.
Published: (2025)
HorizonForge: Driving Scene Editing with Any Trajectories and Any Vehicles
by: Wang, Yifan, et al.
Published: (2026)
by: Wang, Yifan, et al.
Published: (2026)
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
by: Rahman, Md. Mahfuzur, et al.
Published: (2024)
by: Rahman, Md. Mahfuzur, et al.
Published: (2024)
HD$^2$-SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous Driving
by: Yang, Zhiwen, et al.
Published: (2025)
by: Yang, Zhiwen, et al.
Published: (2025)
MultiEgo: A Multi-View Egocentric Video Dataset for 4D Scene Reconstruction
by: Li, Bate, et al.
Published: (2025)
by: Li, Bate, et al.
Published: (2025)
SeePerSea: Multi-modal Perception Dataset of In-water Objects for Autonomous Surface Vehicles
by: Jeong, Mingi, et al.
Published: (2024)
by: Jeong, Mingi, et al.
Published: (2024)
Medical Report Generation Is A Multi-label Classification Problem
by: Fan, Yijian, et al.
Published: (2024)
by: Fan, Yijian, et al.
Published: (2024)
Real-Time Environment Condition Classification for Autonomous Vehicles
by: Introvigne, Marco, et al.
Published: (2024)
by: Introvigne, Marco, et al.
Published: (2024)
Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching
by: Hu, Xin, et al.
Published: (2026)
by: Hu, Xin, et al.
Published: (2026)
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
by: Shi, Chen, et al.
Published: (2025)
by: Shi, Chen, et al.
Published: (2025)
Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification
by: Huang, Gexin, et al.
Published: (2024)
by: Huang, Gexin, et al.
Published: (2024)
ReconDrive: Fast Feed-Forward 4D Gaussian Splatting for Autonomous Driving Scene Reconstruction
by: Yu, Haibao, et al.
Published: (2026)
by: Yu, Haibao, et al.
Published: (2026)
Multi-label Classification with Panoptic Context Aggregation Networks
by: Jiu, Mingyuan, et al.
Published: (2025)
by: Jiu, Mingyuan, et al.
Published: (2025)
Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation
by: Li, Muquan, et al.
Published: (2024)
by: Li, Muquan, et al.
Published: (2024)
V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication
by: Zhang, Yuanfang, et al.
Published: (2024)
by: Zhang, Yuanfang, et al.
Published: (2024)
Acquiring Submillimeter-Accurate Multi-Task Vision Datasets for Computer-Assisted Orthopedic Surgery
by: Most, Emma, et al.
Published: (2025)
by: Most, Emma, et al.
Published: (2025)
Dataset and Benchmark: Novel Sensors for Autonomous Vehicle Perception
by: Carmichael, Spencer, et al.
Published: (2024)
by: Carmichael, Spencer, et al.
Published: (2024)
SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
by: Qin, Zhenyuan, et al.
Published: (2025)
by: Qin, Zhenyuan, et al.
Published: (2025)
Double Helix Diffusion for Cross-Domain Anomaly Image Generation
by: Wu, Linchun, et al.
Published: (2025)
by: Wu, Linchun, et al.
Published: (2025)
Dataset Diversity Metrics and Impact on Classification Models
by: Sourget, Théo, et al.
Published: (2026)
by: Sourget, Théo, et al.
Published: (2026)
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
by: Wu, Zehuan, et al.
Published: (2024)
by: Wu, Zehuan, et al.
Published: (2024)
Knowledge-Aware Neuron Interpretation for Scene Classification
by: Guan, Yong, et al.
Published: (2024)
by: Guan, Yong, et al.
Published: (2024)
Adaptive Dataset Quantization
by: Li, Muquan, et al.
Published: (2024)
by: Li, Muquan, et al.
Published: (2024)
Semantic Segmentation based Scene Understanding in Autonomous Vehicles
by: Rassekh, Ehsan
Published: (2025)
by: Rassekh, Ehsan
Published: (2025)
SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
by: Li, Jingyu, et al.
Published: (2026)
by: Li, Jingyu, et al.
Published: (2026)
DriveIndia: An Object Detection Dataset for Diverse Indian Traffic Scenes
by: Kumar, Rishav, et al.
Published: (2025)
by: Kumar, Rishav, et al.
Published: (2025)
Vehicle-Scene Interaction: A Text-Driven 3D Lidar Place Recognition Method for Autonomous Driving
by: Shang, Tianyi, et al.
Published: (2025)
by: Shang, Tianyi, et al.
Published: (2025)
Beyond Fixed Thresholds and Domain-Specific Benchmarks for Explainable Multi-Task Classification in Autonomous Vehicles
by: Azad, Maryam Sadat Hosseini, et al.
Published: (2026)
by: Azad, Maryam Sadat Hosseini, et al.
Published: (2026)
GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving
by: Zhang, Yunpeng, et al.
Published: (2024)
by: Zhang, Yunpeng, et al.
Published: (2024)
Multi-Resolution Alignment for Voxel Sparsity in Camera-Based 3D Semantic Scene Completion
by: Yang, Zhiwen, et al.
Published: (2026)
by: Yang, Zhiwen, et al.
Published: (2026)
Similar Items
-
CrashChat: A Multimodal Large Language Model for Multitask Traffic Crash Video Analysis
by: Liang, Kaidi, et al.
Published: (2025) -
HMPDM: A Diffusion Model for Driving Video Prediction with Historical Motion Priors
by: Li, Ke, et al.
Published: (2026) -
Attention-Enhanced Co-Interactive Fusion Network (AECIF-Net) for Automated Structural Condition Assessment in Visual Inspection
by: Zhang, Chenyu, et al.
Published: (2023) -
PAVE: An End-to-End Dataset for Production Autonomous Vehicle Evaluation
by: Li, Xiangyu, et al.
Published: (2025) -
Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
by: Zhang, Kaixin, et al.
Published: (2024)