Saved in:
| Main Authors: | Li, Baolu, Yu, Hongkai, Sun, Huiming, Ma, Jin, Lin, Yuewei, Ma, Lu, Du, Yonghua |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.00836 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Domain Adaptation based Object Detection for Autonomous Driving in Foggy and Rainy Weather
by: Li, Jinlong, et al.
Published: (2023)
by: Li, Jinlong, et al.
Published: (2023)
EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAV
by: Sun, Huiming, et al.
Published: (2024)
by: Sun, Huiming, et al.
Published: (2024)
Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources
by: Li, Jinlong, et al.
Published: (2024)
by: Li, Jinlong, et al.
Published: (2024)
S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality
by: Li, Jinlong, et al.
Published: (2023)
by: Li, Jinlong, et al.
Published: (2023)
Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
by: Li, Jinlong, et al.
Published: (2024)
by: Li, Jinlong, et al.
Published: (2024)
DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment
by: Wang, Wuqi, et al.
Published: (2026)
by: Wang, Wuqi, et al.
Published: (2026)
V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception
by: Li, Baolu, et al.
Published: (2025)
by: Li, Baolu, et al.
Published: (2025)
CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
by: Li, Jinlong, et al.
Published: (2024)
by: Li, Jinlong, et al.
Published: (2024)
VehicleGAN: Pair-flexible Pose Guided Image Synthesis for Vehicle Re-identification
by: Li, Baolu, et al.
Published: (2023)
by: Li, Baolu, et al.
Published: (2023)
Unsupervised Multi-agent and Single-agent Perception from Cooperative Views
by: Yang, Haochen, et al.
Published: (2026)
by: Yang, Haochen, et al.
Published: (2026)
AdvGPS: Adversarial GPS for Multi-Agent Perception Attack
by: Li, Jinlong, et al.
Published: (2024)
by: Li, Jinlong, et al.
Published: (2024)
V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions
by: Li, Baolu, et al.
Published: (2024)
by: Li, Baolu, et al.
Published: (2024)
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
by: Wang, Qinghe, et al.
Published: (2024)
by: Wang, Qinghe, et al.
Published: (2024)
GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents
by: Yu, Xi, et al.
Published: (2025)
by: Yu, Xi, et al.
Published: (2025)
A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving
by: Long, Keke, et al.
Published: (2025)
by: Long, Keke, et al.
Published: (2025)
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
by: Yang, Xindi, et al.
Published: (2025)
by: Yang, Xindi, et al.
Published: (2025)
Advancing Autonomous Driving Perception: Analysis of Sensor Fusion and Computer Vision Techniques
by: Bharti, Urvishkumar, et al.
Published: (2024)
by: Bharti, Urvishkumar, et al.
Published: (2024)
Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA
by: Wu, Zexi, et al.
Published: (2026)
by: Wu, Zexi, et al.
Published: (2026)
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
by: Yang, Yu, et al.
Published: (2024)
by: Yang, Yu, et al.
Published: (2024)
VisionPulse: Dynamic Visual Sparsity for Efficient Multimodal Reasoning
by: Xu, Hengbo, et al.
Published: (2026)
by: Xu, Hengbo, et al.
Published: (2026)
Leveraging Human-Machine Interactions for Computer Vision Dataset Quality Enhancement
by: Anzaku, Esla Timothy, et al.
Published: (2024)
by: Anzaku, Esla Timothy, et al.
Published: (2024)
More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
by: Lin, Hongkai, et al.
Published: (2025)
by: Lin, Hongkai, et al.
Published: (2025)
DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
by: Du, Xiaobiao, et al.
Published: (2024)
by: Du, Xiaobiao, et al.
Published: (2024)
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving
by: Chen, Xuesong, et al.
Published: (2025)
by: Chen, Xuesong, et al.
Published: (2025)
DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
by: Wu, Yifan, et al.
Published: (2024)
by: Wu, Yifan, et al.
Published: (2024)
Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology
by: Zhang, Jingyu, et al.
Published: (2024)
by: Zhang, Jingyu, et al.
Published: (2024)
EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning
by: Ma, Mingjie, et al.
Published: (2024)
by: Ma, Mingjie, et al.
Published: (2024)
Recent Advances of Continual Learning in Computer Vision: An Overview
by: Qu, Haoxuan, et al.
Published: (2021)
by: Qu, Haoxuan, et al.
Published: (2021)
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
by: Li, Baolu, et al.
Published: (2025)
by: Li, Baolu, et al.
Published: (2025)
HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models
by: Wei, Zhixiang, et al.
Published: (2025)
by: Wei, Zhixiang, et al.
Published: (2025)
CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation
by: Li, Jiahao, et al.
Published: (2025)
by: Li, Jiahao, et al.
Published: (2025)
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
by: Li, Changlin, et al.
Published: (2024)
by: Li, Changlin, et al.
Published: (2024)
Autonomous Computer Vision Development with Agentic AI
by: Kim, Jin, et al.
Published: (2025)
by: Kim, Jin, et al.
Published: (2025)
RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought
by: Qiao, Junbo, et al.
Published: (2025)
by: Qiao, Junbo, et al.
Published: (2025)
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
by: Li, Han, et al.
Published: (2024)
by: Li, Han, et al.
Published: (2024)
Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement
by: Li, Wenxuan, et al.
Published: (2024)
by: Li, Wenxuan, et al.
Published: (2024)
VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation
by: Lian, Ruyi, et al.
Published: (2024)
by: Lian, Ruyi, et al.
Published: (2024)
Hypergraph Convolutional Network based Weakly Supervised Point Cloud Semantic Segmentation with Scene-Level Annotations
by: Lu, Zhuheng, et al.
Published: (2022)
by: Lu, Zhuheng, et al.
Published: (2022)
Fine-grained Metrics for Point Cloud Semantic Segmentation
by: Lu, Zhuheng, et al.
Published: (2024)
by: Lu, Zhuheng, et al.
Published: (2024)
Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
by: Ahmed, Sk Miraj, et al.
Published: (2026)
by: Ahmed, Sk Miraj, et al.
Published: (2026)
Similar Items
-
Domain Adaptation based Object Detection for Autonomous Driving in Foggy and Rainy Weather
by: Li, Jinlong, et al.
Published: (2023) -
EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAV
by: Sun, Huiming, et al.
Published: (2024) -
Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources
by: Li, Jinlong, et al.
Published: (2024) -
S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality
by: Li, Jinlong, et al.
Published: (2023) -
Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
by: Li, Jinlong, et al.
Published: (2024)