:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Baolu, Yu, Hongkai, Sun, Huiming, Ma, Jin, Lin, Yuewei, Ma, Lu, Du, Yonghua
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.00836
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Domain Adaptation based Object Detection for Autonomous Driving in Foggy and Rainy Weather
by: Li, Jinlong, et al.
Published: (2023)

EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAV
by: Sun, Huiming, et al.
Published: (2024)

Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources
by: Li, Jinlong, et al.
Published: (2024)

S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality
by: Li, Jinlong, et al.
Published: (2023)

Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
by: Li, Jinlong, et al.
Published: (2024)

DarkDriving: A Real-World Day and Night Aligned Dataset for Autonomous Driving in the Dark Environment
by: Wang, Wuqi, et al.
Published: (2026)

V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception
by: Li, Baolu, et al.
Published: (2025)

CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
by: Li, Jinlong, et al.
Published: (2024)

VehicleGAN: Pair-flexible Pose Guided Image Synthesis for Vehicle Re-identification
by: Li, Baolu, et al.
Published: (2023)

Unsupervised Multi-agent and Single-agent Perception from Cooperative Views
by: Yang, Haochen, et al.
Published: (2026)

AdvGPS: Adversarial GPS for Multi-Agent Perception Attack
by: Li, Jinlong, et al.
Published: (2024)

V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions
by: Li, Baolu, et al.
Published: (2024)

CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
by: Wang, Qinghe, et al.
Published: (2024)

GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents
by: Yu, Xi, et al.
Published: (2025)

A Low-Rank Method for Vision Language Model Hallucination Mitigation in Autonomous Driving
by: Long, Keke, et al.
Published: (2025)

VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
by: Yang, Xindi, et al.
Published: (2025)

Advancing Autonomous Driving Perception: Analysis of Sensor Fusion and Computer Vision Techniques
by: Bharti, Urvishkumar, et al.
Published: (2024)

Video2LoRA: Unified Semantic-Controlled Video Generation via Per-Reference-Video LoRA
by: Wu, Zexi, et al.
Published: (2026)

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
by: Yang, Yu, et al.
Published: (2024)

VisionPulse: Dynamic Visual Sparsity for Efficient Multimodal Reasoning
by: Xu, Hengbo, et al.
Published: (2026)

Leveraging Human-Machine Interactions for Computer Vision Dataset Quality Enhancement
by: Anzaku, Esla Timothy, et al.
Published: (2024)

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
by: Lin, Hongkai, et al.
Published: (2025)

DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
by: Du, Xiaobiao, et al.
Published: (2024)

SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving
by: Chen, Xuesong, et al.
Published: (2025)

DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
by: Wu, Yifan, et al.
Published: (2024)

Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology
by: Zhang, Jingyu, et al.
Published: (2024)

EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning
by: Ma, Mingjie, et al.
Published: (2024)

Recent Advances of Continual Learning in Computer Vision: An Overview
by: Qu, Haoxuan, et al.
Published: (2021)

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
by: Li, Baolu, et al.
Published: (2025)

HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models
by: Wei, Zhixiang, et al.
Published: (2025)

CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation
by: Li, Jiahao, et al.
Published: (2025)

Efficient Training of Large Vision Models via Advanced Automated Progressive Learning
by: Li, Changlin, et al.
Published: (2024)

Autonomous Computer Vision Development with Agentic AI
by: Kim, Jin, et al.
Published: (2025)

RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought
by: Qiao, Junbo, et al.
Published: (2025)

Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
by: Li, Han, et al.
Published: (2024)

Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement
by: Li, Wenxuan, et al.
Published: (2024)

VAPO: Visibility-Aware Keypoint Localization for Efficient 6DoF Object Pose Estimation
by: Lian, Ruyi, et al.
Published: (2024)

Hypergraph Convolutional Network based Weakly Supervised Point Cloud Semantic Segmentation with Scene-Level Annotations
by: Lu, Zhuheng, et al.
Published: (2022)

Fine-grained Metrics for Point Cloud Semantic Segmentation
by: Lu, Zhuheng, et al.
Published: (2024)

Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
by: Ahmed, Sk Miraj, et al.
Published: (2026)