:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Xiahan, Chen, Mingjian, Tang, Sanli, Niu, Yi, Zhu, Jiang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.05280
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection
by: Yang, Lei, et al.
Published: (2024)

TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
by: Yang, Xiahan, et al.
Published: (2025)

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)

HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer
by: Zhang, Zhang, et al.
Published: (2025)

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
by: Zhu, Xiaosu, et al.
Published: (2024)

HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective
by: Liu, Pei, et al.
Published: (2024)

Weighted Bayesian Gaussian Mixture Model for Roadside LiDAR Object Detection
by: Zhang, Tianya, et al.
Published: (2022)

2.5D Object Detection for Intelligent Roadside Infrastructure
by: Polley, Nikolai, et al.
Published: (2025)

Roadside Monocular 3D Detection Prompted by 2D Detection
by: Ma, Yechi, et al.
Published: (2024)

Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking
by: Zhang, Haonan, et al.
Published: (2025)

HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
by: Feng, Changfeng, et al.
Published: (2024)

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track
by: Gao, Mingqi, et al.
Published: (2026)

TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder
by: Meng, Qinghao, et al.
Published: (2025)

RoadSceneVQA: Benchmarking Visual Question Answering in Roadside Perception Systems for Intelligent Transportation System
by: Guan, Runwei, et al.
Published: (2025)

Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
by: Mao, Ruiyu, et al.
Published: (2026)

Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection
by: Moon, Seokha, et al.
Published: (2024)

IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain
by: Wang, Zhe, et al.
Published: (2025)

WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection
by: Zhou, Xingcheng, et al.
Published: (2024)

CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity
by: Shi, Hao, et al.
Published: (2023)

STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
by: Song, Kehuan, et al.
Published: (2025)

FedRSU: Federated Learning for Scene Flow Estimation on Roadside Units
by: Fang, Shaoheng, et al.
Published: (2024)

IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
by: Yin, Junbo, et al.
Published: (2024)

Syn3DTxt: Embedding 3D Cues for Scene Text Generation
by: Hsiung, Li-Syun, et al.
Published: (2025)

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
by: Miao, Deshui, et al.
Published: (2024)

2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Xu, Zhensong, et al.
Published: (2024)

3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling
by: Jiang, Chaokang, et al.
Published: (2024)

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
by: Zhang, Runmin, et al.
Published: (2025)

Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation
by: Luo, Kang, et al.
Published: (2026)

RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection
by: Wu, Tao, et al.
Published: (2026)

3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Liu, Xinyu, et al.
Published: (2024)

FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)

MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
by: Cao, Xuqiang, et al.
Published: (2025)

Density-based Object Detection in Crowded Scenes
by: Zhao, Chenyang, et al.
Published: (2025)

Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations
by: Wang, Meng, et al.
Published: (2025)

Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues
by: Chen, Chen, et al.
Published: (2025)

RoamScene3D: Immersive Text-to-3D Scene Generation via Adaptive Object-aware Roaming
by: Chu, Jisheng, et al.
Published: (2026)

Zoo3D: Zero-Shot 3D Object Detection at Scene Level
by: Lemeshko, Andrey, et al.
Published: (2025)

Multimodal HD Mapping for Intersections by Intelligent Roadside Units
by: Chen, Zhongzhang, et al.
Published: (2025)

SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection
by: Cheng, Ching-Hung, et al.
Published: (2025)

Descrip3D: Enhancing Large Language Model-based 3D Scene Understanding with Object-Level Text Descriptions
by: Xue, Jintang, et al.
Published: (2025)