| Main Authors: | Chen, Xiahan, Chen, Mingjian, Tang, Sanli, Niu, Yi, Zhu, Jiang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.05280 |
Similar Items
SGV3D: Towards Scenario Generalization for Vision-based Roadside 3D Object Detection
by: Yang, Lei, et al.
Published: (2024)
TEACH: Text Encoding as Curriculum Hints for Scene Text Recognition
by: Yang, Xiahan, et al.
Published: (2025)
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)
HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer
by: Zhang, Zhang, et al.
Published: (2025)
RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception
by: Zhu, Xiaosu, et al.
Published: (2024)
HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective
by: Liu, Pei, et al.
Published: (2024)
Weighted Bayesian Gaussian Mixture Model for Roadside LiDAR Object Detection
by: Zhang, Tianya, et al.
Published: (2022)
2.5D Object Detection for Intelligent Roadside Infrastructure
by: Polley, Nikolai, et al.
Published: (2025)
Roadside Monocular 3D Detection Prompted by 2D Detection
by: Ma, Yechi, et al.
Published: (2024)
Delving into Dynamic Scene Cue-Consistency for Robust 3D Multi-Object Tracking
by: Zhang, Haonan, et al.
Published: (2025)
HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
by: Feng, Changfeng, et al.
Published: (2024)
Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track
by: Gao, Mingqi, et al.
Published: (2026)
TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder
by: Meng, Qinghao, et al.
Published: (2025)
RoadSceneVQA: Benchmarking Visual Question Answering in Roadside Perception Systems for Intelligent Transportation System
by: Guan, Runwei, et al.
Published: (2025)
Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
by: Mao, Ruiyu, et al.
Published: (2026)
Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection
by: Moon, Seokha, et al.
Published: (2024)
IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain
by: Wang, Zhe, et al.
Published: (2025)
WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection
by: Zhou, Xingcheng, et al.
Published: (2024)
CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity
by: Shi, Hao, et al.
Published: (2023)
STSeg-Complex Video Object Segmentation: The 1st Solution for 4th PVUW MOSE Challenge
by: Song, Kehuan, et al.
Published: (2025)
FedRSU: Federated Learning for Scene Flow Estimation on Roadside Units
by: Fang, Shaoheng, et al.
Published: (2024)
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
by: Yin, Junbo, et al.
Published: (2024)
Syn3DTxt: Embedding 3D Cues for Scene Text Generation
by: Hsiung, Li-Syun, et al.
Published: (2025)
1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
by: Miao, Deshui, et al.
Published: (2024)
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Xu, Zhensong, et al.
Published: (2024)
3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling
by: Jiang, Chaokang, et al.
Published: (2024)
Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
by: Zhang, Runmin, et al.
Published: (2025)
Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation
by: Luo, Kang, et al.
Published: (2026)
RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection
by: Wu, Tao, et al.
Published: (2026)
3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation
by: Liu, Xinyu, et al.
Published: (2024)
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution
by: Wang, Mengjiao, et al.
Published: (2025)
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track
by: Cao, Xuqiang, et al.
Published: (2025)
Density-based Object Detection in Crowded Scenes
by: Zhao, Chenyang, et al.
Published: (2025)
Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations
by: Wang, Meng, et al.
Published: (2025)
Fusion Meets Diverse Conditions: A High-diversity Benchmark and Baseline for UAV-based Multimodal Object Detection with Condition Cues
by: Chen, Chen, et al.
Published: (2025)
RoamScene3D: Immersive Text-to-3D Scene Generation via Adaptive Object-aware Roaming
by: Chu, Jisheng, et al.
Published: (2026)
Zoo3D: Zero-Shot 3D Object Detection at Scene Level
by: Lemeshko, Andrey, et al.
Published: (2025)
Multimodal HD Mapping for Intersections by Intelligent Roadside Units
by: Chen, Zhongzhang, et al.
Published: (2025)
SOP^2: Transfer Learning with Scene-Oriented Prompt Pool on 3D Object Detection
by: Cheng, Ching-Hung, et al.
Published: (2025)
Descrip3D: Enhancing Large Language Model-based 3D Scene Understanding with Object-Level Text Descriptions
by: Xue, Jintang, et al.
Published: (2025)