:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Longrong, Zhou, Xianpan, Li, Xuewei, Qiao, Liang, Li, Zheyang, Yang, Ziwei, Wang, Gaoang, Li, Xi
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2308.14286
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model
by: Cai, Chaoxiang, et al.
Published: (2025)

LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
by: Zheng, Guangcong, et al.
Published: (2023)

Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension
by: Miao, Peihan, et al.
Published: (2022)

Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation
by: Liang, Jiawei, et al.
Published: (2022)

RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
by: Zheng, Guangcong, et al.
Published: (2025)

Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform
by: Zhou, Xianpan
Published: (2025)

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
by: Li, Xuewei, et al.
Published: (2023)

Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
by: Yang, Longrong, et al.
Published: (2024)

$\mathcal{B}^{3}$-Net: Controlled Posterior Bridge Learning for Multi-Task Dense Prediction
by: Zhou, Meihua, et al.
Published: (2026)

CrossKD: Cross-Head Knowledge Distillation for Object Detection
by: Wang, Jiabao, et al.
Published: (2023)

DSG-World: Learning a 3D Gaussian World Model from Dual State Videos
by: Hu, Wenhao, et al.
Published: (2025)

BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions
by: Zhang, Jingdong, et al.
Published: (2023)

DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection
by: Zhang, Yang, et al.
Published: (2025)

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
by: Wu, Tao, et al.
Published: (2024)

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)

Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
by: Wei, Riling, et al.
Published: (2025)

DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning
by: Lopes, Ivan, et al.
Published: (2022)

SphereDrag: Spherical Geometry-Aware Panoramic Image Editing
by: Feng, Zhiao, et al.
Published: (2025)

A Flying Bird Object Detection Method for Surveillance Video
by: Sun, Ziwei, et al.
Published: (2024)

Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement
by: Zhou, Chuhao, et al.
Published: (2024)

Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention
by: Huang, Zheyang, et al.
Published: (2025)

Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection
by: Ding, Rui, et al.
Published: (2026)

Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection
by: Huang, Xun, et al.
Published: (2024)

Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection
by: Zheng, Haowen, et al.
Published: (2024)

Ego3DT: Tracking Every 3D Object in Ego-centric Videos
by: Hao, Shengyu, et al.
Published: (2024)

DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
by: Zhou, Minghang, et al.
Published: (2024)

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey
by: Wang, Gaoang, et al.
Published: (2022)

Low-light Object Detection
by: Li, Pengpeng, et al.
Published: (2024)

TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation
by: Li, Mingwei, et al.
Published: (2026)

Object Style Diffusion for Generalized Object Detection in Urban Scene
by: Li, Hao, et al.
Published: (2024)

Active Object Detection with Knowledge Aggregation and Distillation from Large Models
by: Yang, Dejie, et al.
Published: (2024)

SAMKD: Spatial-aware Adaptive Masking Knowledge Distillation for Object Detection
by: Zhang, Zhourui, et al.
Published: (2025)

Hyperbolic Distillation: Geometry-Guided Cross-Modal Transfer for Robust 3D Object Detection
by: Ning, Kanglin, et al.
Published: (2026)

EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation
by: Liu, Longfei, et al.
Published: (2026)

MUSE: Multi-Scale Dense Self-Distillation for Nucleus Detection and Classification
by: Yang, Zijiang, et al.
Published: (2025)

CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs
by: Lei, Jingyu, et al.
Published: (2025)

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
by: Hu, Wenhao, et al.
Published: (2025)

OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
by: Yang, Longrong, et al.
Published: (2025)

Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis
by: Zhang, Xianda, et al.
Published: (2024)

Inconsistency-based Active Learning for LiDAR Object Detection
by: Rivera, Esteban, et al.
Published: (2025)