Saved in:
| Main Authors: | Yang, Longrong, Zhou, Xianpan, Li, Xuewei, Qiao, Liang, Li, Zheyang, Yang, Ziwei, Wang, Gaoang, Li, Xi |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2308.14286 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model
by: Cai, Chaoxiang, et al.
Published: (2025)
by: Cai, Chaoxiang, et al.
Published: (2025)
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
by: Zheng, Guangcong, et al.
Published: (2023)
by: Zheng, Guangcong, et al.
Published: (2023)
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension
by: Miao, Peihan, et al.
Published: (2022)
by: Miao, Peihan, et al.
Published: (2022)
Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation
by: Liang, Jiawei, et al.
Published: (2022)
by: Liang, Jiawei, et al.
Published: (2022)
RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
by: Zheng, Guangcong, et al.
Published: (2025)
by: Zheng, Guangcong, et al.
Published: (2025)
Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform
by: Zhou, Xianpan
Published: (2025)
by: Zhou, Xianpan
Published: (2025)
SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
by: Li, Xuewei, et al.
Published: (2023)
by: Li, Xuewei, et al.
Published: (2023)
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
by: Yang, Longrong, et al.
Published: (2024)
by: Yang, Longrong, et al.
Published: (2024)
$\mathcal{B}^{3}$-Net: Controlled Posterior Bridge Learning for Multi-Task Dense Prediction
by: Zhou, Meihua, et al.
Published: (2026)
by: Zhou, Meihua, et al.
Published: (2026)
CrossKD: Cross-Head Knowledge Distillation for Object Detection
by: Wang, Jiabao, et al.
Published: (2023)
by: Wang, Jiabao, et al.
Published: (2023)
DSG-World: Learning a 3D Gaussian World Model from Dual State Videos
by: Hu, Wenhao, et al.
Published: (2025)
by: Hu, Wenhao, et al.
Published: (2025)
BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions
by: Zhang, Jingdong, et al.
Published: (2023)
by: Zhang, Jingdong, et al.
Published: (2023)
DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection
by: Zhang, Yang, et al.
Published: (2025)
by: Zhang, Yang, et al.
Published: (2025)
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
by: Wu, Tao, et al.
Published: (2024)
by: Wu, Tao, et al.
Published: (2024)
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
by: Wang, Wenjie, et al.
Published: (2024)
by: Wang, Wenjie, et al.
Published: (2024)
Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
by: Wei, Riling, et al.
Published: (2025)
by: Wei, Riling, et al.
Published: (2025)
DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning
by: Lopes, Ivan, et al.
Published: (2022)
by: Lopes, Ivan, et al.
Published: (2022)
SphereDrag: Spherical Geometry-Aware Panoramic Image Editing
by: Feng, Zhiao, et al.
Published: (2025)
by: Feng, Zhiao, et al.
Published: (2025)
A Flying Bird Object Detection Method for Surveillance Video
by: Sun, Ziwei, et al.
Published: (2024)
by: Sun, Ziwei, et al.
Published: (2024)
Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement
by: Zhou, Chuhao, et al.
Published: (2024)
by: Zhou, Chuhao, et al.
Published: (2024)
Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention
by: Huang, Zheyang, et al.
Published: (2025)
by: Huang, Zheyang, et al.
Published: (2025)
Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection
by: Ding, Rui, et al.
Published: (2026)
by: Ding, Rui, et al.
Published: (2026)
Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection
by: Huang, Xun, et al.
Published: (2024)
by: Huang, Xun, et al.
Published: (2024)
Distilling Temporal Knowledge with Masked Feature Reconstruction for 3D Object Detection
by: Zheng, Haowen, et al.
Published: (2024)
by: Zheng, Haowen, et al.
Published: (2024)
Ego3DT: Tracking Every 3D Object in Ego-centric Videos
by: Hao, Shengyu, et al.
Published: (2024)
by: Hao, Shengyu, et al.
Published: (2024)
DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing
by: Zhou, Minghang, et al.
Published: (2024)
by: Zhou, Minghang, et al.
Published: (2024)
Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey
by: Wang, Gaoang, et al.
Published: (2022)
by: Wang, Gaoang, et al.
Published: (2022)
Low-light Object Detection
by: Li, Pengpeng, et al.
Published: (2024)
by: Li, Pengpeng, et al.
Published: (2024)
TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation
by: Li, Mingwei, et al.
Published: (2026)
by: Li, Mingwei, et al.
Published: (2026)
Object Style Diffusion for Generalized Object Detection in Urban Scene
by: Li, Hao, et al.
Published: (2024)
by: Li, Hao, et al.
Published: (2024)
Active Object Detection with Knowledge Aggregation and Distillation from Large Models
by: Yang, Dejie, et al.
Published: (2024)
by: Yang, Dejie, et al.
Published: (2024)
SAMKD: Spatial-aware Adaptive Masking Knowledge Distillation for Object Detection
by: Zhang, Zhourui, et al.
Published: (2025)
by: Zhang, Zhourui, et al.
Published: (2025)
Hyperbolic Distillation: Geometry-Guided Cross-Modal Transfer for Robust 3D Object Detection
by: Ning, Kanglin, et al.
Published: (2026)
by: Ning, Kanglin, et al.
Published: (2026)
EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation
by: Liu, Longfei, et al.
Published: (2026)
by: Liu, Longfei, et al.
Published: (2026)
MUSE: Multi-Scale Dense Self-Distillation for Nucleus Detection and Classification
by: Yang, Zijiang, et al.
Published: (2025)
by: Yang, Zijiang, et al.
Published: (2025)
CORE: Compact Object-centric REpresentations as a New Paradigm for Token Merging in LVLMs
by: Lei, Jingyu, et al.
Published: (2025)
by: Lei, Jingyu, et al.
Published: (2025)
IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
by: Hu, Wenhao, et al.
Published: (2025)
by: Hu, Wenhao, et al.
Published: (2025)
OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
by: Yang, Longrong, et al.
Published: (2025)
by: Yang, Longrong, et al.
Published: (2025)
Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis
by: Zhang, Xianda, et al.
Published: (2024)
by: Zhang, Xianda, et al.
Published: (2024)
Inconsistency-based Active Learning for LiDAR Object Detection
by: Rivera, Esteban, et al.
Published: (2025)
by: Rivera, Esteban, et al.
Published: (2025)
Similar Items
-
Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model
by: Cai, Chaoxiang, et al.
Published: (2025) -
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
by: Zheng, Guangcong, et al.
Published: (2023) -
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension
by: Miao, Peihan, et al.
Published: (2022) -
Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation
by: Liang, Jiawei, et al.
Published: (2022) -
RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
by: Zheng, Guangcong, et al.
Published: (2025)