:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Zile, Zhang, Chong, Jin, Mingyu, Wu, Fangyu, Liu, Chengzhi, Jin, Xiaobo
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.06127
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multi-task Prompt Words Learning for Social Media Content Generation
by: Xue, Haochen, et al.
Published: (2024)

Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss
by: Cai, Zhi, et al.
Published: (2023)

OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality
by: Fan, Chao, et al.
Published: (2024)

Is Bigger Always Better? Efficiency Analysis in Resource-Constrained Small Object Detection
by: Mbobda-Kuate, Kwame, et al.
Published: (2026)

End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection
by: Wang, Fei, et al.
Published: (2025)

Bridging the Projection Gap: Overcoming Projection Bias Through Parameterized Distance Learning
by: Zhang, Chong, et al.
Published: (2023)

Tracking by Detection and Query: An Efficient End-to-End Framework for Multi-Object Tracking
by: Jia, Shukun, et al.
Published: (2024)

SpikeDet: Better Firing Patterns for Accurate and Energy-Efficient Object Detection with Spiking Neural Networks
by: Fan, Yimeng, et al.
Published: (2025)

Better Matching, Less Forgetting: A Quality-Guided Matcher for Transformer-based Incremental Object Detection
by: Wu, Qirui, et al.
Published: (2026)

Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay
by: Wang, Kunyu, et al.
Published: (2025)

Are Sparse Neural Networks Better Hard Sample Learners?
by: Xiao, Qiao, et al.
Published: (2024)

MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection
by: Li, Yichen, et al.
Published: (2025)

PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection
by: Ye, Jianan, et al.
Published: (2024)

Anomize: Better Open Vocabulary Video Anomaly Detection
by: Li, Fei, et al.
Published: (2025)

Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine
by: Wu, Yuan, et al.
Published: (2026)

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
by: Huang, Mingxin, et al.
Published: (2024)

Are Object-Centric Representations Better At Compositional Generalization?
by: Kapl, Ferdinand, et al.
Published: (2026)

UHR-DETR: Efficient End-to-End Small Object Detection for Ultra-High-Resolution Remote Sensing Imagery
by: Li, Jingfang, et al.
Published: (2026)

Twin Trigger Generative Networks for Backdoor Attacks against Object Detection
by: Li, Zhiying, et al.
Published: (2024)

A Simple and Better Baseline for Visual Grounding
by: Wang, Jingchao, et al.
Published: (2025)

AlphaVAE: Unified End-to-End RGBA Image Reconstruction and Generation with Alpha-Aware Representation Learning
by: Wang, Zile, et al.
Published: (2025)

ESOD: Efficient Small Object Detection on High-Resolution Images
by: Liu, Kai, et al.
Published: (2024)

Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
by: Tan, Jiaqi, et al.
Published: (2025)

Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better
by: Li, Ruojing, et al.
Published: (2025)

FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting
by: Wu, Fangyu, et al.
Published: (2024)

Decomposition Betters Tracking Everything Everywhere
by: Li, Rui, et al.
Published: (2024)

Towards Better Robustness: Pose-Free 3D Gaussian Splatting for Arbitrarily Long Videos
by: Dong, Zhen-Hui, et al.
Published: (2025)

A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning
by: Wu, Chengzhi, et al.
Published: (2025)

Switch EMA: A Free Lunch for Better Flatness and Sharpness
by: Li, Siyuan, et al.
Published: (2024)

Diffusion Feedback Helps CLIP See Better
by: Wang, Wenxuan, et al.
Published: (2024)

OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization
by: Hou, Saihui, et al.
Published: (2024)

AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection
by: Chao, Yuhao, et al.
Published: (2025)

Independently Keypoint Learning for Small Object Semantic Correspondence
by: Jin, Hailong, et al.
Published: (2024)

Adaptive Slicing-Assisted Hyper Inference for Enhanced Small Object Detection in High-Resolution Imagery
by: Moretti, Francesco, et al.
Published: (2026)

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis
by: Liu, Chengzhi, et al.
Published: (2025)

Modality Prompts for Arbitrary Modality Salient Object Detection
by: Huang, Nianchang, et al.
Published: (2024)

Towards A Better Metric for Text-to-Video Generation
by: Wu, Jay Zhangjie, et al.
Published: (2024)

Enhanced Textual Feature Extraction for Visual Question Answering: A Simple Convolutional Approach
by: Zhang, Zhilin, et al.
Published: (2024)

DepthAgent: Towards Better Universal Depth Estimation via Sample-wise Expert Selection
by: Zhu, Jie, et al.
Published: (2026)

Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
by: Du, Zewen, et al.
Published: (2024)