Saved in:
| Main Authors: | Huang, Zile, Zhang, Chong, Jin, Mingyu, Wu, Fangyu, Liu, Chengzhi, Jin, Xiaobo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.06127 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multi-task Prompt Words Learning for Social Media Content Generation
by: Xue, Haochen, et al.
Published: (2024)
by: Xue, Haochen, et al.
Published: (2024)
Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss
by: Cai, Zhi, et al.
Published: (2023)
by: Cai, Zhi, et al.
Published: (2023)
OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality
by: Fan, Chao, et al.
Published: (2024)
by: Fan, Chao, et al.
Published: (2024)
Is Bigger Always Better? Efficiency Analysis in Resource-Constrained Small Object Detection
by: Mbobda-Kuate, Kwame, et al.
Published: (2026)
by: Mbobda-Kuate, Kwame, et al.
Published: (2026)
End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection
by: Wang, Fei, et al.
Published: (2025)
by: Wang, Fei, et al.
Published: (2025)
Bridging the Projection Gap: Overcoming Projection Bias Through Parameterized Distance Learning
by: Zhang, Chong, et al.
Published: (2023)
by: Zhang, Chong, et al.
Published: (2023)
Tracking by Detection and Query: An Efficient End-to-End Framework for Multi-Object Tracking
by: Jia, Shukun, et al.
Published: (2024)
by: Jia, Shukun, et al.
Published: (2024)
SpikeDet: Better Firing Patterns for Accurate and Energy-Efficient Object Detection with Spiking Neural Networks
by: Fan, Yimeng, et al.
Published: (2025)
by: Fan, Yimeng, et al.
Published: (2025)
Better Matching, Less Forgetting: A Quality-Guided Matcher for Transformer-based Incremental Object Detection
by: Wu, Qirui, et al.
Published: (2026)
by: Wu, Qirui, et al.
Published: (2026)
Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay
by: Wang, Kunyu, et al.
Published: (2025)
by: Wang, Kunyu, et al.
Published: (2025)
Are Sparse Neural Networks Better Hard Sample Learners?
by: Xiao, Qiao, et al.
Published: (2024)
by: Xiao, Qiao, et al.
Published: (2024)
MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection
by: Li, Yichen, et al.
Published: (2025)
by: Li, Yichen, et al.
Published: (2025)
PO3AD: Predicting Point Offsets toward Better 3D Point Cloud Anomaly Detection
by: Ye, Jianan, et al.
Published: (2024)
by: Ye, Jianan, et al.
Published: (2024)
Anomize: Better Open Vocabulary Video Anomaly Detection
by: Li, Fei, et al.
Published: (2025)
by: Li, Fei, et al.
Published: (2025)
Better Eyes, Better Thoughts: Why Vision Chain-of-Thought Fails in Medicine
by: Wu, Yuan, et al.
Published: (2026)
by: Wu, Yuan, et al.
Published: (2026)
SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting
by: Huang, Mingxin, et al.
Published: (2024)
by: Huang, Mingxin, et al.
Published: (2024)
Are Object-Centric Representations Better At Compositional Generalization?
by: Kapl, Ferdinand, et al.
Published: (2026)
by: Kapl, Ferdinand, et al.
Published: (2026)
UHR-DETR: Efficient End-to-End Small Object Detection for Ultra-High-Resolution Remote Sensing Imagery
by: Li, Jingfang, et al.
Published: (2026)
by: Li, Jingfang, et al.
Published: (2026)
Twin Trigger Generative Networks for Backdoor Attacks against Object Detection
by: Li, Zhiying, et al.
Published: (2024)
by: Li, Zhiying, et al.
Published: (2024)
A Simple and Better Baseline for Visual Grounding
by: Wang, Jingchao, et al.
Published: (2025)
by: Wang, Jingchao, et al.
Published: (2025)
AlphaVAE: Unified End-to-End RGBA Image Reconstruction and Generation with Alpha-Aware Representation Learning
by: Wang, Zile, et al.
Published: (2025)
by: Wang, Zile, et al.
Published: (2025)
ESOD: Efficient Small Object Detection on High-Resolution Images
by: Liu, Kai, et al.
Published: (2024)
by: Liu, Kai, et al.
Published: (2024)
Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
by: Tan, Jiaqi, et al.
Published: (2025)
by: Tan, Jiaqi, et al.
Published: (2025)
Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better
by: Li, Ruojing, et al.
Published: (2025)
by: Li, Ruojing, et al.
Published: (2025)
FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting
by: Wu, Fangyu, et al.
Published: (2024)
by: Wu, Fangyu, et al.
Published: (2024)
Decomposition Betters Tracking Everything Everywhere
by: Li, Rui, et al.
Published: (2024)
by: Li, Rui, et al.
Published: (2024)
Towards Better Robustness: Pose-Free 3D Gaussian Splatting for Arbitrarily Long Videos
by: Dong, Zhen-Hui, et al.
Published: (2025)
by: Dong, Zhen-Hui, et al.
Published: (2025)
A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning
by: Wu, Chengzhi, et al.
Published: (2025)
by: Wu, Chengzhi, et al.
Published: (2025)
Switch EMA: A Free Lunch for Better Flatness and Sharpness
by: Li, Siyuan, et al.
Published: (2024)
by: Li, Siyuan, et al.
Published: (2024)
Diffusion Feedback Helps CLIP See Better
by: Wang, Wenxuan, et al.
Published: (2024)
by: Wang, Wenxuan, et al.
Published: (2024)
OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization
by: Hou, Saihui, et al.
Published: (2024)
by: Hou, Saihui, et al.
Published: (2024)
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection
by: Chao, Yuhao, et al.
Published: (2025)
by: Chao, Yuhao, et al.
Published: (2025)
Independently Keypoint Learning for Small Object Semantic Correspondence
by: Jin, Hailong, et al.
Published: (2024)
by: Jin, Hailong, et al.
Published: (2024)
Adaptive Slicing-Assisted Hyper Inference for Enhanced Small Object Detection in High-Resolution Imagery
by: Moretti, Francesco, et al.
Published: (2026)
by: Moretti, Francesco, et al.
Published: (2026)
Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis
by: Liu, Chengzhi, et al.
Published: (2025)
by: Liu, Chengzhi, et al.
Published: (2025)
Modality Prompts for Arbitrary Modality Salient Object Detection
by: Huang, Nianchang, et al.
Published: (2024)
by: Huang, Nianchang, et al.
Published: (2024)
Towards A Better Metric for Text-to-Video Generation
by: Wu, Jay Zhangjie, et al.
Published: (2024)
by: Wu, Jay Zhangjie, et al.
Published: (2024)
Enhanced Textual Feature Extraction for Visual Question Answering: A Simple Convolutional Approach
by: Zhang, Zhilin, et al.
Published: (2024)
by: Zhang, Zhilin, et al.
Published: (2024)
DepthAgent: Towards Better Universal Depth Estimation via Sample-wise Expert Selection
by: Zhu, Jie, et al.
Published: (2026)
by: Zhu, Jie, et al.
Published: (2026)
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
by: Du, Zewen, et al.
Published: (2024)
by: Du, Zewen, et al.
Published: (2024)
Similar Items
-
Multi-task Prompt Words Learning for Social Media Content Generation
by: Xue, Haochen, et al.
Published: (2024) -
Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss
by: Cai, Zhi, et al.
Published: (2023) -
OpenGait: A Comprehensive Benchmark Study for Gait Recognition towards Better Practicality
by: Fan, Chao, et al.
Published: (2024) -
Is Bigger Always Better? Efficiency Analysis in Resource-Constrained Small Object Detection
by: Mbobda-Kuate, Kwame, et al.
Published: (2026) -
End4: End-to-end Denoising Diffusion for Diffusion-Based Inpainting Detection
by: Wang, Fei, et al.
Published: (2025)