:: Library Catalog

Bibliographic Details
Main Authors:	Song, Hao; Lin, Wei; Song, Wei; Wang, Man
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2402.06152

Similar Items

Transmission Line Detection Based on Improved Hough Transform
by: Song, Wei, et al.
Published: (2024)

In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
by: Xue, Han, et al.
Published: (2024)

From Indoor To Outdoor: Unsupervised Domain Adaptive Gait Recognition
by: Wang, Likai, et al.
Published: (2022)

EI: Early Intervention for Multimodal Imaging based Disease Recognition
by: Wei, Qijie, et al.
Published: (2026)

CVVNet: A Cross-Vertical-View Network for Gait Recognition
by: Li, Xiangru, et al.
Published: (2025)

IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
by: Wang, Meilin, et al.
Published: (2024)

A Visual Self-attention Mechanism Facial Expression Recognition Network beyond Convnext
by: Nan, Bingyu, et al.
Published: (2025)

Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection
by: Li, Hao, et al.
Published: (2026)

Ancient Script Image Recognition and Processing: A Review
by: Diao, Xiaolei, et al.
Published: (2025)

Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition
by: Huang, Xiu-Feng, et al.
Published: (2024)

Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer
by: Zhao, Guibin, et al.
Published: (2024)

Copyright Infringement Detection in Text-to-Image Diffusion Models via Differential Privacy
by: Man, Xiafeng, et al.
Published: (2025)

ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors
by: Li, Xingchen, et al.
Published: (2025)

CasSR: Activating Image Power for Real-World Image Super-Resolution
by: Chen, Haolan, et al.
Published: (2024)

Latency-aware Unified Dynamic Networks for Efficient Image Recognition
by: Han, Yizeng, et al.
Published: (2023)

VFM$^{4}$SDG: Unveiling the Power of VFMs for Single-Domain Generalized Object Detection
by: Zhang, Yupeng, et al.
Published: (2026)

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
by: Tian, Changyao, et al.
Published: (2023)

TAKT: Target-Aware Knowledge Transfer for Whole Slide Image Classification
by: Xiong, Conghao, et al.
Published: (2023)

Exploring Open-Vocabulary Object Recognition in Images using CLIP
by: Chen, Wei Yu, et al.
Published: (2026)

Unveiling the Power of Self-supervision for Multi-view Multi-human Association and Tracking
by: Feng, Wei, et al.
Published: (2024)

Phase-Interface Instance Segmentation as a Visual Sensor for Laboratory Process Monitoring
by: Li, Mingyue, et al.
Published: (2026)

How Powerful Potential of Attention on Image Restoration?
by: Wang, Cong, et al.
Published: (2024)

Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition
by: Tang, Wei, et al.
Published: (2025)

SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
by: Li, Weijie, et al.
Published: (2024)

Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
by: Guo, Lin, et al.
Published: (2025)

Leveraging CLIP Encoder for Multimodal Emotion Recognition
by: Song, Yehun, et al.
Published: (2025)

Beyond First-Order: Learning Riemannian Geometries for Invariant Visual Place Recognition
by: Cheng, Jintao, et al.
Published: (2026)

LIPT: Latency-aware Image Processing Transformer
by: Qiao, Junbo, et al.
Published: (2024)

QGFace: Quality-Guided Joint Training For Mixed-Quality Face Recognition
by: Song, Youzhe, et al.
Published: (2023)

PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion
by: Wang, Sijie, et al.
Published: (2024)

Unleashing the Power of Discrete-Time State Representation: Ultrafast Target-based IMU-Camera Spatial-Temporal Calibration
by: Song, Junlin, et al.
Published: (2025)

eLasmobranc Dataset: An Image Dataset for Elasmobranch Species Recognition and Biodiversity Monitoring
by: Beviá-Ballesteros, Ismael, et al.
Published: (2026)

Large-scale Remote Sensing Image Target Recognition and Automatic Annotation
by: Dong, Wuzheng
Published: (2024)

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
by: Wei, Fanyue, et al.
Published: (2024)

Informative Text-Image Alignment for Visual Affordance Learning with Foundation Models
by: Zhang, Qian, et al.
Published: (2025)

Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images
by: Li, Wenrui, et al.
Published: (2024)

Sample-level Adaptive Knowledge Distillation for Action Recognition
by: Li, Ping, et al.
Published: (2025)

SRRM: Semantic Region Relation Model for Indoor Scene Recognition
by: Song, Chuanxin, et al.
Published: (2023)

ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
by: Xue, Mengqi, et al.
Published: (2022)

On-Device Training Under 256KB Memory
by: Lin, Ji, et al.
Published: (2022)