Similar Items
Transmission Line Detection Based on Improved Hough Transform
by: Song, Wei, et al.
Published: (2024)
by: Song, Wei, et al.
Published: (2024)
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
by: Xue, Han, et al.
Published: (2024)
by: Xue, Han, et al.
Published: (2024)
From Indoor To Outdoor: Unsupervised Domain Adaptive Gait Recognition
by: Wang, Likai, et al.
Published: (2022)
by: Wang, Likai, et al.
Published: (2022)
EI: Early Intervention for Multimodal Imaging based Disease Recognition
by: Wei, Qijie, et al.
Published: (2026)
by: Wei, Qijie, et al.
Published: (2026)
CVVNet: A Cross-Vertical-View Network for Gait Recognition
by: Li, Xiangru, et al.
Published: (2025)
by: Li, Xiangru, et al.
Published: (2025)
IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images
by: Wang, Meilin, et al.
Published: (2024)
by: Wang, Meilin, et al.
Published: (2024)
A Visual Self-attention Mechanism Facial Expression Recognition Network beyond Convnext
by: Nan, Bingyu, et al.
Published: (2025)
by: Nan, Bingyu, et al.
Published: (2025)
Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection
by: Li, Hao, et al.
Published: (2026)
by: Li, Hao, et al.
Published: (2026)
Ancient Script Image Recognition and Processing: A Review
by: Diao, Xiaolei, et al.
Published: (2025)
by: Diao, Xiaolei, et al.
Published: (2025)
Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition
by: Huang, Xiu-Feng, et al.
Published: (2024)
by: Huang, Xiu-Feng, et al.
Published: (2024)
Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer
by: Zhao, Guibin, et al.
Published: (2024)
by: Zhao, Guibin, et al.
Published: (2024)
Copyright Infringement Detection in Text-to-Image Diffusion Models via Differential Privacy
by: Man, Xiafeng, et al.
Published: (2025)
by: Man, Xiafeng, et al.
Published: (2025)
ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors
by: Li, Xingchen, et al.
Published: (2025)
by: Li, Xingchen, et al.
Published: (2025)
CasSR: Activating Image Power for Real-World Image Super-Resolution
by: Chen, Haolan, et al.
Published: (2024)
by: Chen, Haolan, et al.
Published: (2024)
Latency-aware Unified Dynamic Networks for Efficient Image Recognition
by: Han, Yizeng, et al.
Published: (2023)
by: Han, Yizeng, et al.
Published: (2023)
VFM$^{4}$SDG: Unveiling the Power of VFMs for Single-Domain Generalized Object Detection
by: Zhang, Yupeng, et al.
Published: (2026)
by: Zhang, Yupeng, et al.
Published: (2026)
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
by: Tian, Changyao, et al.
Published: (2023)
by: Tian, Changyao, et al.
Published: (2023)
TAKT: Target-Aware Knowledge Transfer for Whole Slide Image Classification
by: Xiong, Conghao, et al.
Published: (2023)
by: Xiong, Conghao, et al.
Published: (2023)
Exploring Open-Vocabulary Object Recognition in Images using CLIP
by: Chen, Wei Yu, et al.
Published: (2026)
by: Chen, Wei Yu, et al.
Published: (2026)
Unveiling the Power of Self-supervision for Multi-view Multi-human Association and Tracking
by: Feng, Wei, et al.
Published: (2024)
by: Feng, Wei, et al.
Published: (2024)
Phase-Interface Instance Segmentation as a Visual Sensor for Laboratory Process Monitoring
by: Li, Mingyue, et al.
Published: (2026)
by: Li, Mingyue, et al.
Published: (2026)
How Powerful Potential of Attention on Image Restoration?
by: Wang, Cong, et al.
Published: (2024)
by: Wang, Cong, et al.
Published: (2024)
Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition
by: Tang, Wei, et al.
Published: (2025)
by: Tang, Wei, et al.
Published: (2025)
SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
by: Li, Weijie, et al.
Published: (2024)
by: Li, Weijie, et al.
Published: (2024)
Revisiting Generative Infrared and Visible Image Fusion Based on Human Cognitive Laws
by: Guo, Lin, et al.
Published: (2025)
by: Guo, Lin, et al.
Published: (2025)
Leveraging CLIP Encoder for Multimodal Emotion Recognition
by: Song, Yehun, et al.
Published: (2025)
by: Song, Yehun, et al.
Published: (2025)
Beyond First-Order: Learning Riemannian Geometries for Invariant Visual Place Recognition
by: Cheng, Jintao, et al.
Published: (2026)
by: Cheng, Jintao, et al.
Published: (2026)
LIPT: Latency-aware Image Processing Transformer
by: Qiao, Junbo, et al.
Published: (2024)
by: Qiao, Junbo, et al.
Published: (2024)
QGFace: Quality-Guided Joint Training For Mixed-Quality Face Recognition
by: Song, Youzhe, et al.
Published: (2023)
by: Song, Youzhe, et al.
Published: (2023)
PRFusion: Toward Effective and Robust Multi-Modal Place Recognition with Image and Point Cloud Fusion
by: Wang, Sijie, et al.
Published: (2024)
by: Wang, Sijie, et al.
Published: (2024)
Unleashing the Power of Discrete-Time State Representation: Ultrafast Target-based IMU-Camera Spatial-Temporal Calibration
by: Song, Junlin, et al.
Published: (2025)
by: Song, Junlin, et al.
Published: (2025)
eLasmobranc Dataset: An Image Dataset for Elasmobranch Species Recognition and Biodiversity Monitoring
by: Beviá-Ballesteros, Ismael, et al.
Published: (2026)
by: Beviá-Ballesteros, Ismael, et al.
Published: (2026)
Large-scale Remote Sensing Image Target Recognition and Automatic Annotation
by: Dong, Wuzheng
Published: (2024)
by: Dong, Wuzheng
Published: (2024)
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
by: Wei, Fanyue, et al.
Published: (2024)
by: Wei, Fanyue, et al.
Published: (2024)
Informative Text-Image Alignment for Visual Affordance Learning with Foundation Models
by: Zhang, Qian, et al.
Published: (2025)
by: Zhang, Qian, et al.
Published: (2025)
Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images
by: Li, Wenrui, et al.
Published: (2024)
by: Li, Wenrui, et al.
Published: (2024)
Sample-level Adaptive Knowledge Distillation for Action Recognition
by: Li, Ping, et al.
Published: (2025)
by: Li, Ping, et al.
Published: (2025)
SRRM: Semantic Region Relation Model for Indoor Scene Recognition
by: Song, Chuanxin, et al.
Published: (2023)
by: Song, Chuanxin, et al.
Published: (2023)
ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition
by: Xue, Mengqi, et al.
Published: (2022)
by: Xue, Mengqi, et al.
Published: (2022)
On-Device Training Under 256KB Memory
by: Lin, Ji, et al.
Published: (2022)
by: Lin, Ji, et al.
Published: (2022)
Similar Items
-
Transmission Line Detection Based on Improved Hough Transform
by: Song, Wei, et al.
Published: (2024) -
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation
by: Xue, Han, et al.
Published: (2024) -
From Indoor To Outdoor: Unsupervised Domain Adaptive Gait Recognition
by: Wang, Likai, et al.
Published: (2022) -
EI: Early Intervention for Multimodal Imaging based Disease Recognition
by: Wei, Qijie, et al.
Published: (2026) -
CVVNet: A Cross-Vertical-View Network for Gait Recognition
by: Li, Xiangru, et al.
Published: (2025)