:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Si, Zhaofeng, Lyu, Siwei
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition I.4.7
Online Access:	https://arxiv.org/abs/2504.17594
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Text-Visual Semantic Constrained AI-Generated Image Quality Assessment
by: Li, Qiang, et al.
Published: (2025)

High-Throughput Phenotyping using Computer Vision and Machine Learning
by: Singhvi, Vivaan, et al.
Published: (2024)

Modeling Time-Lapse Trajectories to Characterize Cranberry Growth
by: John, Ronan, et al.
Published: (2025)

Revisiting Multi-Granularity Representation via Group Contrastive Learning for Unsupervised Vehicle Re-identification
by: Chang, Zhigang, et al.
Published: (2024)

Carotid Artery Plaque Analysis in 3D Based on Distance Encoding in Mesh Representations
by: Rahlfs, Hinrich, et al.
Published: (2025)

MARVO: Marine-Adaptive Radiance-aware Visual Odometry
by: Sundar, Sacchin, et al.
Published: (2025)

Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript
by: Fujioka, Takumu, et al.
Published: (2025)

A Genealogy of Foundation Models in Remote Sensing
by: Lane, Kevin, et al.
Published: (2025)

Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
by: Li, Kai, et al.
Published: (2023)

CNN-based local features for navigation near an asteroid
by: Knuuttila, Olli, et al.
Published: (2023)

Supervised Embedded Methods for Hyperspectral Band Selection
by: Zimmer, Yaniv, et al.
Published: (2024)

A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms
by: Öfverstedt, Johan, et al.
Published: (2024)

Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection
by: Ardelean, Andrei-Timotei, et al.
Published: (2025)

Combined Hyperbolic and Euclidean Soft Triple Loss Beyond the Single Space Deep Metric Learning
by: Saeki, Shozo, et al.
Published: (2025)

Quantifying and Inducing Shape Bias in CNNs via Max-Pool Dilation
by: Sawada, Takito, et al.
Published: (2026)

Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders
by: Dokme, Atahan, et al.
Published: (2026)

Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods
by: Yu, Hyeon, et al.
Published: (2024)

Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures
by: Cozma, Andrei, et al.
Published: (2024)

Sign language recognition based on deep learning and low-cost handcrafted descriptors
by: Carneiro, Alvaro Leandro Cavalcante, et al.
Published: (2024)

Trapped in texture bias? A large scale comparison of deep instance segmentation
by: Theodoridis, Johannes, et al.
Published: (2024)

Exploiting Causality Signals in Medical Images: A Pilot Study with Empirical Results
by: Carloni, Gianluca, et al.
Published: (2023)

cp_measure: API-first feature extraction for image-based profiling workflows
by: Muñoz, Alán F., et al.
Published: (2025)

Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans
by: Venturini, Lorenzo, et al.
Published: (2024)

OOD Detection with immature Models
by: Montazeran, Behrooz, et al.
Published: (2025)

K-means Enhanced Density Gradient Analysis for Urban and Transport Metrics Using Multi-Modal Satellite Imagery
by: Tomkiewicz, P., et al.
Published: (2025)

Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
by: Slika, Bouthaina, et al.
Published: (2023)

Vision transformers in domain adaptation and domain generalization: a study of robustness
by: Alijani, Shadi, et al.
Published: (2024)

GraphTEN: Graph Enhanced Texture Encoding Network
by: Peng, Bo, et al.
Published: (2025)

On-Device Generative AI for GDPR-Compliant Visual Monitoring: Natural Language Alerts from Local Object Detection
by: Schappacher-Tilp, Gudrun, et al.
Published: (2026)

Time Step Generating: A Universal Synthesized Deepfake Image Detector
by: Zeng, Ziyue, et al.
Published: (2024)

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets
by: Chuquimarca, Luis, et al.
Published: (2025)

Fast Normalized Cross-Correlation for Template Matching with Rotations
by: Almira, José María, et al.
Published: (2023)

MANGO: Learning Disentangled Image Transformation Manifolds with Grouped Operators
by: Ancelin, Brighton, et al.
Published: (2024)

IDFace: Face Template Protection for Efficient and Secure Identification
by: Kim, Sunpill, et al.
Published: (2025)

A Multimodal Pipeline for Clinical Data Extraction: Applying Vision-Language Models to Scans of Transfusion Reaction Reports
by: Schäfer, Henning, et al.
Published: (2025)

ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums
by: Hawley, Scott H., et al.
Published: (2021)

RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks
by: Agarwal, Amit, et al.
Published: (2025)

Spatially Optimized Compact Deep Metric Learning Model for Similarity Search
by: Islam, Md. Farhadul, et al.
Published: (2024)

Surrealistic-like Image Generation with Vision-Language Models
by: Ayten, Elif, et al.
Published: (2024)

Multimodal Image Matching based on Frequency-domain Information of Local Energy Response
by: Yang, Meng, et al.
Published: (2025)