Saved in:
| Main Authors: | Si, Zhaofeng, Lyu, Siwei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.17594 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Text-Visual Semantic Constrained AI-Generated Image Quality Assessment
by: Li, Qiang, et al.
Published: (2025)
by: Li, Qiang, et al.
Published: (2025)
High-Throughput Phenotyping using Computer Vision and Machine Learning
by: Singhvi, Vivaan, et al.
Published: (2024)
by: Singhvi, Vivaan, et al.
Published: (2024)
Modeling Time-Lapse Trajectories to Characterize Cranberry Growth
by: John, Ronan, et al.
Published: (2025)
by: John, Ronan, et al.
Published: (2025)
Revisiting Multi-Granularity Representation via Group Contrastive Learning for Unsupervised Vehicle Re-identification
by: Chang, Zhigang, et al.
Published: (2024)
by: Chang, Zhigang, et al.
Published: (2024)
Carotid Artery Plaque Analysis in 3D Based on Distance Encoding in Mesh Representations
by: Rahlfs, Hinrich, et al.
Published: (2025)
by: Rahlfs, Hinrich, et al.
Published: (2025)
MARVO: Marine-Adaptive Radiance-aware Visual Odometry
by: Sundar, Sacchin, et al.
Published: (2025)
by: Sundar, Sacchin, et al.
Published: (2025)
Transformer-Based Vector Font Classification Using Different Font Formats: TrueType versus PostScript
by: Fujioka, Takumu, et al.
Published: (2025)
by: Fujioka, Takumu, et al.
Published: (2025)
A Genealogy of Foundation Models in Remote Sensing
by: Lane, Kevin, et al.
Published: (2025)
by: Lane, Kevin, et al.
Published: (2025)
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
by: Li, Kai, et al.
Published: (2023)
by: Li, Kai, et al.
Published: (2023)
CNN-based local features for navigation near an asteroid
by: Knuuttila, Olli, et al.
Published: (2023)
by: Knuuttila, Olli, et al.
Published: (2023)
Supervised Embedded Methods for Hyperspectral Band Selection
by: Zimmer, Yaniv, et al.
Published: (2024)
by: Zimmer, Yaniv, et al.
Published: (2024)
A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms
by: Öfverstedt, Johan, et al.
Published: (2024)
by: Öfverstedt, Johan, et al.
Published: (2024)
Quantized FCA: Efficient Zero-Shot Texture Anomaly Detection
by: Ardelean, Andrei-Timotei, et al.
Published: (2025)
by: Ardelean, Andrei-Timotei, et al.
Published: (2025)
Combined Hyperbolic and Euclidean Soft Triple Loss Beyond the Single Space Deep Metric Learning
by: Saeki, Shozo, et al.
Published: (2025)
by: Saeki, Shozo, et al.
Published: (2025)
Quantifying and Inducing Shape Bias in CNNs via Max-Pool Dilation
by: Sawada, Takito, et al.
Published: (2026)
by: Sawada, Takito, et al.
Published: (2026)
Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders
by: Dokme, Atahan, et al.
Published: (2026)
by: Dokme, Atahan, et al.
Published: (2026)
Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods
by: Yu, Hyeon, et al.
Published: (2024)
by: Yu, Hyeon, et al.
Published: (2024)
Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures
by: Cozma, Andrei, et al.
Published: (2024)
by: Cozma, Andrei, et al.
Published: (2024)
Sign language recognition based on deep learning and low-cost handcrafted descriptors
by: Carneiro, Alvaro Leandro Cavalcante, et al.
Published: (2024)
by: Carneiro, Alvaro Leandro Cavalcante, et al.
Published: (2024)
Trapped in texture bias? A large scale comparison of deep instance segmentation
by: Theodoridis, Johannes, et al.
Published: (2024)
by: Theodoridis, Johannes, et al.
Published: (2024)
Exploiting Causality Signals in Medical Images: A Pilot Study with Empirical Results
by: Carloni, Gianluca, et al.
Published: (2023)
by: Carloni, Gianluca, et al.
Published: (2023)
cp_measure: API-first feature extraction for image-based profiling workflows
by: Muñoz, Alán F., et al.
Published: (2025)
by: Muñoz, Alán F., et al.
Published: (2025)
Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans
by: Venturini, Lorenzo, et al.
Published: (2024)
by: Venturini, Lorenzo, et al.
Published: (2024)
OOD Detection with immature Models
by: Montazeran, Behrooz, et al.
Published: (2025)
by: Montazeran, Behrooz, et al.
Published: (2025)
K-means Enhanced Density Gradient Analysis for Urban and Transport Metrics Using Multi-Modal Satellite Imagery
by: Tomkiewicz, P., et al.
Published: (2025)
by: Tomkiewicz, P., et al.
Published: (2025)
Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
by: Slika, Bouthaina, et al.
Published: (2023)
by: Slika, Bouthaina, et al.
Published: (2023)
Vision transformers in domain adaptation and domain generalization: a study of robustness
by: Alijani, Shadi, et al.
Published: (2024)
by: Alijani, Shadi, et al.
Published: (2024)
GraphTEN: Graph Enhanced Texture Encoding Network
by: Peng, Bo, et al.
Published: (2025)
by: Peng, Bo, et al.
Published: (2025)
On-Device Generative AI for GDPR-Compliant Visual Monitoring: Natural Language Alerts from Local Object Detection
by: Schappacher-Tilp, Gudrun, et al.
Published: (2026)
by: Schappacher-Tilp, Gudrun, et al.
Published: (2026)
Time Step Generating: A Universal Synthesized Deepfake Image Detector
by: Zeng, Ziyue, et al.
Published: (2024)
by: Zeng, Ziyue, et al.
Published: (2024)
Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets
by: Chuquimarca, Luis, et al.
Published: (2025)
by: Chuquimarca, Luis, et al.
Published: (2025)
Fast Normalized Cross-Correlation for Template Matching with Rotations
by: Almira, José María, et al.
Published: (2023)
by: Almira, José María, et al.
Published: (2023)
MANGO: Learning Disentangled Image Transformation Manifolds with Grouped Operators
by: Ancelin, Brighton, et al.
Published: (2024)
by: Ancelin, Brighton, et al.
Published: (2024)
IDFace: Face Template Protection for Efficient and Secure Identification
by: Kim, Sunpill, et al.
Published: (2025)
by: Kim, Sunpill, et al.
Published: (2025)
A Multimodal Pipeline for Clinical Data Extraction: Applying Vision-Language Models to Scans of Transfusion Reaction Reports
by: Schäfer, Henning, et al.
Published: (2025)
by: Schäfer, Henning, et al.
Published: (2025)
ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums
by: Hawley, Scott H., et al.
Published: (2021)
by: Hawley, Scott H., et al.
Published: (2021)
RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks
by: Agarwal, Amit, et al.
Published: (2025)
by: Agarwal, Amit, et al.
Published: (2025)
Spatially Optimized Compact Deep Metric Learning Model for Similarity Search
by: Islam, Md. Farhadul, et al.
Published: (2024)
by: Islam, Md. Farhadul, et al.
Published: (2024)
Surrealistic-like Image Generation with Vision-Language Models
by: Ayten, Elif, et al.
Published: (2024)
by: Ayten, Elif, et al.
Published: (2024)
Multimodal Image Matching based on Frequency-domain Information of Local Energy Response
by: Yang, Meng, et al.
Published: (2025)
by: Yang, Meng, et al.
Published: (2025)
Similar Items
-
Text-Visual Semantic Constrained AI-Generated Image Quality Assessment
by: Li, Qiang, et al.
Published: (2025) -
High-Throughput Phenotyping using Computer Vision and Machine Learning
by: Singhvi, Vivaan, et al.
Published: (2024) -
Modeling Time-Lapse Trajectories to Characterize Cranberry Growth
by: John, Ronan, et al.
Published: (2025) -
Revisiting Multi-Granularity Representation via Group Contrastive Learning for Unsupervised Vehicle Re-identification
by: Chang, Zhigang, et al.
Published: (2024) -
Carotid Artery Plaque Analysis in 3D Based on Distance Encoding in Mesh Representations
by: Rahlfs, Hinrich, et al.
Published: (2025)