:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Marouani, Alexis, Siméoni, Oriane, Jégou, Hervé, Bojanowski, Piotr, Vo, Huy V.
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition I.4.10
Online Access:	https://arxiv.org/abs/2602.08626
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Unveiling Text in Challenging Stone Inscriptions: A Character-Context-Aware Patching Strategy for Binarization
by: Jena, Pratyush, et al.
Published: (2026)

A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
by: Aasan, Marius, et al.
Published: (2024)

LLM-empowered Dynamic Prompt Routing for Vision-Language Models Tuning under Long-Tailed Distributions
by: Jia, Yongju, et al.
Published: (2025)

Quantized Vision-Language Models for Damage Assessment: A Comparative Study of LLaVA-1.5-7B Quantization Levels
by: Yasuno, Takato
Published: (2026)

SigLino: Efficient Multi-Teacher Distillation for Agglomerative Vision Foundation Models
by: Chaybouti, Sofian, et al.
Published: (2025)

Differentiable Hierarchical Visual Tokenization
by: Aasan, Marius, et al.
Published: (2025)

VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
by: Chen, Zhipeng, et al.
Published: (2024)

HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training
by: Tang, Fenghe, et al.
Published: (2024)

Mobile-Ready Automated Triage of Diabetic Retinopathy Using Digital Fundus Images
by: Joshi, Aadi, et al.
Published: (2026)

Learning Unified Representation of 3D Gaussian Splatting
by: Xin, Yuelin, et al.
Published: (2025)

Frequency-Decomposed INR for NIR-Assisted Low-Light RGB Image Denoising
by: Shi, Ligen, et al.
Published: (2026)

Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips
by: Gerats, Beerend G. A., et al.
Published: (2024)

A Hierarchical Self-Consistent Regularization Approach to Satellite Image Time Series Classification
by: Weikmann, Giulio, et al.
Published: (2025)

Scalable Face Security Vision Foundation Model for Deepfake, Diffusion, and Spoofing Detection
by: Wang, Gaojian, et al.
Published: (2025)

Learning to Expand Images for Efficient Visual Autoregressive Modeling
by: Yang, Ruiqing, et al.
Published: (2025)

Cora: Correspondence-aware image editing using few step diffusion
by: Alimohammadi, Amirhossein, et al.
Published: (2025)

Pointing-Based Object Recognition
by: Hajdúch, Lukáš, et al.
Published: (2026)

Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
by: Urueña, Jaime Álvarez, et al.
Published: (2025)

FAME: Feature Activation Map Explanation on Image Classification and Face Recognition
by: Zhang, Xinyi, et al.
Published: (2026)

Label Delay in Online Continual Learning
by: Csaba, Botos, et al.
Published: (2023)

Neural Implicit Morphing of Face Images
by: Schardong, Guilherme, et al.
Published: (2023)

RealHD: A High-Quality Dataset for Robust Detection of State-of-the-Art AI-Generated Images
by: Yu, Hanzhe, et al.
Published: (2026)

A Novel Global Context-aware Deep Neural Network for Enhanced Brain Tumor Segmentation using Magnetic Resonance Images
by: Mukherjee, Sourjya, et al.
Published: (2026)

When Style Similarity Scores Fail: Diagnosing Raw CSD Cosine in Artist-Style Evaluation
by: Frochte, Jörg
Published: (2026)

Prototype-Guided Concept Erasure in Diffusion Models
by: Cai, Yuze, et al.
Published: (2026)

Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
by: Li, Jinhao, et al.
Published: (2024)

UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation
by: Zhao, Zhihao, et al.
Published: (2025)

GAIR: Location-Aware Self-Supervised Contrastive Pre-Training with Geo-Aligned Implicit Representations
by: Liu, Zeping, et al.
Published: (2025)

MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics
by: Pan, Ye, et al.
Published: (2025)

Generating real-time detailed ground visualisations from sparse aerial point clouds
by: Murray, Aidan, et al.
Published: (2025)

Symmetry Awareness Encoded Deep Learning Framework for Brain Imaging Analysis
by: Ma, Yang, et al.
Published: (2024)

STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics
by: Chen, Jiawen, et al.
Published: (2024)

VLSlice: Interactive Vision-and-Language Slice Discovery
by: Slyman, Eric, et al.
Published: (2023)

ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive Learning
by: Meegan, Nicholas, et al.
Published: (2022)

Towards Onboard Continuous Change Detection for Floods
by: Kyselica, Daniel, et al.
Published: (2026)

Non-Robust Features are Not Always Useful in One-Class Classification
by: Lau, Matthew, et al.
Published: (2024)

Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?
by: Maity, Subhajit, et al.
Published: (2025)

Feature-Augmented Deep Networks for Multiscale Building Segmentation in High-Resolution UAV and Satellite Imagery
by: Maniyar, Chintan B., et al.
Published: (2025)

Meta Co-Training: Two Views are Better than One
by: Rothenberger, Jay C., et al.
Published: (2023)

Co-Training with Active Contrastive Learning and Meta-Pseudo-Labeling on 2D Projections for Deep Semi-Supervised Learning
by: Aparco-Cardenas, David, et al.
Published: (2025)