| Main Authors: | Marouani, Alexis, Siméoni, Oriane, Jégou, Hervé, Bojanowski, Piotr, Vo, Huy V. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08626 |
Similar Items
Unveiling Text in Challenging Stone Inscriptions: A Character-Context-Aware Patching Strategy for Binarization
by: Jena, Pratyush, et al.
Published: (2026)
A Spitting Image: Modular Superpixel Tokenization in Vision Transformers
by: Aasan, Marius, et al.
Published: (2024)
LLM-empowered Dynamic Prompt Routing for Vision-Language Models Tuning under Long-Tailed Distributions
by: Jia, Yongju, et al.
Published: (2025)
Quantized Vision-Language Models for Damage Assessment: A Comparative Study of LLaVA-1.5-7B Quantization Levels
by: Yasuno, Takato
Published: (2026)
SigLino: Efficient Multi-Teacher Distillation for Agglomerative Vision Foundation Models
by: Chaybouti, Sofian, et al.
Published: (2025)
Differentiable Hierarchical Visual Tokenization
by: Aasan, Marius, et al.
Published: (2025)
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
by: Chen, Zhipeng, et al.
Published: (2024)
HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training
by: Tang, Fenghe, et al.
Published: (2024)
Mobile-Ready Automated Triage of Diabetic Retinopathy Using Digital Fundus Images
by: Joshi, Aadi, et al.
Published: (2026)
Learning Unified Representation of 3D Gaussian Splatting
by: Xin, Yuelin, et al.
Published: (2025)
Frequency-Decomposed INR for NIR-Assisted Low-Light RGB Image Denoising
by: Shi, Ligen, et al.
Published: (2026)
Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips
by: Gerats, Beerend G. A., et al.
Published: (2024)
A Hierarchical Self-Consistent Regularization Approach to Satellite Image Time Series Classification
by: Weikmann, Giulio, et al.
Published: (2025)
Scalable Face Security Vision Foundation Model for Deepfake, Diffusion, and Spoofing Detection
by: Wang, Gaojian, et al.
Published: (2025)
Learning to Expand Images for Efficient Visual Autoregressive Modeling
by: Yang, Ruiqing, et al.
Published: (2025)
Cora: Correspondence-aware image editing using few step diffusion
by: Alimohammadi, Amirhossein, et al.
Published: (2025)
Pointing-Based Object Recognition
by: Hajdúch, Lukáš, et al.
Published: (2026)
Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
by: Urueña, Jaime Álvarez, et al.
Published: (2025)
FAME: Feature Activation Map Explanation on Image Classification and Face Recognition
by: Zhang, Xinyi, et al.
Published: (2026)
Label Delay in Online Continual Learning
by: Csaba, Botos, et al.
Published: (2023)
Neural Implicit Morphing of Face Images
by: Schardong, Guilherme, et al.
Published: (2023)
RealHD: A High-Quality Dataset for Robust Detection of State-of-the-Art AI-Generated Images
by: Yu, Hanzhe, et al.
Published: (2026)
A Novel Global Context-aware Deep Neural Network for Enhanced Brain Tumor Segmentation using Magnetic Resonance Images
by: Mukherjee, Sourjya, et al.
Published: (2026)
When Style Similarity Scores Fail: Diagnosing Raw CSD Cosine in Artist-Style Evaluation
by: Frochte, Jörg
Published: (2026)
Prototype-Guided Concept Erasure in Diffusion Models
by: Cai, Yuze, et al.
Published: (2026)
Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
by: Li, Jinhao, et al.
Published: (2024)
UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation
by: Zhao, Zhihao, et al.
Published: (2025)
GAIR: Location-Aware Self-Supervised Contrastive Pre-Training with Geo-Aligned Implicit Representations
by: Liu, Zeping, et al.
Published: (2025)
MienCap: Realtime Performance-Based Facial Animation with Live Mood Dynamics
by: Pan, Ye, et al.
Published: (2025)
Generating real-time detailed ground visualisations from sparse aerial point clouds
by: Murray, Aidan, et al.
Published: (2025)
Symmetry Awareness Encoded Deep Learning Framework for Brain Imaging Analysis
by: Ma, Yang, et al.
Published: (2024)
STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics
by: Chen, Jiawen, et al.
Published: (2024)
VLSlice: Interactive Vision-and-Language Slice Discovery
by: Slyman, Eric, et al.
Published: (2023)
ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive Learning
by: Meegan, Nicholas, et al.
Published: (2022)
Towards Onboard Continuous Change Detection for Floods
by: Kyselica, Daniel, et al.
Published: (2026)
Non-Robust Features are Not Always Useful in One-Class Classification
by: Lau, Matthew, et al.
Published: (2024)
Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?
by: Maity, Subhajit, et al.
Published: (2025)
Feature-Augmented Deep Networks for Multiscale Building Segmentation in High-Resolution UAV and Satellite Imagery
by: Maniyar, Chintan B., et al.
Published: (2025)
Meta Co-Training: Two Views are Better than One
by: Rothenberger, Jay C., et al.
Published: (2023)
Co-Training with Active Contrastive Learning and Meta-Pseudo-Labeling on 2D Projections for Deep Semi-Supervised Learning
by: Aparco-Cardenas, David, et al.
Published: (2025)