:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ortega, Jorge Chang, Lan, Bastien Le, Serre, Thomas, Boutin, Victor
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.23819
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Saliency strikes back: How filtering out high frequencies improves white-box explanations
by: Muzellec, Sabine, et al.
Published: (2023)

Large Language Models can Share Images, Too!
by: Lee, Young-Jun, et al.
Published: (2023)

Debiasing Central Fixation Confounds Reveals a Peripheral "Sweet Spot" for Human-like Scanpaths in Hard-Attention Vision
by: Pan, Pengcheng, et al.
Published: (2026)

Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization
by: Fel, Thomas, et al.
Published: (2023)

Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models
by: Cohen, Regev, et al.
Published: (2024)

Better artificial intelligence does not mean better models of biology
by: Linsley, Drew, et al.
Published: (2025)

Understanding Visual Feature Reliance through the Lens of Complexity
by: Fel, Thomas, et al.
Published: (2024)

Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks
by: Boutin, Victor, et al.
Published: (2024)

Choosing the right basis for interpretability: Psychophysical comparison between neuron-based and dictionary-based representations
by: Colin, Julien, et al.
Published: (2024)

OpenSDI: Spotting Diffusion-Generated Images in the Open World
by: Wang, Yabin, et al.
Published: (2025)

SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities
by: Nguyen, Dung Thuy, et al.
Published: (2025)

Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)

Discriminative Flow Matching Via Local Generative Predictors
by: Jha, Om Govind, et al.
Published: (2026)

Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru
by: Cusipuma, Dunant, et al.
Published: (2025)

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)

On Diversity in Discriminative Neural Networks
by: Oubaha, Brahim, et al.
Published: (2024)

Implicit Preference Alignment for Human Image Animation
by: Wang, Yuanzhi, et al.
Published: (2026)

RareSpot: Spotting Small and Rare Wildlife in Aerial Imagery with Multi-Scale Consistency and Context-Aware Augmentation
by: Zhang, Bowen, et al.
Published: (2025)

CODE: Confident Ordinary Differential Editing
by: van Delft, Bastien, et al.
Published: (2024)

Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)

Explaining Human Comparisons using Alignment-Importance Heatmaps
by: Truong, Nhut, et al.
Published: (2024)

Parrot Captions Teach CLIP to Spot Text
by: Lin, Yiqi, et al.
Published: (2023)

MiraGe: Multimodal Discriminative Representation Learning for Generalizable AI-Generated Image Detection
by: Shi, Kuo, et al.
Published: (2025)

Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
by: Fang, Zhengyao, et al.
Published: (2026)

EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
by: Xing, Yifei, et al.
Published: (2024)

SpotEdit: Selective Region Editing in Diffusion Transformers
by: Qin, Zhibin, et al.
Published: (2025)

Text-Enhanced Panoptic Symbol Spotting in CAD Drawings
by: Liu, Xianlin, et al.
Published: (2025)

InstructOCR: Instruction Boosting Scene Text Spotting
by: Duan, Chen, et al.
Published: (2024)

Sign Spotting Disambiguation using Large Language Models
by: Low, JianHe, et al.
Published: (2025)

GloTSFormer: Global Video Text Spotting Transformer
by: Wang, Han, et al.
Published: (2024)

AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario
by: Beneduce, Ciro, et al.
Published: (2025)

AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment
by: Li, Yan, et al.
Published: (2024)

Enhancing Medical Large Vision-Language Models via Alignment Distillation
by: Chang, Aofei, et al.
Published: (2025)

Discriminative Probing and Tuning for Text-to-Image Generation
by: Qu, Leigang, et al.
Published: (2024)

Masked Face Recognition with Generative-to-Discriminative Representations
by: Ge, Shiming, et al.
Published: (2024)

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
by: Li, Yiheng, et al.
Published: (2024)

A Comprehensive Literature Review on Sweet Orange Leaf Diseases
by: Emon, Yousuf Rayhan, et al.
Published: (2023)

Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition
by: Zhang, Junzheng, et al.
Published: (2024)

Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies between Model Predictions and Human Responses in VQA
by: Lan, Jian, et al.
Published: (2024)

VladVA: Discriminative Fine-tuning of LVLMs
by: Ouali, Yassine, et al.
Published: (2024)