Saved in:
| Main Authors: | Ortega, Jorge Chang, Lan, Bastien Le, Serre, Thomas, Boutin, Victor |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.23819 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Saliency strikes back: How filtering out high frequencies improves white-box explanations
by: Muzellec, Sabine, et al.
Published: (2023)
by: Muzellec, Sabine, et al.
Published: (2023)
Large Language Models can Share Images, Too!
by: Lee, Young-Jun, et al.
Published: (2023)
by: Lee, Young-Jun, et al.
Published: (2023)
Debiasing Central Fixation Confounds Reveals a Peripheral "Sweet Spot" for Human-like Scanpaths in Hard-Attention Vision
by: Pan, Pengcheng, et al.
Published: (2026)
by: Pan, Pengcheng, et al.
Published: (2026)
Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization
by: Fel, Thomas, et al.
Published: (2023)
by: Fel, Thomas, et al.
Published: (2023)
Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models
by: Cohen, Regev, et al.
Published: (2024)
by: Cohen, Regev, et al.
Published: (2024)
Better artificial intelligence does not mean better models of biology
by: Linsley, Drew, et al.
Published: (2025)
by: Linsley, Drew, et al.
Published: (2025)
Understanding Visual Feature Reliance through the Lens of Complexity
by: Fel, Thomas, et al.
Published: (2024)
by: Fel, Thomas, et al.
Published: (2024)
Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks
by: Boutin, Victor, et al.
Published: (2024)
by: Boutin, Victor, et al.
Published: (2024)
Choosing the right basis for interpretability: Psychophysical comparison between neuron-based and dictionary-based representations
by: Colin, Julien, et al.
Published: (2024)
by: Colin, Julien, et al.
Published: (2024)
OpenSDI: Spotting Diffusion-Generated Images in the Open World
by: Wang, Yabin, et al.
Published: (2025)
by: Wang, Yabin, et al.
Published: (2025)
SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities
by: Nguyen, Dung Thuy, et al.
Published: (2025)
by: Nguyen, Dung Thuy, et al.
Published: (2025)
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)
by: Lepori, Michael A., et al.
Published: (2024)
Discriminative Flow Matching Via Local Generative Predictors
by: Jha, Om Govind, et al.
Published: (2026)
by: Jha, Om Govind, et al.
Published: (2026)
Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru
by: Cusipuma, Dunant, et al.
Published: (2025)
by: Cusipuma, Dunant, et al.
Published: (2025)
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization
by: Tan, Zhentao, et al.
Published: (2024)
by: Tan, Zhentao, et al.
Published: (2024)
On Diversity in Discriminative Neural Networks
by: Oubaha, Brahim, et al.
Published: (2024)
by: Oubaha, Brahim, et al.
Published: (2024)
Implicit Preference Alignment for Human Image Animation
by: Wang, Yuanzhi, et al.
Published: (2026)
by: Wang, Yuanzhi, et al.
Published: (2026)
RareSpot: Spotting Small and Rare Wildlife in Aerial Imagery with Multi-Scale Consistency and Context-Aware Augmentation
by: Zhang, Bowen, et al.
Published: (2025)
by: Zhang, Bowen, et al.
Published: (2025)
CODE: Confident Ordinary Differential Editing
by: van Delft, Bastien, et al.
Published: (2024)
by: van Delft, Bastien, et al.
Published: (2024)
Progressive Prompt Detailing for Improved Alignment in Text-to-Image Generative Models
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)
by: Saichandran, Ketan Suhaas, et al.
Published: (2025)
Explaining Human Comparisons using Alignment-Importance Heatmaps
by: Truong, Nhut, et al.
Published: (2024)
by: Truong, Nhut, et al.
Published: (2024)
Parrot Captions Teach CLIP to Spot Text
by: Lin, Yiqi, et al.
Published: (2023)
by: Lin, Yiqi, et al.
Published: (2023)
MiraGe: Multimodal Discriminative Representation Learning for Generalizable AI-Generated Image Detection
by: Shi, Kuo, et al.
Published: (2025)
by: Shi, Kuo, et al.
Published: (2025)
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity
by: Fang, Zhengyao, et al.
Published: (2026)
by: Fang, Zhengyao, et al.
Published: (2026)
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment
by: Xing, Yifei, et al.
Published: (2024)
by: Xing, Yifei, et al.
Published: (2024)
SpotEdit: Selective Region Editing in Diffusion Transformers
by: Qin, Zhibin, et al.
Published: (2025)
by: Qin, Zhibin, et al.
Published: (2025)
Text-Enhanced Panoptic Symbol Spotting in CAD Drawings
by: Liu, Xianlin, et al.
Published: (2025)
by: Liu, Xianlin, et al.
Published: (2025)
InstructOCR: Instruction Boosting Scene Text Spotting
by: Duan, Chen, et al.
Published: (2024)
by: Duan, Chen, et al.
Published: (2024)
Sign Spotting Disambiguation using Large Language Models
by: Low, JianHe, et al.
Published: (2025)
by: Low, JianHe, et al.
Published: (2025)
GloTSFormer: Global Video Text Spotting Transformer
by: Wang, Han, et al.
Published: (2024)
by: Wang, Han, et al.
Published: (2024)
AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario
by: Beneduce, Ciro, et al.
Published: (2025)
by: Beneduce, Ciro, et al.
Published: (2025)
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment
by: Li, Yan, et al.
Published: (2024)
by: Li, Yan, et al.
Published: (2024)
Enhancing Medical Large Vision-Language Models via Alignment Distillation
by: Chang, Aofei, et al.
Published: (2025)
by: Chang, Aofei, et al.
Published: (2025)
Discriminative Probing and Tuning for Text-to-Image Generation
by: Qu, Leigang, et al.
Published: (2024)
by: Qu, Leigang, et al.
Published: (2024)
Masked Face Recognition with Generative-to-Discriminative Representations
by: Ge, Shiming, et al.
Published: (2024)
by: Ge, Shiming, et al.
Published: (2024)
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
by: Li, Yiheng, et al.
Published: (2024)
by: Li, Yiheng, et al.
Published: (2024)
A Comprehensive Literature Review on Sweet Orange Leaf Diseases
by: Emon, Yousuf Rayhan, et al.
Published: (2023)
by: Emon, Yousuf Rayhan, et al.
Published: (2023)
Distilling Generative-Discriminative Representations for Very Low-Resolution Face Recognition
by: Zhang, Junzheng, et al.
Published: (2024)
by: Zhang, Junzheng, et al.
Published: (2024)
Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies between Model Predictions and Human Responses in VQA
by: Lan, Jian, et al.
Published: (2024)
by: Lan, Jian, et al.
Published: (2024)
VladVA: Discriminative Fine-tuning of LVLMs
by: Ouali, Yassine, et al.
Published: (2024)
by: Ouali, Yassine, et al.
Published: (2024)
Similar Items
-
Saliency strikes back: How filtering out high frequencies improves white-box explanations
by: Muzellec, Sabine, et al.
Published: (2023) -
Large Language Models can Share Images, Too!
by: Lee, Young-Jun, et al.
Published: (2023) -
Debiasing Central Fixation Confounds Reveals a Peripheral "Sweet Spot" for Human-like Scanpaths in Hard-Attention Vision
by: Pan, Pengcheng, et al.
Published: (2026) -
Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained Optimization
by: Fel, Thomas, et al.
Published: (2023) -
Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models
by: Cohen, Regev, et al.
Published: (2024)