:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hollard, Lilian, Mohimont, Lucas, Gaveau, Nathalie, Steffenel, Luiz-Angelo
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.15798
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LeYOLO, New Embedded Architecture for Object Detection
by: Hollard, Lilian, et al.
Published: (2024)

Adversarial Attacks Leverage Interference Between Features in Superposition
by: Stevinson, Edward, et al.
Published: (2025)

Adjoint Inversion Reveals Holographic Superposition and Destructive Interference in CNN Classifiers
by: Shu, Kaixiang
Published: (2026)

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
by: Tang, Longxiang, et al.
Published: (2024)

GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
by: Gao, Tian, et al.
Published: (2023)

Exploring Scalable Unified Modeling for General Low-Level Vision
by: Chen, Xiangyu, et al.
Published: (2025)

On the Explainability of Vision-Language Models in Art History
by: Schneider, Stefanie
Published: (2026)

State-of-the-Art Fails in the Art of Damage Detection
by: Ivanova, Daniela, et al.
Published: (2024)

Exploring Light-Weight Object Recognition for Real-Time Document Detection
by: Wojcik, Lucas, et al.
Published: (2025)

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR
by: Taghadouini, Said, et al.
Published: (2026)

Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing
by: Lou, Meng, et al.
Published: (2026)

Have Large Vision-Language Models Mastered Art History?
by: Strafforello, Ombretta, et al.
Published: (2024)

Non-Learning Low-Light Stereo Vision
by: Wang, Jason, et al.
Published: (2026)

State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters
by: Fedorov, Alex, et al.
Published: (2025)

ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation
by: Li, Chang, et al.
Published: (2024)

No One Knows the State of the Art in Geospatial Foundation Models
by: Corley, Isaac, et al.
Published: (2026)

Training A Small Emotional Vision Language Model for Visual Art Comprehension
by: Zhang, Jing, et al.
Published: (2024)

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
by: Deitke, Matt, et al.
Published: (2024)

Vision Tiny Recursion Model (ViTRM): Parameter-Efficient Image Classification via Recursive State Refinement
by: Akazan, Ange-Clément, et al.
Published: (2026)

Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
by: Qu, Tingyu, et al.
Published: (2024)

Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning
by: Lee, Yeoreum, et al.
Published: (2025)

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
by: Khan, Faizan Farooq, et al.
Published: (2024)

Superposition through Active Learning lens
by: Devkar, Akanksha
Published: (2024)

From Data Statistics to Feature Geometry: How Correlations Shape Superposition
by: Prieto, Lucas, et al.
Published: (2026)

Exploring Token Pruning in Vision State Space Models
by: Zhan, Zheng, et al.
Published: (2024)

Low-Resource Vision Challenges for Foundation Models
by: Zhang, Yunhua, et al.
Published: (2024)

Deepfake Media Forensics: State of the Art and Challenges Ahead
by: Amerini, Irene, et al.
Published: (2024)

ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
by: Wang, Haowen, et al.
Published: (2025)

SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
by: Li, Zhengang, et al.
Published: (2024)

Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference
by: Jiang, Xi, et al.
Published: (2024)

CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models
by: Banerjee, Ayan, et al.
Published: (2025)

Understanding the Risks of Asphalt Art to the Reliability of Vision-Based Perception Systems
by: Ma, Jin, et al.
Published: (2025)

An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models
by: Sun, Jiahao, et al.
Published: (2024)

Forest Before Trees: Latent Superposition for Efficient Visual Reasoning
by: Wang, Yubo, et al.
Published: (2026)

PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks
by: Cui, Cheng, et al.
Published: (2026)

MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
by: Guo, Yuncheng, et al.
Published: (2025)

Ensemble-Based Deepfake Detection using State-of-the-Art Models with Robust Cross-Dataset Generalisation
by: Wahab, Haroon, et al.
Published: (2025)

Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks
by: Vishal, Joseph Raj, et al.
Published: (2024)

ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter
by: Yuan, Zhengqing, et al.
Published: (2023)

On Neural BRDFs: A Thorough Comparison of State-of-the-Art Approaches
by: Hofherr, Florian, et al.
Published: (2025)