| Main Authors: | Hollard, Lilian, Mohimont, Lucas, Gaveau, Nathalie, Steffenel, Luiz-Angelo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.15798 |
Similar Items
LeYOLO, New Embedded Architecture for Object Detection
by: Hollard, Lilian, et al.
Published: (2024)
Adversarial Attacks Leverage Interference Between Features in Superposition
by: Stevinson, Edward, et al.
Published: (2025)
Adjoint Inversion Reveals Holographic Superposition and Destructive Interference in CNN Classifiers
by: Shu, Kaixiang
Published: (2026)
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
by: Tang, Longxiang, et al.
Published: (2024)
GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples
by: Gao, Tian, et al.
Published: (2023)
Exploring Scalable Unified Modeling for General Low-Level Vision
by: Chen, Xiangyu, et al.
Published: (2025)
On the Explainability of Vision-Language Models in Art History
by: Schneider, Stefanie
Published: (2026)
State-of-the-Art Fails in the Art of Damage Detection
by: Ivanova, Daniela, et al.
Published: (2024)
Exploring Light-Weight Object Recognition for Real-Time Document Detection
by: Wojcik, Lucas, et al.
Published: (2025)
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR
by: Taghadouini, Said, et al.
Published: (2026)
Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing
by: Lou, Meng, et al.
Published: (2026)
Have Large Vision-Language Models Mastered Art History?
by: Strafforello, Ombretta, et al.
Published: (2024)
Non-Learning Low-Light Stereo Vision
by: Wang, Jason, et al.
Published: (2026)
State-of-the-Art Stroke Lesion Segmentation at 1/1000th of Parameters
by: Fedorov, Alex, et al.
Published: (2025)
ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation
by: Li, Chang, et al.
Published: (2024)
No One Knows the State of the Art in Geospatial Foundation Models
by: Corley, Isaac, et al.
Published: (2026)
Training A Small Emotional Vision Language Model for Visual Art Comprehension
by: Zhang, Jing, et al.
Published: (2024)
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
by: Deitke, Matt, et al.
Published: (2024)
Vision Tiny Recursion Model (ViTRM): Parameter-Efficient Image Classification via Recursive State Refinement
by: Akazan, Ange-Clément, et al.
Published: (2026)
Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
by: Qu, Tingyu, et al.
Published: (2024)
Mitigating Parameter Interference in Model Merging via Sharpness-Aware Fine-Tuning
by: Lee, Yeoreum, et al.
Published: (2025)
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art
by: Khan, Faizan Farooq, et al.
Published: (2024)
Superposition through Active Learning lens
by: Devkar, Akanksha
Published: (2024)
From Data Statistics to Feature Geometry: How Correlations Shape Superposition
by: Prieto, Lucas, et al.
Published: (2026)
Exploring Token Pruning in Vision State Space Models
by: Zhan, Zheng, et al.
Published: (2024)
Low-Resource Vision Challenges for Foundation Models
by: Zhang, Yunhua, et al.
Published: (2024)
Deepfake Media Forensics: State of the Art and Challenges Ahead
by: Amerini, Irene, et al.
Published: (2024)
ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
by: Wang, Haowen, et al.
Published: (2025)
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
by: Li, Zhengang, et al.
Published: (2024)
Toward Multi-class Anomaly Detection: Exploring Class-aware Unified Model against Inter-class Interference
by: Jiang, Xi, et al.
Published: (2024)
CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models
by: Banerjee, Ayan, et al.
Published: (2025)
Understanding the Risks of Asphalt Art to the Reliability of Vision-Based Perception Systems
by: Ma, Jin, et al.
Published: (2025)
An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models
by: Sun, Jiahao, et al.
Published: (2024)
Forest Before Trees: Latent Superposition for Efficient Visual Reasoning
by: Wang, Yubo, et al.
Published: (2026)
PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks
by: Cui, Cheng, et al.
Published: (2026)
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
by: Guo, Yuncheng, et al.
Published: (2025)
Ensemble-Based Deepfake Detection using State-of-the-Art Models with Robust Cross-Dataset Generalisation
by: Wahab, Haroon, et al.
Published: (2025)
Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks
by: Vishal, Joseph Raj, et al.
Published: (2024)
ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter
by: Yuan, Zhengqing, et al.
Published: (2023)
On Neural BRDFs: A Thorough Comparison of State-of-the-Art Approaches
by: Hofherr, Florian, et al.
Published: (2025)