:: Library Catalog

Bibliographic Details
Main Authors:	Colagrande, Alex; Caillon, Paul; Feillet, Eva; Allauzen, Alexandre
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2507.02748

Similar Items

Limits of Resolution Equivariance in Fourier Neural Operators
by: Colagrande, Alex, et al.
Published: (2026)

Forward Only Learning for Orthogonal Neural Networks of any Depth
by: Caillon, Paul, et al.
Published: (2025)

Recommendation of data-free class-incremental learning algorithms by simulating future data
by: Feillet, Eva, et al.
Published: (2024)

HierSum: A Global and Local Attention Mechanism for Video Summarization
by: Beedu, Apoorva, et al.
Published: (2025)

T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers
by: Ntrougkas, Mariano V., et al.
Published: (2024)

Class-Discriminative Attention Maps for Vision Transformers
by: Brocki, Lennart, et al.
Published: (2023)

FasterViT: Fast Vision Transformers with Hierarchical Attention
by: Hatamizadeh, Ali, et al.
Published: (2023)

SLA2: Sparse-Linear Attention with Learnable Routing and QAT
by: Zhang, Jintao, et al.
Published: (2026)

Test-Time Training with KV Binding Is Secretly Linear Attention
by: Liu, Junchen, et al.
Published: (2026)

GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
by: Miyato, Takeru, et al.
Published: (2023)

Fairness-aware Vision Transformer via Debiased Self-Attention
by: Qiang, Yao, et al.
Published: (2023)

COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
by: Xiao, Jinqi, et al.
Published: (2023)

Fast Training of Recurrent Neural Networks with Stationary State Feedbacks
by: Caillon, Paul, et al.
Published: (2025)

The Loupe: A Plug-and-Play Attention Module for Amplifying Discriminative Features in Vision Transformers
by: Sengodan, Naren
Published: (2025)

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
by: Zhang, Jintao, et al.
Published: (2025)

MoH: Multi-Head Attention as Mixture-of-Head Attention
by: Jin, Peng, et al.
Published: (2024)

Voila-A: Aligning Vision-Language Models with User's Gaze Attention
by: Yan, Kun, et al.
Published: (2023)

Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light
by: Hassani, Ali, et al.
Published: (2025)

Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration
by: Xu, Yuanzhi, et al.
Published: (2026)

Interpreting Attention Heads for Image-to-Text Information Flow in Large Vision-Language Models
by: Kim, Jinyeong, et al.
Published: (2025)

On the Surprising Effectiveness of Attention Transfer for Vision Transformers
by: Li, Alexander C., et al.
Published: (2024)

IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism
by: Giri, Adithya
Published: (2025)

Attention in Diffusion Model: A Survey
by: Hua, Litao, et al.
Published: (2025)

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits
by: Mann, Logan, et al.
Published: (2026)

Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
by: Hassani, Ali, et al.
Published: (2024)

Not All Attention Heads Are What You Need: Refining CLIP's Image Representation with Attention Ablation
by: Lin, Feng, et al.
Published: (2025)

PCaM: A Progressive Focus Attention-Based Information Fusion Method for Improving Vision Transformer Domain Adaptation
by: Zang, Zelin, et al.
Published: (2025)

Elliptical Attention
by: Nielsen, Stefan K., et al.
Published: (2024)

Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
by: Li, Xingyang, et al.
Published: (2025)

Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning
by: Brouwer, Eric, et al.
Published: (2024)

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
by: Choi, Kanghyun, et al.
Published: (2024)

Deep Attention-guided Adaptive Subsampling
by: Shankaranarayana, Sharath M, et al.
Published: (2025)

ENA: Efficient N-dimensional Attention
by: Zhong, Yibo
Published: (2025)

MAIS: Memory-Attention for Interactive Segmentation
by: Orbes-Arteaga, Mauricio, et al.
Published: (2025)

EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration
by: Song, Zhifan, et al.
Published: (2025)

Shape-Guided Diffusion with Inside-Outside Attention
by: Park, Dong Huk, et al.
Published: (2022)

Motion meets Attention: Video Motion Prompts
by: Chen, Qixiang, et al.
Published: (2024)

Efficient Image Generation with Variadic Attention Heads
by: Walton, Steven, et al.
Published: (2022)

DiffCLIP: Differential Attention Meets CLIP
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)

Scratching Visual Transformer's Back with Uniform Attention
by: Hyeon-Woo, Nam, et al.
Published: (2022)