| Main Authors: | Colagrande, Alex, Caillon, Paul, Feillet, Eva, Allauzen, Alexandre |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.02748 |
Similar Items
Limits of Resolution Equivariance in Fourier Neural Operators
by: Colagrande, Alex, et al.
Published: (2026)
Forward Only Learning for Orthogonal Neural Networks of any Depth
by: Caillon, Paul, et al.
Published: (2025)
Recommendation of data-free class-incremental learning algorithms by simulating future data
by: Feillet, Eva, et al.
Published: (2024)
HierSum: A Global and Local Attention Mechanism for Video Summarization
by: Beedu, Apoorva, et al.
Published: (2025)
T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers
by: Ntrougkas, Mariano V., et al.
Published: (2024)
Class-Discriminative Attention Maps for Vision Transformers
by: Brocki, Lennart, et al.
Published: (2023)
FasterViT: Fast Vision Transformers with Hierarchical Attention
by: Hatamizadeh, Ali, et al.
Published: (2023)
SLA2: Sparse-Linear Attention with Learnable Routing and QAT
by: Zhang, Jintao, et al.
Published: (2026)
Test-Time Training with KV Binding Is Secretly Linear Attention
by: Liu, Junchen, et al.
Published: (2026)
GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers
by: Miyato, Takeru, et al.
Published: (2023)
Fairness-aware Vision Transformer via Debiased Self-Attention
by: Qiang, Yao, et al.
Published: (2023)
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
by: Xiao, Jinqi, et al.
Published: (2023)
Fast Training of Recurrent Neural Networks with Stationary State Feedbacks
by: Caillon, Paul, et al.
Published: (2025)
The Loupe: A Plug-and-Play Attention Module for Amplifying Discriminative Features in Vision Transformers
by: Sengodan, Naren
Published: (2025)
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention
by: Zhang, Jintao, et al.
Published: (2025)
MoH: Multi-Head Attention as Mixture-of-Head Attention
by: Jin, Peng, et al.
Published: (2024)
Voila-A: Aligning Vision-Language Models with User's Gaze Attention
by: Yan, Kun, et al.
Published: (2023)
Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light
by: Hassani, Ali, et al.
Published: (2025)
Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration
by: Xu, Yuanzhi, et al.
Published: (2026)
Interpreting Attention Heads for Image-to-Text Information Flow in Large Vision-Language Models
by: Kim, Jinyeong, et al.
Published: (2025)
On the Surprising Effectiveness of Attention Transfer for Vision Transformers
by: Li, Alexander C., et al.
Published: (2024)
IBiT: Utilizing Inductive Biases to Create a More Data Efficient Attention Mechanism
by: Giri, Adithya
Published: (2025)
Attention in Diffusion Model: A Survey
by: Hua, Litao, et al.
Published: (2025)
Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits
by: Mann, Logan, et al.
Published: (2026)
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level
by: Hassani, Ali, et al.
Published: (2024)
Not All Attention Heads Are What You Need: Refining CLIP's Image Representation with Attention Ablation
by: Lin, Feng, et al.
Published: (2025)
PCaM: A Progressive Focus Attention-Based Information Fusion Method for Improving Vision Transformer Domain Adaptation
by: Zang, Zelin, et al.
Published: (2025)
Elliptical Attention
by: Nielsen, Stefan K., et al.
Published: (2024)
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
by: Li, Xingyang, et al.
Published: (2025)
Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning
by: Brouwer, Eric, et al.
Published: (2024)
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
by: Choi, Kanghyun, et al.
Published: (2024)
Deep Attention-guided Adaptive Subsampling
by: Shankaranarayana, Sharath M, et al.
Published: (2025)
ENA: Efficient N-dimensional Attention
by: Zhong, Yibo
Published: (2025)
MAIS: Memory-Attention for Interactive Segmentation
by: Orbes-Arteaga, Mauricio, et al.
Published: (2025)
EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration
by: Song, Zhifan, et al.
Published: (2025)
Shape-Guided Diffusion with Inside-Outside Attention
by: Park, Dong Huk, et al.
Published: (2022)
Motion meets Attention: Video Motion Prompts
by: Chen, Qixiang, et al.
Published: (2024)
Efficient Image Generation with Variadic Attention Heads
by: Walton, Steven, et al.
Published: (2022)
DiffCLIP: Differential Attention Meets CLIP
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)
Scratching Visual Transformer's Back with Uniform Attention
by: Hyeon-Woo, Nam, et al.
Published: (2022)