Saved in:
| Main Author: | Racioppo, Peter |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.04154 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RT-Transformer: The Transformer Block as a Spherical State Estimator
by: Racioppo, Peter
Published: (2026)
by: Racioppo, Peter
Published: (2026)
Attention-Based Neural-Augmented Kalman Filter for Legged Robot State Estimation
by: Lee, Seokju, et al.
Published: (2026)
by: Lee, Seokju, et al.
Published: (2026)
Attention-Aided MMSE for OFDM Channel Estimation: Learning Linear Filters with Attention
by: Ha, TaeJun, et al.
Published: (2025)
by: Ha, TaeJun, et al.
Published: (2025)
The Routing and Filtering Structure of Attention
by: Jamil, Shafayeth, et al.
Published: (2026)
by: Jamil, Shafayeth, et al.
Published: (2026)
Pay Attention to Small Weights
by: Zhou, Chao, et al.
Published: (2025)
by: Zhou, Chao, et al.
Published: (2025)
The Anxiety of Influence: Bloom Filters in Transformer Attention Heads
by: Balogh, Peter
Published: (2026)
by: Balogh, Peter
Published: (2026)
DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs
by: Jin, Haolin, et al.
Published: (2025)
by: Jin, Haolin, et al.
Published: (2025)
Spatial-Temporal Attention Model for Traffic State Estimation with Sparse Internet of Vehicles
by: Xue, Jianzhe, et al.
Published: (2024)
by: Xue, Jianzhe, et al.
Published: (2024)
More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)
by: Lv, Ang, et al.
Published: (2024)
AC-SINDy: Compositional Sparse Identification of Nonlinear Dynamics
by: Racioppo, Peter
Published: (2026)
by: Racioppo, Peter
Published: (2026)
One Filters All: A Generalist Filter for State Estimation
by: Liu, Shiqi, et al.
Published: (2025)
by: Liu, Shiqi, et al.
Published: (2025)
Modeling Choice via Self-Attention
by: Ko, Joohwan, et al.
Published: (2023)
by: Ko, Joohwan, et al.
Published: (2023)
Quaternion Self-Attention with Shared Scores
by: Yamauchi, Shogo, et al.
Published: (2026)
by: Yamauchi, Shogo, et al.
Published: (2026)
Weighted Graph Structure Learning with Attention Denoising for Node Classification
by: Wang, Tingting, et al.
Published: (2025)
by: Wang, Tingting, et al.
Published: (2025)
Sigmoid Self-Attention has Lower Sample Complexity than Softmax Self-Attention: A Mixture-of-Experts Perspective
by: Yan, Fanqi, et al.
Published: (2025)
by: Yan, Fanqi, et al.
Published: (2025)
Attention as Robust Representation for Time Series Forecasting
by: Niu, PeiSong, et al.
Published: (2024)
by: Niu, PeiSong, et al.
Published: (2024)
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
by: Qiu, Haiquan, et al.
Published: (2025)
by: Qiu, Haiquan, et al.
Published: (2025)
FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning
by: Luo, Haozheng, et al.
Published: (2026)
by: Luo, Haozheng, et al.
Published: (2026)
Are Self-Attentions Effective for Time Series Forecasting?
by: Kim, Dongbin, et al.
Published: (2024)
by: Kim, Dongbin, et al.
Published: (2024)
Graph Convolutions Enrich the Self-Attention in Transformers!
by: Choi, Jeongwhan, et al.
Published: (2023)
by: Choi, Jeongwhan, et al.
Published: (2023)
AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction
by: Gong, Chunlin, et al.
Published: (2025)
by: Gong, Chunlin, et al.
Published: (2025)
State Rank Dynamics in Linear Attention LLMs
by: Sun, Ao, et al.
Published: (2026)
by: Sun, Ao, et al.
Published: (2026)
Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
by: Emadi, Seyed Morteza
Published: (2026)
by: Emadi, Seyed Morteza
Published: (2026)
Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference
by: Ding, Yifu, et al.
Published: (2026)
by: Ding, Yifu, et al.
Published: (2026)
Key and Value Weights Are Probably All You Need: On the Necessity of the Query, Key, Value weight Triplet in Self-Attention Transformers
by: Karbevski, Marko, et al.
Published: (2025)
by: Karbevski, Marko, et al.
Published: (2025)
Attention-Driven Hierarchical Reinforcement Learning with Particle Filtering for Source Localization in Dynamic Fields
by: Shi, Yiwei, et al.
Published: (2025)
by: Shi, Yiwei, et al.
Published: (2025)
vAttention: Verified Sparse Attention
by: Desai, Aditya, et al.
Published: (2025)
by: Desai, Aditya, et al.
Published: (2025)
Attention Sinks and Outliers in Attention Residuals
by: Luo, Haozheng, et al.
Published: (2026)
by: Luo, Haozheng, et al.
Published: (2026)
Data-Free Pruning of Self-Attention Layers in LLMs
by: Saikumar, Dhananjay, et al.
Published: (2025)
by: Saikumar, Dhananjay, et al.
Published: (2025)
Echo State Transformer: Attention Over Finite Memories
by: Bendi-Ouis, Yannis, et al.
Published: (2025)
by: Bendi-Ouis, Yannis, et al.
Published: (2025)
Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI
by: Doodipala, Ruthwik Reddy, et al.
Published: (2026)
by: Doodipala, Ruthwik Reddy, et al.
Published: (2026)
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
by: Delestre, Cyrile, et al.
Published: (2024)
by: Delestre, Cyrile, et al.
Published: (2024)
CSAI: Conditional Self-Attention Imputation for Healthcare Time-series
by: Qian, Linglong, et al.
Published: (2023)
by: Qian, Linglong, et al.
Published: (2023)
Sessa: Selective State Space Attention
by: Horbatko, Liubomyr
Published: (2026)
by: Horbatko, Liubomyr
Published: (2026)
Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)
Parkinson's Disease Detection from Resting State EEG using Multi-Head Graph Structure Learning with Gradient Weighted Graph Attention Explanations
by: Neves, Christopher, et al.
Published: (2024)
by: Neves, Christopher, et al.
Published: (2024)
Hierarchical Self-Attention: Generalizing Neural Attention Mechanics to Multi-Scale Problems
by: Amizadeh, Saeed, et al.
Published: (2025)
by: Amizadeh, Saeed, et al.
Published: (2025)
Towards Robust Knowledge Tracing Models via k-Sparse Attention
by: Huang, Shuyan, et al.
Published: (2024)
by: Huang, Shuyan, et al.
Published: (2024)
Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers
by: Saxena, Krati, et al.
Published: (2025)
by: Saxena, Krati, et al.
Published: (2025)
Attention on the Sphere
by: Bonev, Boris, et al.
Published: (2025)
by: Bonev, Boris, et al.
Published: (2025)
Similar Items
-
RT-Transformer: The Transformer Block as a Spherical State Estimator
by: Racioppo, Peter
Published: (2026) -
Attention-Based Neural-Augmented Kalman Filter for Legged Robot State Estimation
by: Lee, Seokju, et al.
Published: (2026) -
Attention-Aided MMSE for OFDM Channel Estimation: Learning Linear Filters with Attention
by: Ha, TaeJun, et al.
Published: (2025) -
The Routing and Filtering Structure of Attention
by: Jamil, Shafayeth, et al.
Published: (2026) -
Pay Attention to Small Weights
by: Zhou, Chao, et al.
Published: (2025)