:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Racioppo, Peter
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.04154
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

RT-Transformer: The Transformer Block as a Spherical State Estimator
by: Racioppo, Peter
Published: (2026)

Attention-Based Neural-Augmented Kalman Filter for Legged Robot State Estimation
by: Lee, Seokju, et al.
Published: (2026)

Attention-Aided MMSE for OFDM Channel Estimation: Learning Linear Filters with Attention
by: Ha, TaeJun, et al.
Published: (2025)

The Routing and Filtering Structure of Attention
by: Jamil, Shafayeth, et al.
Published: (2026)

Pay Attention to Small Weights
by: Zhou, Chao, et al.
Published: (2025)

The Anxiety of Influence: Bloom Filters in Transformer Attention Heads
by: Balogh, Peter
Published: (2026)

DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs
by: Jin, Haolin, et al.
Published: (2025)

Spatial-Temporal Attention Model for Traffic State Estimation with Sparse Internet of Vehicles
by: Xue, Jianzhe, et al.
Published: (2024)

More Expressive Attention with Negative Weights
by: Lv, Ang, et al.
Published: (2024)

AC-SINDy: Compositional Sparse Identification of Nonlinear Dynamics
by: Racioppo, Peter
Published: (2026)

One Filters All: A Generalist Filter for State Estimation
by: Liu, Shiqi, et al.
Published: (2025)

Modeling Choice via Self-Attention
by: Ko, Joohwan, et al.
Published: (2023)

Quaternion Self-Attention with Shared Scores
by: Yamauchi, Shogo, et al.
Published: (2026)

Weighted Graph Structure Learning with Attention Denoising for Node Classification
by: Wang, Tingting, et al.
Published: (2025)

Sigmoid Self-Attention has Lower Sample Complexity than Softmax Self-Attention: A Mixture-of-Experts Perspective
by: Yan, Fanqi, et al.
Published: (2025)

Attention as Robust Representation for Time Series Forecasting
by: Niu, PeiSong, et al.
Published: (2024)

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
by: Qiu, Haiquan, et al.
Published: (2025)

FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning
by: Luo, Haozheng, et al.
Published: (2026)

Are Self-Attentions Effective for Time Series Forecasting?
by: Kim, Dongbin, et al.
Published: (2024)

Graph Convolutions Enrich the Self-Attention in Transformers!
by: Choi, Jeongwhan, et al.
Published: (2023)

AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction
by: Gong, Chunlin, et al.
Published: (2025)

State Rank Dynamics in Linear Attention LLMs
by: Sun, Ao, et al.
Published: (2026)

Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
by: Emadi, Seyed Morteza
Published: (2026)

Diagonal-Tiled Mixed-Precision Attention for Efficient Low-Bit MXFP Inference
by: Ding, Yifu, et al.
Published: (2026)

Key and Value Weights Are Probably All You Need: On the Necessity of the Query, Key, Value weight Triplet in Self-Attention Transformers
by: Karbevski, Marko, et al.
Published: (2025)

Attention-Driven Hierarchical Reinforcement Learning with Particle Filtering for Source Localization in Dynamic Fields
by: Shi, Yiwei, et al.
Published: (2025)

vAttention: Verified Sparse Attention
by: Desai, Aditya, et al.
Published: (2025)

Attention Sinks and Outliers in Attention Residuals
by: Luo, Haozheng, et al.
Published: (2026)

Data-Free Pruning of Self-Attention Layers in LLMs
by: Saikumar, Dhananjay, et al.
Published: (2025)

Echo State Transformer: Attention Over Finite Memories
by: Bendi-Ouis, Yannis, et al.
Published: (2025)

Rhamba: Region-Aware Hybrid Attention-Mamba Framework for Self-Supervised Learning in Resting-State fMRI
by: Doodipala, Ruthwik Reddy, et al.
Published: (2026)

Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
by: Delestre, Cyrile, et al.
Published: (2024)

CSAI: Conditional Self-Attention Imputation for Healthcare Time-series
by: Qian, Linglong, et al.
Published: (2023)

Sessa: Selective State Space Attention
by: Horbatko, Liubomyr
Published: (2026)

Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)

Parkinson's Disease Detection from Resting State EEG using Multi-Head Graph Structure Learning with Gradient Weighted Graph Attention Explanations
by: Neves, Christopher, et al.
Published: (2024)

Hierarchical Self-Attention: Generalizing Neural Attention Mechanics to Multi-Scale Problems
by: Amizadeh, Saeed, et al.
Published: (2025)

Towards Robust Knowledge Tracing Models via k-Sparse Attention
by: Huang, Shuyan, et al.
Published: (2024)

Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers
by: Saxena, Krati, et al.
Published: (2025)

Attention on the Sphere
by: Bonev, Boris, et al.
Published: (2025)