| Main Author: | Khasia, Vladimer |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.07070 |
Similar Items
Spectral-Window Hybrid (SWH)
by: Khasia, Vladimer
Published: (2026)
Dynamic Subspace Composition: Efficient Adaptation via Contractive Basis Expansion
by: Khasia, Vladimer
Published: (2025)
HAS-VQ: Hessian-Adaptive Sparse Vector Quantization for High-Fidelity LLM Compression
by: Khasia, Vladimer
Published: (2026)
BASIS: Balanced Activation Sketching with Invariant Scalars for "Ghost Backpropagation"
by: Khasia, Vladimer
Published: (2026)
HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free Modeling
by: Khasia, Vladimer
Published: (2026)
Beyond Attention: True Adaptive World Models via Spherical Kernel Operator
by: Khasia, Vladimer
Published: (2026)
The Adaptive Vekua Cascade: A Differentiable Spectral-Analytic Solver for Physics-Informed Representation
by: Khasia, Vladimer
Published: (2025)
DeepVekua: Geometric-Spectral Representation Learning for Physics-Informed Fields
by: Khasia, Vladimer
Published: (2025)
The Vekua Layer: Exact Physical Priors for Implicit Neural Representations via Generalized Analytic Functions
by: Khasia, Vladimer
Published: (2025)
Primal: A Unified Deterministic Framework for Quasi-Orthogonal Hashing and Manifold Learning
by: Khasia, Vladimer
Published: (2025)
On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
by: Ro, Yeonju, et al.
Published: (2025)
Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers
by: Irie, Kazuki, et al.
Published: (2025)
Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation
by: Jantsch, Lasse Marten, et al.
Published: (2026)
Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers
by: Choi, Sehyun
Published: (2024)
EGMOF: Efficient Generation of Metal-Organic Frameworks Using a Hybrid Diffusion-Transformer Architecture
by: Han, Seunghee, et al.
Published: (2025)
ZeroS: Zero-Sum Linear Attention for Efficient Transformers
by: Lu, Jiecheng, et al.
Published: (2026)
Gated Linear Attention Transformers with Hardware-Efficient Training
by: Yang, Songlin, et al.
Published: (2023)
The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling
by: Kerce, J. Clayton, et al.
Published: (2026)
AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers
by: Zhu, Wenhao, et al.
Published: (2024)
Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
by: Tang, Zhongpan
Published: (2025)
On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)
Ister: Linear Transformer for Efficient Multivariate Time Series Forecasting
by: Cao, Fanpu, et al.
Published: (2024)
In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes
by: Mondal, Trishit, et al.
Published: (2026)
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
by: Becker, Philipp, et al.
Published: (2025)
PPTNet: A Hybrid Periodic Pattern-Transformer Architecture for Traffic Flow Prediction and Congestion Identification
by: Kou, Hongrui, et al.
Published: (2025)
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
by: Choromanski, Krzysztof Marcin, et al.
Published: (2023)
Dual Filter: A Transformer-like Inference Architecture for Hidden Markov Models
by: Chang, Heng-Sheng, et al.
Published: (2025)
Reducing the Transformer Architecture to a Minimum
by: Bermeitinger, Bernhard, et al.
Published: (2024)
Efficiently Transforming Neural Networks into Decision Trees: A Path to Ground Truth Explanations with RENTT
by: Monke, Helena, et al.
Published: (2025)
Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
by: Jaradat, Ghadeer, et al.
Published: (2024)
Cottention: Linear Transformers With Cosine Attention
by: Mongaras, Gabriel, et al.
Published: (2024)
Linear Transformers are Versatile In-Context Learners
by: Vladymyrov, Max, et al.
Published: (2024)
Generalized Linear Mode Connectivity for Transformers
by: Theus, Alexander, et al.
Published: (2025)
Architecture Determines Observability of Transformers
by: Carmichael, Thomas
Published: (2026)
EEG Emotion Classification Using an Enhanced Transformer-CNN-BiLSTM Architecture with Dual Attention Mechanisms
by: Karim, S M Rakib UI, et al.
Published: (2026)
Spectral Journey: How Transformers Predict the Shortest Path
by: Cohen, Andrew, et al.
Published: (2025)
PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations
by: Holzschuh, Benjamin, et al.
Published: (2025)
Separations in the Representational Capabilities of Transformers and Recurrent Architectures
by: Bhattamishra, Satwik, et al.
Published: (2024)
Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)
Simple Path Structural Encoding for Graph Transformers
by: Airale, Louis, et al.
Published: (2025)