:: Library Catalog

Bibliographic Details
Main Author:	Khasia, Vladimer
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.07070

Similar Items

Spectral-Window Hybrid (SWH)
by: Khasia, Vladimer
Published: (2026)

Dynamic Subspace Composition: Efficient Adaptation via Contractive Basis Expansion
by: Khasia, Vladimer
Published: (2025)

HAS-VQ: Hessian-Adaptive Sparse Vector Quantization for High-Fidelity LLM Compression
by: Khasia, Vladimer
Published: (2026)

BASIS: Balanced Activation Sketching with Invariant Scalars for "Ghost Backpropagation"
by: Khasia, Vladimer
Published: (2026)

HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free Modeling
by: Khasia, Vladimer
Published: (2026)

Beyond Attention: True Adaptive World Models via Spherical Kernel Operator
by: Khasia, Vladimer
Published: (2026)

The Adaptive Vekua Cascade: A Differentiable Spectral-Analytic Solver for Physics-Informed Representation
by: Khasia, Vladimer
Published: (2025)

DeepVekua: Geometric-Spectral Representation Learning for Physics-Informed Fields
by: Khasia, Vladimer
Published: (2025)

The Vekua Layer: Exact Physical Priors for Implicit Neural Representations via Generalized Analytic Functions
by: Khasia, Vladimer
Published: (2025)

Primal: A Unified Deterministic Framework for Quasi-Orthogonal Hashing and Manifold Learning
by: Khasia, Vladimer
Published: (2025)

On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
by: Ro, Yeonju, et al.
Published: (2025)

Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers
by: Irie, Kazuki, et al.
Published: (2025)

Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation
by: Jantsch, Lasse Marten, et al.
Published: (2026)

Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers
by: Choi, Sehyun
Published: (2024)

EGMOF: Efficient Generation of Metal-Organic Frameworks Using a Hybrid Diffusion-Transformer Architecture
by: Han, Seunghee, et al.
Published: (2025)

ZeroS: Zero-Sum Linear Attention for Efficient Transformers
by: Lu, Jiecheng, et al.
Published: (2026)

Gated Linear Attention Transformers with Hardware-Efficient Training
by: Yang, Songlin, et al.
Published: (2023)

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling
by: Kerce, J. Clayton, et al.
Published: (2026)

AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers
by: Zhu, Wenhao, et al.
Published: (2024)

Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
by: Tang, Zhongpan
Published: (2025)

On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)

Ister: Linear Transformer for Efficient Multivariate Time Series Forecasting
by: Cao, Fanpu, et al.
Published: (2024)

In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes
by: Mondal, Trishit, et al.
Published: (2026)

EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
by: Becker, Philipp, et al.
Published: (2025)

PPTNet: A Hybrid Periodic Pattern-Transformer Architecture for Traffic Flow Prediction and Congestion Identification
by: Kou, Hongrui, et al.
Published: (2025)

Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
by: Choromanski, Krzysztof Marcin, et al.
Published: (2023)

Dual Filter: A Transformer-like Inference Architecture for Hidden Markov Models
by: Chang, Heng-Sheng, et al.
Published: (2025)

Reducing the Transformer Architecture to a Minimum
by: Bermeitinger, Bernhard, et al.
Published: (2024)

Efficiently Transforming Neural Networks into Decision Trees: A Path to Ground Truth Explanations with RENTT
by: Monke, Helena, et al.
Published: (2025)

Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
by: Jaradat, Ghadeer, et al.
Published: (2024)

Cottention: Linear Transformers With Cosine Attention
by: Mongaras, Gabriel, et al.
Published: (2024)

Linear Transformers are Versatile In-Context Learners
by: Vladymyrov, Max, et al.
Published: (2024)

Generalized Linear Mode Connectivity for Transformers
by: Theus, Alexander, et al.
Published: (2025)

Architecture Determines Observability of Transformers
by: Carmichael, Thomas
Published: (2026)

EEG Emotion Classification Using an Enhanced Transformer-CNN-BiLSTM Architecture with Dual Attention Mechanisms
by: Karim, S M Rakib UI, et al.
Published: (2026)

Spectral Journey: How Transformers Predict the Shortest Path
by: Cohen, Andrew, et al.
Published: (2025)

PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations
by: Holzschuh, Benjamin, et al.
Published: (2025)

Separations in the Representational Capabilities of Transformers and Recurrent Architectures
by: Bhattamishra, Satwik, et al.
Published: (2024)

Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)

Simple Path Structural Encoding for Graph Transformers
by: Airale, Louis, et al.
Published: (2025)