:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lai, Zhao-Rong, Yang, Haisheng
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2508.18596
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Autonomous Sparse Mean-CVaR Portfolio Optimization
by: Lin, Yizun, et al.
Published: (2024)

Invariant Risk Minimization Is A Total Variation Model
by: Lai, Zhao-Rong, et al.
Published: (2024)

Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off
by: Zhao, Mingkuan, et al.
Published: (2025)

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs
by: Nawrot, Piotr, et al.
Published: (2025)

Out-of-distribution Generalization for Total Variation based Invariant Risk Minimization
by: Wang, Yuanchao, et al.
Published: (2025)

Sparse Hybrid Linear-Morphological Networks
by: Fotopoulos, Konstantinos, et al.
Published: (2025)

Sparse Linear Regression and Lattice Problems
by: Gupte, Aparna, et al.
Published: (2024)

Sparse Linear Bandits with Blocking Constraints
by: Jain, Adit, et al.
Published: (2024)

Accelerating Frequency Domain Diffusion Models with Error-Feedback Event-Driven Caching
by: Liu, Dong, et al.
Published: (2026)

Step-Level Sparse Autoencoder for Reasoning Process Interpretation
by: Yang, Xuan, et al.
Published: (2026)

Accelerating Sparse Transformer Inference on GPU
by: Dai, Wenhao, et al.
Published: (2025)

Spectrum-Informed Multistage Neural Networks: Multiscale Function Approximators of Machine Precision
by: Ng, Jakin, et al.
Published: (2024)

A De-singularity Subgradient Approach for the Extended Weber Location Problem
by: Lai, Zhao-Rong, et al.
Published: (2024)

Follow The Approximate Sparse Leader for No-Regret Online Sparse Linear Approximation
by: Mukhopadhyay, Samrat, et al.
Published: (2025)

Identifiability Challenges in Sparse Linear Ordinary Differential Equations
by: Casolo, Cecilia, et al.
Published: (2025)

Induced Covariance for Causal Discovery in Linear Sparse Structures
by: Mohseni-Sehdeh, Saeed, et al.
Published: (2024)

Enhancing Linear Attention with Residual Learning
by: Lai, Xunhao, et al.
Published: (2025)

How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression
by: Chen, Xingwu, et al.
Published: (2024)

Spectrum Extraction and Clipping for Implicitly Linear Layers
by: Boroojeny, Ali Ebrahimpour, et al.
Published: (2024)

Sparse Graphical Linear Dynamical Systems
by: Chouzenoux, Emilie, et al.
Published: (2023)

Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
by: Choromanski, Krzysztof Marcin, et al.
Published: (2023)

Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment
by: Wu, Haoyuan, et al.
Published: (2025)

Controllable Pareto Trade-off between Fairness and Accuracy
by: Du, Yongkang, et al.
Published: (2025)

SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
by: Shen, Yixian, et al.
Published: (2025)

Scaling Linear Attention with Sparse State Expansion
by: Pan, Yuqi, et al.
Published: (2025)

Linear Mode Connectivity in Sparse Neural Networks
by: McDermott, Luke, et al.
Published: (2023)

Pareto Continual Learning: Preference-Conditioned Learning and Adaption for Dynamic Stability-Plasticity Trade-off
by: Lai, Song, et al.
Published: (2025)

Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input
by: Chen, Ziang, et al.
Published: (2024)

Learning to Forget: Bayesian Time Series Forecasting using Recurrent Sparse Spectrum Signature Gaussian Processes
by: Tóth, Csaba, et al.
Published: (2024)

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
by: Du, Ally Yalei, et al.
Published: (2024)

Optimal Cross-Validation for Sparse Linear Regression
by: Cory-Wright, Ryan, et al.
Published: (2023)

Environment-Conditioned Tail Reweighting for Total Variation Invariant Risk Minimization
by: Wang, Yuanchao, et al.
Published: (2026)

Discussing the Spectrum of Physics-Enhanced Machine Learning; a Survey on Structural Mechanics Applications
by: Haywood-Alexander, Marcus, et al.
Published: (2023)

Minimizing False-Positive Attributions in Explanations of Non-Linear Models
by: Gjølbye, Anders, et al.
Published: (2025)

Position: Curvature Matrices Should Be Democratized via Linear Operators
by: Dangel, Felix, et al.
Published: (2025)

Learning and Transferring Sparse Contextual Bigrams with Linear Transformers
by: Ren, Yunwei, et al.
Published: (2024)

SEA: Sparse Linear Attention with Estimated Attention Mask
by: Lee, Heejun, et al.
Published: (2023)

FRWKV+: Adaptive Periodic-Position Branch Interaction for Frequency-Space Linear Time Series Forecasting
by: Yang, Qingyuan, et al.
Published: (2026)

Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models
by: Lai, Jinlin, et al.
Published: (2024)

In-context Learning for Mixture of Linear Regressions: Existence, Generalization and Training Dynamics
by: Jin, Yanhao, et al.
Published: (2024)