| Main Authors: | Lai, Zhao-Rong, Yang, Haisheng |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.18596 |
Similar Items
Autonomous Sparse Mean-CVaR Portfolio Optimization
by: Lin, Yizun, et al.
Published: (2024)
Invariant Risk Minimization Is A Total Variation Model
by: Lai, Zhao-Rong, et al.
Published: (2024)
Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off
by: Zhao, Mingkuan, et al.
Published: (2025)
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs
by: Nawrot, Piotr, et al.
Published: (2025)
Out-of-distribution Generalization for Total Variation based Invariant Risk Minimization
by: Wang, Yuanchao, et al.
Published: (2025)
Sparse Hybrid Linear-Morphological Networks
by: Fotopoulos, Konstantinos, et al.
Published: (2025)
Sparse Linear Regression and Lattice Problems
by: Gupte, Aparna, et al.
Published: (2024)
Sparse Linear Bandits with Blocking Constraints
by: Jain, Adit, et al.
Published: (2024)
Accelerating Frequency Domain Diffusion Models with Error-Feedback Event-Driven Caching
by: Liu, Dong, et al.
Published: (2026)
Step-Level Sparse Autoencoder for Reasoning Process Interpretation
by: Yang, Xuan, et al.
Published: (2026)
Accelerating Sparse Transformer Inference on GPU
by: Dai, Wenhao, et al.
Published: (2025)
Spectrum-Informed Multistage Neural Networks: Multiscale Function Approximators of Machine Precision
by: Ng, Jakin, et al.
Published: (2024)
A De-singularity Subgradient Approach for the Extended Weber Location Problem
by: Lai, Zhao-Rong, et al.
Published: (2024)
Follow The Approximate Sparse Leader for No-Regret Online Sparse Linear Approximation
by: Mukhopadhyay, Samrat, et al.
Published: (2025)
Identifiability Challenges in Sparse Linear Ordinary Differential Equations
by: Casolo, Cecilia, et al.
Published: (2025)
Induced Covariance for Causal Discovery in Linear Sparse Structures
by: Mohseni-Sehdeh, Saeed, et al.
Published: (2024)
Enhancing Linear Attention with Residual Learning
by: Lai, Xunhao, et al.
Published: (2025)
How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression
by: Chen, Xingwu, et al.
Published: (2024)
Spectrum Extraction and Clipping for Implicitly Linear Layers
by: Boroojeny, Ali Ebrahimpour, et al.
Published: (2024)
Sparse Graphical Linear Dynamical Systems
by: Chouzenoux, Emilie, et al.
Published: (2023)
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
by: Choromanski, Krzysztof Marcin, et al.
Published: (2023)
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment
by: Wu, Haoyuan, et al.
Published: (2025)
Controllable Pareto Trade-off between Fairness and Accuracy
by: Du, Yongkang, et al.
Published: (2025)
SSH: Sparse Spectrum Adaptation via Discrete Hartley Transformation
by: Shen, Yixian, et al.
Published: (2025)
Scaling Linear Attention with Sparse State Expansion
by: Pan, Yuqi, et al.
Published: (2025)
Linear Mode Connectivity in Sparse Neural Networks
by: McDermott, Luke, et al.
Published: (2023)
Pareto Continual Learning: Preference-Conditioned Learning and Adaption for Dynamic Stability-Plasticity Trade-off
by: Lai, Song, et al.
Published: (2025)
Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input
by: Chen, Ziang, et al.
Published: (2024)
Learning to Forget: Bayesian Time Series Forecasting using Recurrent Sparse Spectrum Signature Gaussian Processes
by: Tóth, Csaba, et al.
Published: (2024)
Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
by: Du, Ally Yalei, et al.
Published: (2024)
Optimal Cross-Validation for Sparse Linear Regression
by: Cory-Wright, Ryan, et al.
Published: (2023)
Environment-Conditioned Tail Reweighting for Total Variation Invariant Risk Minimization
by: Wang, Yuanchao, et al.
Published: (2026)
Discussing the Spectrum of Physics-Enhanced Machine Learning; a Survey on Structural Mechanics Applications
by: Haywood-Alexander, Marcus, et al.
Published: (2023)
Minimizing False-Positive Attributions in Explanations of Non-Linear Models
by: Gjølbye, Anders, et al.
Published: (2025)
Position: Curvature Matrices Should Be Democratized via Linear Operators
by: Dangel, Felix, et al.
Published: (2025)
Learning and Transferring Sparse Contextual Bigrams with Linear Transformers
by: Ren, Yunwei, et al.
Published: (2024)
SEA: Sparse Linear Attention with Estimated Attention Mask
by: Lee, Heejun, et al.
Published: (2023)
FRWKV+: Adaptive Periodic-Position Branch Interaction for Frequency-Space Linear Time Series Forecasting
by: Yang, Qingyuan, et al.
Published: (2026)
Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models
by: Lai, Jinlin, et al.
Published: (2024)
In-context Learning for Mixture of Linear Regressions: Existence, Generalization and Training Dynamics
by: Jin, Yanhao, et al.
Published: (2024)