Saved in:
| Main Authors: | Liu, Fusheng, Li, Qianxiao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.19455 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From Generalization Analysis to Optimization Designs for State Space Models
by: Liu, Fusheng, et al.
Published: (2024)
by: Liu, Fusheng, et al.
Published: (2024)
The Effect of Depth on the Expressivity of Deep Linear State-Space Models
by: Bao, Zeyu, et al.
Published: (2025)
by: Bao, Zeyu, et al.
Published: (2025)
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
by: Wang, Shida, et al.
Published: (2023)
by: Wang, Shida, et al.
Published: (2023)
Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)
by: Jiang, Haotian, et al.
Published: (2023)
Learning task-specific predictive models for scientific computing
by: Yin, Jianyuan, et al.
Published: (2025)
by: Yin, Jianyuan, et al.
Published: (2025)
Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
by: Arisaka, Sohei, et al.
Published: (2024)
by: Arisaka, Sohei, et al.
Published: (2024)
Unifying back-propagation and forward-forward algorithms through model predictive control
by: Ren, Lianhai, et al.
Published: (2024)
by: Ren, Lianhai, et al.
Published: (2024)
DynGMA: a robust approach for learning stochastic differential equations from data
by: Zhu, Aiqing, et al.
Published: (2024)
by: Zhu, Aiqing, et al.
Published: (2024)
Learning Macroscopic Dynamics from Partial Microscopic Observations
by: Chen, Mengyi, et al.
Published: (2024)
by: Chen, Mengyi, et al.
Published: (2024)
Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions
by: Jiang, Haotian, et al.
Published: (2025)
by: Jiang, Haotian, et al.
Published: (2025)
Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images
by: Zhu, Aiqing, et al.
Published: (2025)
by: Zhu, Aiqing, et al.
Published: (2025)
Machine Unlearning under Retain-Forget Entanglement
by: Cheng, Jingpu, et al.
Published: (2026)
by: Cheng, Jingpu, et al.
Published: (2026)
Deep Autocorrelation Modeling for Time-Series Forecasting: Progress and Prospects
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
Learning Permutation-invariant Macroscopic Dynamics
by: Han, Zhichao, et al.
Published: (2026)
by: Han, Zhichao, et al.
Published: (2026)
Inverse Approximation Theory for Nonlinear Recurrent Neural Networks
by: Wang, Shida, et al.
Published: (2023)
by: Wang, Shida, et al.
Published: (2023)
UnHiPPO: Uncertainty-aware Initialization for State Space Models
by: Lienen, Marten, et al.
Published: (2025)
by: Lienen, Marten, et al.
Published: (2025)
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration
by: Ren, Lianhai, et al.
Published: (2026)
by: Ren, Lianhai, et al.
Published: (2026)
InfoFlow: A Framework for Multi-Layer Transformer Analysis
by: Yu, Penghao, et al.
Published: (2026)
by: Yu, Penghao, et al.
Published: (2026)
Mimetic Initialization Helps State Space Models Learn to Recall
by: Trockman, Asher, et al.
Published: (2024)
by: Trockman, Asher, et al.
Published: (2024)
Latent Matters: Learning Deep State-Space Models
by: Klushyn, Alexej, et al.
Published: (2026)
by: Klushyn, Alexej, et al.
Published: (2026)
Mitigating distribution shift in machine learning-augmented hybrid simulation
by: Zhao, Jiaxi, et al.
Published: (2024)
by: Zhao, Jiaxi, et al.
Published: (2024)
Fast Gaussian Process Approximations for Autocorrelated Data
by: Chokhachian, Ahmadreza, et al.
Published: (2025)
by: Chokhachian, Ahmadreza, et al.
Published: (2025)
Scalable learning of macroscopic stochastic dynamics
by: Chen, Mengyi, et al.
Published: (2025)
by: Chen, Mengyi, et al.
Published: (2025)
A unified framework for establishing the universal approximation of transformer-type architectures
by: Cheng, Jingpu, et al.
Published: (2025)
by: Cheng, Jingpu, et al.
Published: (2025)
Deep learning and the rate of approximation by flows
by: Cheng, Jingpu, et al.
Published: (2026)
by: Cheng, Jingpu, et al.
Published: (2026)
Initialization Schemes for Kolmogorov-Arnold Networks: An Empirical Study
by: Rigas, Spyros, et al.
Published: (2025)
by: Rigas, Spyros, et al.
Published: (2025)
The Effect of Attention Head Count on Transformer Approximation
by: Yu, Penghao, et al.
Published: (2025)
by: Yu, Penghao, et al.
Published: (2025)
Identifiable learning of dissipative dynamics
by: Zhu, Aiqing, et al.
Published: (2025)
by: Zhu, Aiqing, et al.
Published: (2025)
Terminally constrained flow-based generative models from an optimal control perspective
by: Gao, Weiguo, et al.
Published: (2026)
by: Gao, Weiguo, et al.
Published: (2026)
Allocation of Parameters in Transformers
by: Yu, Ruoxi, et al.
Published: (2025)
by: Yu, Ruoxi, et al.
Published: (2025)
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
by: Wang, Peihao, et al.
Published: (2024)
by: Wang, Peihao, et al.
Published: (2024)
Spectra-Guided Neural Tucker Factorization
by: Wang, Fusheng, et al.
Published: (2026)
by: Wang, Fusheng, et al.
Published: (2026)
Partially-Observable Sequential Change-Point Detection for Autocorrelated Data via Upper Confidence Region
by: Xu, Haijie, et al.
Published: (2024)
by: Xu, Haijie, et al.
Published: (2024)
Analysis of Long Range Dependency Understanding in State Space Models
by: Ravikumar, Srividya, et al.
Published: (2026)
by: Ravikumar, Srividya, et al.
Published: (2026)
Unraveling the Interplay between Carryover Effects and Reward Autocorrelations in Switchback Experiments
by: Wen, Qianglin, et al.
Published: (2024)
by: Wen, Qianglin, et al.
Published: (2024)
Identifying Elasticities in Autocorrelated Time Series Using Causal Graphs
by: Tiedemann, Silvana, et al.
Published: (2024)
by: Tiedemann, Silvana, et al.
Published: (2024)
Autocorrelation Reintroduces Spectral Bias in KANs for Time Series Forecasting
by: Zeng, Chen, et al.
Published: (2026)
by: Zeng, Chen, et al.
Published: (2026)
Advancing Neural Network Performance through Emergence-Promoting Initialization Scheme
by: Li, Johnny Jingze, et al.
Published: (2024)
by: Li, Johnny Jingze, et al.
Published: (2024)
On the Crucial Role of Initialization for Matrix Factorization
by: Li, Bingcong, et al.
Published: (2024)
by: Li, Bingcong, et al.
Published: (2024)
The Driver-Blindness Phenomenon: Why Deep Sequence Models Default to Autocorrelation in Blood Glucose Forecasting
by: Shakeri, Heman
Published: (2025)
by: Shakeri, Heman
Published: (2025)
Similar Items
-
From Generalization Analysis to Optimization Designs for State Space Models
by: Liu, Fusheng, et al.
Published: (2024) -
The Effect of Depth on the Expressivity of Deep Linear State-Space Models
by: Bao, Zeyu, et al.
Published: (2025) -
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
by: Wang, Shida, et al.
Published: (2023) -
Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023) -
Learning task-specific predictive models for scientific computing
by: Yin, Jianyuan, et al.
Published: (2025)