:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Fusheng, Li, Qianxiao
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2405.02670
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
by: Liu, Fusheng, et al.
Published: (2024)

The Effect of Depth on the Expressivity of Deep Linear State-Space Models
by: Bao, Zeyu, et al.
Published: (2025)

StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
by: Wang, Shida, et al.
Published: (2023)

Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)

Learning task-specific predictive models for scientific computing
by: Yin, Jianyuan, et al.
Published: (2025)

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration
by: Ren, Lianhai, et al.
Published: (2026)

Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
by: Arisaka, Sohei, et al.
Published: (2024)

Unifying back-propagation and forward-forward algorithms through model predictive control
by: Ren, Lianhai, et al.
Published: (2024)

InfoFlow: A Framework for Multi-Layer Transformer Analysis
by: Yu, Penghao, et al.
Published: (2026)

DynGMA: a robust approach for learning stochastic differential equations from data
by: Zhu, Aiqing, et al.
Published: (2024)

Learning Macroscopic Dynamics from Partial Microscopic Observations
by: Chen, Mengyi, et al.
Published: (2024)

On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis
by: Li, Zhong, et al.
Published: (2020)

Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions
by: Jiang, Haotian, et al.
Published: (2025)

Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images
by: Zhu, Aiqing, et al.
Published: (2025)

Machine Unlearning under Retain-Forget Entanglement
by: Cheng, Jingpu, et al.
Published: (2026)

Learning Permutation-invariant Macroscopic Dynamics
by: Han, Zhichao, et al.
Published: (2026)

Inverse Approximation Theory for Nonlinear Recurrent Neural Networks
by: Wang, Shida, et al.
Published: (2023)

Mitigating distribution shift in machine learning-augmented hybrid simulation
by: Zhao, Jiaxi, et al.
Published: (2024)

Scalable learning of macroscopic stochastic dynamics
by: Chen, Mengyi, et al.
Published: (2025)

A unified framework for establishing the universal approximation of transformer-type architectures
by: Cheng, Jingpu, et al.
Published: (2025)

Deep learning and the rate of approximation by flows
by: Cheng, Jingpu, et al.
Published: (2026)

Multi-Objective Latent Space Optimization of Generative Molecular Design Models
by: Abeer, A N M Nafiz, et al.
Published: (2022)

The Effect of Attention Head Count on Transformer Approximation
by: Yu, Penghao, et al.
Published: (2025)

Identifiable learning of dissipative dynamics
by: Zhu, Aiqing, et al.
Published: (2025)

Terminally constrained flow-based generative models from an optimal control perspective
by: Gao, Weiguo, et al.
Published: (2026)

Generalization Error Analysis for Selective State-Space Models Through the Lens of Attention
by: Honarpisheh, Arya, et al.
Published: (2025)

Allocation of Parameters in Transformers
by: Yu, Ruoxi, et al.
Published: (2025)

Spectra-Guided Neural Tucker Factorization
by: Wang, Fusheng, et al.
Published: (2026)

A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space Models
by: Shandirasegaran, Mugunthan, et al.
Published: (2026)

Paid with Models: Optimal Contract Design for Collaborative Machine Learning
by: Wang, Bingchen, et al.
Published: (2024)

Controllability Analysis of State Space-based Language Model
by: Mabrok, Mohamed, et al.
Published: (2025)

Global Optimization By Gradient From Hierarchical Score-Matching Spaces
by: Li, Ming
Published: (2026)

GG-SSMs: Graph-Generating State Space Models
by: Zubić, Nikola, et al.
Published: (2024)

Parallelizing Autoregressive Generation with Variational State Space Models
by: Lambrechts, Gaspard, et al.
Published: (2024)

Hypothesis-driven construction of mesoscopic dynamics
by: Li, Zhuoyuan, et al.
Published: (2026)

Generalization Analysis for Classification on Korobov Space
by: Liu, Yuqing
Published: (2025)

Quantum-Optimized Selective State Space Model for Efficient Time Series Prediction
by: Jura, Stefan-Alexandru, et al.
Published: (2025)

From Layers to States: A State Space Model Perspective to Deep Neural Network Layer Dynamics
by: Liu, Qinshuo, et al.
Published: (2025)

PerfMamba: Performance Analysis and Pruning of Selective State Space Models
by: Asif, Abdullah Al, et al.
Published: (2025)

Longhorn: State Space Models are Amortized Online Learners
by: Liu, Bo, et al.
Published: (2024)