Saved in:
| Main Authors: | Yin, Jianyuan, Li, Qianxiao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.03835 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Unifying back-propagation and forward-forward algorithms through model predictive control
by: Ren, Lianhai, et al.
Published: (2024)
by: Ren, Lianhai, et al.
Published: (2024)
Learning Macroscopic Dynamics from Partial Microscopic Observations
by: Chen, Mengyi, et al.
Published: (2024)
by: Chen, Mengyi, et al.
Published: (2024)
From Generalization Analysis to Optimization Designs for State Space Models
by: Liu, Fusheng, et al.
Published: (2024)
by: Liu, Fusheng, et al.
Published: (2024)
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
by: Liu, Fusheng, et al.
Published: (2024)
by: Liu, Fusheng, et al.
Published: (2024)
Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)
by: Jiang, Haotian, et al.
Published: (2023)
Learning Permutation-invariant Macroscopic Dynamics
by: Han, Zhichao, et al.
Published: (2026)
by: Han, Zhichao, et al.
Published: (2026)
Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving
by: Arisaka, Sohei, et al.
Published: (2024)
by: Arisaka, Sohei, et al.
Published: (2024)
DynGMA: a robust approach for learning stochastic differential equations from data
by: Zhu, Aiqing, et al.
Published: (2024)
by: Zhu, Aiqing, et al.
Published: (2024)
Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images
by: Zhu, Aiqing, et al.
Published: (2025)
by: Zhu, Aiqing, et al.
Published: (2025)
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
by: Wang, Shida, et al.
Published: (2023)
by: Wang, Shida, et al.
Published: (2023)
Terminally constrained flow-based generative models from an optimal control perspective
by: Gao, Weiguo, et al.
Published: (2026)
by: Gao, Weiguo, et al.
Published: (2026)
Inverse Approximation Theory for Nonlinear Recurrent Neural Networks
by: Wang, Shida, et al.
Published: (2023)
by: Wang, Shida, et al.
Published: (2023)
The Effect of Depth on the Expressivity of Deep Linear State-Space Models
by: Bao, Zeyu, et al.
Published: (2025)
by: Bao, Zeyu, et al.
Published: (2025)
Numerical Investigation of Sequence Modeling Theory using Controllable Memory Functions
by: Jiang, Haotian, et al.
Published: (2025)
by: Jiang, Haotian, et al.
Published: (2025)
InfoFlow: A Framework for Multi-Layer Transformer Analysis
by: Yu, Penghao, et al.
Published: (2026)
by: Yu, Penghao, et al.
Published: (2026)
Machine Unlearning under Retain-Forget Entanglement
by: Cheng, Jingpu, et al.
Published: (2026)
by: Cheng, Jingpu, et al.
Published: (2026)
Accelerating scientific discovery with the common task framework
by: Kutz, J. Nathan, et al.
Published: (2025)
by: Kutz, J. Nathan, et al.
Published: (2025)
Aviary: training language agents on challenging scientific tasks
by: Narayanan, Siddharth, et al.
Published: (2024)
by: Narayanan, Siddharth, et al.
Published: (2024)
A Geometry-Adaptive Deep Variational Framework for Phase Discovery in the Landau-Brazovskii Model
by: Xie, Yuchen, et al.
Published: (2026)
by: Xie, Yuchen, et al.
Published: (2026)
Mitigating distribution shift in machine learning-augmented hybrid simulation
by: Zhao, Jiaxi, et al.
Published: (2024)
by: Zhao, Jiaxi, et al.
Published: (2024)
Scalable learning of macroscopic stochastic dynamics
by: Chen, Mengyi, et al.
Published: (2025)
by: Chen, Mengyi, et al.
Published: (2025)
A unified framework for establishing the universal approximation of transformer-type architectures
by: Cheng, Jingpu, et al.
Published: (2025)
by: Cheng, Jingpu, et al.
Published: (2025)
Deep learning and the rate of approximation by flows
by: Cheng, Jingpu, et al.
Published: (2026)
by: Cheng, Jingpu, et al.
Published: (2026)
The Effect of Attention Head Count on Transformer Approximation
by: Yu, Penghao, et al.
Published: (2025)
by: Yu, Penghao, et al.
Published: (2025)
Identifiable learning of dissipative dynamics
by: Zhu, Aiqing, et al.
Published: (2025)
by: Zhu, Aiqing, et al.
Published: (2025)
Investigating task-specific prompts and sparse autoencoders for activation monitoring
by: Tillman, Henk, et al.
Published: (2025)
by: Tillman, Henk, et al.
Published: (2025)
Next-generation reservoir computing validated by classification task
by: Kitayama, Ken-ichi
Published: (2025)
by: Kitayama, Ken-ichi
Published: (2025)
Contrastive General Graph Matching with Adaptive Augmentation Sampling
by: Bo, Jianyuan, et al.
Published: (2024)
by: Bo, Jianyuan, et al.
Published: (2024)
Allocation of Parameters in Transformers
by: Yu, Ruoxi, et al.
Published: (2025)
by: Yu, Ruoxi, et al.
Published: (2025)
Physical formula enhanced multi-task learning for pharmacokinetics prediction
by: Li, Ruifeng, et al.
Published: (2024)
by: Li, Ruifeng, et al.
Published: (2024)
Hierarchical Relational Learning for Few-Shot Knowledge Graph Completion
by: Wu, Han, et al.
Published: (2022)
by: Wu, Han, et al.
Published: (2022)
Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology
by: Nahhas, Omar S. M. El, et al.
Published: (2024)
by: Nahhas, Omar S. M. El, et al.
Published: (2024)
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration
by: Ren, Lianhai, et al.
Published: (2026)
by: Ren, Lianhai, et al.
Published: (2026)
CORE: Contrastive Masked Feature Reconstruction on Graphs
by: Bo, Jianyuan, et al.
Published: (2025)
by: Bo, Jianyuan, et al.
Published: (2025)
Simultaneous identification of models and parameters of scientific simulators
by: Schröder, Cornelius, et al.
Published: (2023)
by: Schröder, Cornelius, et al.
Published: (2023)
Tackling prediction tasks in relational databases with LLMs
by: Wydmuch, Marek, et al.
Published: (2024)
by: Wydmuch, Marek, et al.
Published: (2024)
A representation-learning game for classes of prediction tasks
by: Uzan, Neria, et al.
Published: (2024)
by: Uzan, Neria, et al.
Published: (2024)
Hypothesis-driven construction of mesoscopic dynamics
by: Li, Zhuoyuan, et al.
Published: (2026)
by: Li, Zhuoyuan, et al.
Published: (2026)
Multi-task Heterogeneous Graph Learning on Electronic Health Records
by: Chan, Tsai Hor, et al.
Published: (2024)
by: Chan, Tsai Hor, et al.
Published: (2024)
Reducing cross-sample prediction churn in scientific machine learning
by: Prastalo, Gordan, et al.
Published: (2026)
by: Prastalo, Gordan, et al.
Published: (2026)
Similar Items
-
Unifying back-propagation and forward-forward algorithms through model predictive control
by: Ren, Lianhai, et al.
Published: (2024) -
Learning Macroscopic Dynamics from Partial Microscopic Observations
by: Chen, Mengyi, et al.
Published: (2024) -
From Generalization Analysis to Optimization Designs for State Space Models
by: Liu, Fusheng, et al.
Published: (2024) -
Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models
by: Liu, Fusheng, et al.
Published: (2024) -
Approximation Rate of the Transformer Architecture for Sequence Modeling
by: Jiang, Haotian, et al.
Published: (2023)