Saved in:
| Main Authors: | Long, Jiangxuan, Song, Zhao, Yang, Chiwun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.14076 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Curse of Attention: A Kernel-Based Perspective for Why Transformers Fail to Generalize on Time Series Forecasting and Beyond
by: Ke, Yekun, et al.
Published: (2024)
by: Ke, Yekun, et al.
Published: (2024)
Unifying Learning Dynamics and Generalization in Transformers Scaling Law
by: Yang, Chiwun
Published: (2025)
by: Yang, Chiwun
Published: (2025)
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
by: Gong, Chengyue, et al.
Published: (2025)
by: Gong, Chengyue, et al.
Published: (2025)
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
How Sparse Attention Approximates Exact Attention? Your Attention is Naturally $n^C$-Sparse
by: Deng, Yichuan, et al.
Published: (2024)
by: Deng, Yichuan, et al.
Published: (2024)
Enhancing Stochastic Gradient Descent: A Unified Framework and Novel Acceleration Methods for Faster Convergence
by: Deng, Yichuan, et al.
Published: (2024)
by: Deng, Yichuan, et al.
Published: (2024)
Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix
by: Liang, Yingyu, et al.
Published: (2024)
by: Liang, Yingyu, et al.
Published: (2024)
Unlocking the Theory Behind Scaling 1-Bit Neural Networks
by: Daliri, Majid, et al.
Published: (2024)
by: Daliri, Majid, et al.
Published: (2024)
Towards Infinite-Long Prefix in Transformer
by: Liang, Yingyu, et al.
Published: (2024)
by: Liang, Yingyu, et al.
Published: (2024)
FlowTS: Time Series Generation via Rectified Flow
by: Hu, Yang, et al.
Published: (2024)
by: Hu, Yang, et al.
Published: (2024)
Synthetic Series-Symbol Data Generation for Time Series Foundation Models
by: Wang, Wenxuan, et al.
Published: (2025)
by: Wang, Wenxuan, et al.
Published: (2025)
Towards High-Order Mean Flow Generative Models: Feasibility, Expressivity, and Provably Efficient Criteria
by: Cao, Yang, et al.
Published: (2025)
by: Cao, Yang, et al.
Published: (2025)
A Theoretical Analysis of Discrete Flow Matching Generative Models
by: Su, Maojiang, et al.
Published: (2025)
by: Su, Maojiang, et al.
Published: (2025)
Leveraging Generic Time Series Foundation Models for EEG Classification
by: Gnassounou, Théo, et al.
Published: (2025)
by: Gnassounou, Théo, et al.
Published: (2025)
PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation
by: Zhang, Junru, et al.
Published: (2026)
by: Zhang, Junru, et al.
Published: (2026)
TimeDiT: General-purpose Diffusion Transformers for Time Series Foundation Model
by: Cao, Defu, et al.
Published: (2024)
by: Cao, Defu, et al.
Published: (2024)
Theoretical Foundations of Scaling Law in Familial Models
by: Song, Huan, et al.
Published: (2025)
by: Song, Huan, et al.
Published: (2025)
A Mamba Foundation Model for Time Series Forecasting
by: Ma, Haoyu, et al.
Published: (2024)
by: Ma, Haoyu, et al.
Published: (2024)
Locally Linear Continual Learning for Time Series based on VC-Theoretical Generalization Bounds
by: Ferreira, Yan V. G., et al.
Published: (2026)
by: Ferreira, Yan V. G., et al.
Published: (2026)
Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation
by: Wang, Wenxuan, et al.
Published: (2025)
by: Wang, Wenxuan, et al.
Published: (2025)
Provably Robust Adaptation for Language-Empowered Foundation Models
by: Lai, Yuni, et al.
Published: (2025)
by: Lai, Yuni, et al.
Published: (2025)
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
by: Cho, Taehyun, et al.
Published: (2024)
by: Cho, Taehyun, et al.
Published: (2024)
TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster
by: Ning, Kanghui, et al.
Published: (2025)
by: Ning, Kanghui, et al.
Published: (2025)
Circuit Complexity Bounds for RoPE-based Transformer Architecture
by: Chen, Bo, et al.
Published: (2024)
by: Chen, Bo, et al.
Published: (2024)
Towards Stable and Structured Time Series Generation with Perturbation-Aware Flow Matching
by: Zhang, Jintao, et al.
Published: (2025)
by: Zhang, Jintao, et al.
Published: (2025)
Provable Zero-Shot Generalization in Offline Reinforcement Learning
by: Wang, Zhiyong, et al.
Published: (2025)
by: Wang, Zhiyong, et al.
Published: (2025)
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
by: Zhao, Runze, et al.
Published: (2025)
by: Zhao, Runze, et al.
Published: (2025)
T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models
by: Ge, Yunfeng, et al.
Published: (2025)
by: Ge, Yunfeng, et al.
Published: (2025)
Discrete Prototypical Memories for Federated Time Series Foundation Models
by: Deng, Liwei, et al.
Published: (2026)
by: Deng, Liwei, et al.
Published: (2026)
Population Aware Diffusion for Time Series Generation
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Provable Generalization in Overparameterized Neural Nets
by: Dhingra, Aviral
Published: (2025)
by: Dhingra, Aviral
Published: (2025)
Diversified Scaling Inference in Time Series Foundation Models
by: Hua, Ruijin, et al.
Published: (2026)
by: Hua, Ruijin, et al.
Published: (2026)
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
by: Ren, Lei, et al.
Published: (2024)
by: Ren, Lei, et al.
Published: (2024)
Distilling Time Series Foundation Models for Efficient Forecasting
by: Li, Yuqi, et al.
Published: (2026)
by: Li, Yuqi, et al.
Published: (2026)
Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond
by: Cao, Yang, et al.
Published: (2024)
by: Cao, Yang, et al.
Published: (2024)
TimeOmni-VL: Unified Models for Time Series Understanding and Generation
by: Guan, Tong, et al.
Published: (2026)
by: Guan, Tong, et al.
Published: (2026)
On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation
by: Huang, Jiawei, et al.
Published: (2023)
by: Huang, Jiawei, et al.
Published: (2023)
Characteristic Learning for Provable One Step Generation
by: Ding, Zhao, et al.
Published: (2024)
by: Ding, Zhao, et al.
Published: (2024)
Towards Neural Scaling Laws for Time Series Foundation Models
by: Yao, Qingren, et al.
Published: (2024)
by: Yao, Qingren, et al.
Published: (2024)
Diffusion-TS: Interpretable Diffusion for General Time Series Generation
by: Yuan, Xinyu, et al.
Published: (2024)
by: Yuan, Xinyu, et al.
Published: (2024)
Similar Items
-
Curse of Attention: A Kernel-Based Perspective for Why Transformers Fail to Generalize on Time Series Forecasting and Beyond
by: Ke, Yekun, et al.
Published: (2024) -
Unifying Learning Dynamics and Generalization in Transformers Scaling Law
by: Yang, Chiwun
Published: (2025) -
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
by: Gong, Chengyue, et al.
Published: (2025) -
Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation
by: Cao, Yang, et al.
Published: (2025) -
How Sparse Attention Approximates Exact Attention? Your Attention is Naturally $n^C$-Sparse
by: Deng, Yichuan, et al.
Published: (2024)