Saved in:
| Main Authors: | Li, Binghui, Chen, Fengling, Huang, Zixun, Wang, Lean, Wu, Lei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.19189 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Optimal Learning-Rate Schedules under Functional Scaling Laws: Power Decay and Warmup-Stable-Decay
by: Li, Binghui, et al.
Published: (2026)
by: Li, Binghui, et al.
Published: (2026)
Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws
by: Wang, Jinbo, et al.
Published: (2026)
by: Wang, Jinbo, et al.
Published: (2026)
Muon in Associative Memory Learning: Training Dynamics and Scaling Laws
by: Li, Binghui, et al.
Published: (2026)
by: Li, Binghui, et al.
Published: (2026)
Optimal Rates and Saturation for Noiseless Kernel Ridge Regression
by: Long, Jihao, et al.
Published: (2024)
by: Long, Jihao, et al.
Published: (2024)
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
by: Chen, Yifang, et al.
Published: (2025)
by: Chen, Yifang, et al.
Published: (2025)
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
by: Luo, Kairong, et al.
Published: (2025)
by: Luo, Kairong, et al.
Published: (2025)
Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning
by: He, Fan, et al.
Published: (2024)
by: He, Fan, et al.
Published: (2024)
On the Robustness of Kernel Ridge Regression Using the Cauchy Loss Function
by: Wen, Hongwei, et al.
Published: (2025)
by: Wen, Hongwei, et al.
Published: (2025)
Larger Datasets Can Be Repeated More: A Theoretical Analysis of Multi-Epoch Scaling in Linear Regression
by: Yan, Tingkai, et al.
Published: (2025)
by: Yan, Tingkai, et al.
Published: (2025)
Scaling Law with Learning Rate Annealing
by: Tissue, Howe, et al.
Published: (2024)
by: Tissue, Howe, et al.
Published: (2024)
Statistical Optimality of Divide and Conquer Kernel-based Functional Linear Regression
by: Liu, Jiading, et al.
Published: (2022)
by: Liu, Jiading, et al.
Published: (2022)
Theory of Optimal Learning Rate Schedules and Scaling Laws for a Random Feature Model
by: Bordelon, Blake, et al.
Published: (2026)
by: Bordelon, Blake, et al.
Published: (2026)
Convex Dominance in Deep Learning I: A Scaling Law of Loss and Learning Rate
by: Bu, Zhiqi, et al.
Published: (2026)
by: Bu, Zhiqi, et al.
Published: (2026)
Bayesian Kernel Regression for Functional Data
by: Kusaba, Minoru, et al.
Published: (2025)
by: Kusaba, Minoru, et al.
Published: (2025)
On the Clean Generalization and Robust Overfitting in Adversarial Training from Two Theoretical Views: Representation Complexity and Training Dynamics
by: Li, Binghui, et al.
Published: (2023)
by: Li, Binghui, et al.
Published: (2023)
Towards Scaling Laws for Symbolic Regression
by: Otte, David, et al.
Published: (2025)
by: Otte, David, et al.
Published: (2025)
Transfer Learning for Kernel-based Regression
by: Wang, Chao, et al.
Published: (2023)
by: Wang, Chao, et al.
Published: (2023)
OBLR-PO: A Theoretical Framework for Stable Reinforcement Learning
by: Huang, Zixun, et al.
Published: (2025)
by: Huang, Zixun, et al.
Published: (2025)
Optimal Rates of Kernel Ridge Regression under Source Condition in Large Dimensions
by: Zhang, Haobo, et al.
Published: (2024)
by: Zhang, Haobo, et al.
Published: (2024)
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data
by: Li, Binghui, et al.
Published: (2024)
by: Li, Binghui, et al.
Published: (2024)
Improved Scaling Laws via Weak-to-Strong Generalization in Random Feature Ridge Regression
by: Wu, Diyuan, et al.
Published: (2026)
by: Wu, Diyuan, et al.
Published: (2026)
ScheduleFree+: Scaling Learning-Rate-Free & Schedule-Free Learning to Large Language Models
by: Defazio, Aaron
Published: (2026)
by: Defazio, Aaron
Published: (2026)
How Should LLMs Consume High-Quality Data? Optimal Data Scheduling via Quality-Aware Functional Scaling Laws
by: Zhu, Zhitao, et al.
Published: (2026)
by: Zhu, Zhitao, et al.
Published: (2026)
Improved Scaling Laws in Linear Regression via Data Reuse
by: Lin, Licong, et al.
Published: (2025)
by: Lin, Licong, et al.
Published: (2025)
Optimal Rate of Kernel Regression in Large Dimensions
by: Lu, Weihao, et al.
Published: (2023)
by: Lu, Weihao, et al.
Published: (2023)
Transfer Learning of CATE with Kernel Ridge Regression
by: Kim, Seok-Jin, et al.
Published: (2025)
by: Kim, Seok-Jin, et al.
Published: (2025)
Inf2Guard: An Information-Theoretic Framework for Learning Privacy-Preserving Representations against Inference Attacks
by: Noorbakhsh, Sayedeh Leila, et al.
Published: (2024)
by: Noorbakhsh, Sayedeh Leila, et al.
Published: (2024)
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
by: Huang, Zixuan, et al.
Published: (2025)
by: Huang, Zixuan, et al.
Published: (2025)
Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method
by: Li, Jiate, et al.
Published: (2024)
by: Li, Jiate, et al.
Published: (2024)
Neural Scaling Laws for Deep Regression
by: Cadez, Tilen, et al.
Published: (2025)
by: Cadez, Tilen, et al.
Published: (2025)
Learning Multi-Index Models with Hyper-Kernel Ridge Regression
by: Huang, Shuo, et al.
Published: (2025)
by: Huang, Shuo, et al.
Published: (2025)
Measure-Theoretic Anti-Causal Representation Learning
by: Behnam, Arman, et al.
Published: (2025)
by: Behnam, Arman, et al.
Published: (2025)
Loss-to-Loss Prediction: Scaling Laws for All Datasets
by: Brandfonbrener, David, et al.
Published: (2024)
by: Brandfonbrener, David, et al.
Published: (2024)
Scaling Laws in Linear Regression: Compute, Parameters, and Data
by: Lin, Licong, et al.
Published: (2024)
by: Lin, Licong, et al.
Published: (2024)
Kernel Stochastic Configuration Networks for Nonlinear Regression
by: Chen, Yongxuan, et al.
Published: (2024)
by: Chen, Yongxuan, et al.
Published: (2024)
FedTilt: Towards Multi-Level Fairness-Preserving and Robust Federated Learning
by: Zhang, Binghui, et al.
Published: (2025)
by: Zhang, Binghui, et al.
Published: (2025)
Learning Robust and Privacy-Preserving Representations via Information Theory
by: Zhang, Binghui, et al.
Published: (2024)
by: Zhang, Binghui, et al.
Published: (2024)
Decoupled Relative Learning Rate Schedules
by: Ludziejewski, Jan, et al.
Published: (2025)
by: Ludziejewski, Jan, et al.
Published: (2025)
Large Dimensional Kernel Ridge Regression: Extending to Product Kernels
by: Zhou, Yang, et al.
Published: (2026)
by: Zhou, Yang, et al.
Published: (2026)
On the Saturation Effect of Kernel Ridge Regression
by: Li, Yicheng, et al.
Published: (2024)
by: Li, Yicheng, et al.
Published: (2024)
Similar Items
-
Optimal Learning-Rate Schedules under Functional Scaling Laws: Power Decay and Warmup-Stable-Decay
by: Li, Binghui, et al.
Published: (2026) -
Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws
by: Wang, Jinbo, et al.
Published: (2026) -
Muon in Associative Memory Learning: Training Dynamics and Scaling Laws
by: Li, Binghui, et al.
Published: (2026) -
Optimal Rates and Saturation for Noiseless Kernel Ridge Regression
by: Long, Jihao, et al.
Published: (2024) -
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
by: Chen, Yifang, et al.
Published: (2025)