| Main Authors: | Chen, Ziyan, Zhou, Ding-Xuan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.24316 |
Similar Items
Improved Scaling Laws in Linear Regression via Data Reuse
by: Lin, Licong, et al.
Published: (2025)
Scaling Laws of SignSGD in Linear Regression: When Does It Outperform SGD?
by: Kim, Jihwan, et al.
Published: (2026)
Accelerating Single-Pass SGD for Generalized Linear Prediction
by: Chen, Qian, et al.
Published: (2026)
Perfect Parallelization in Mini-Batch SGD with Classical Momentum Acceleration
by: Garg, Sachin, et al.
Published: (2026)
Data Deletion for Linear Regression with Noisy SGD
by: Xia, Zhangjie, et al.
Published: (2024)
Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning
by: Kovačević, Filip, et al.
Published: (2026)
Scaling Laws for Precision in High-Dimensional Linear Regression
by: Zhang, Dechen, et al.
Published: (2026)
Heavy-Tailed Linear Bandits: Huber Regression with One-Pass Update
by: Wang, Jing, et al.
Published: (2025)
Private Sketches for Linear Regression
by: Das, Shrutimoy, et al.
Published: (2025)
Scaling Law for Stochastic Gradient Descent in Quadratically Parameterized Linear Regression
by: Ding, Shihong, et al.
Published: (2025)
Scaling Laws in Linear Regression: Compute, Parameters, and Data
by: Lin, Licong, et al.
Published: (2024)
Hidden State Differential Private Mini-Batch Block Coordinate Descent for Multi-convexity Optimization
by: Chen, Ding, et al.
Published: (2024)
A Simplified Analysis of SGD for Linear Regression with Weight Averaging
by: Meterez, Alexandru, et al.
Published: (2025)
Online Covariance Estimation in Averaged SGD: Improved Batch-Mean Rates and Minimax Optimality via Trajectory Regression
by: Ni, Yijin, et al.
Published: (2026)
Statistical Inference for Linear Functionals of Online SGD in High-dimensional Linear Regression
by: Agrawalla, Bhavya, et al.
Published: (2023)
Enhancing SignSGD: Small-Batch Convergence Analysis and a Hybrid Switching Strategy
by: Chen, Haoran, et al.
Published: (2026)
Understanding SGD with Exponential Moving Average: A Case Study in Linear Regression
by: Li, Xuheng, et al.
Published: (2025)
Scaling Federated Linear Contextual Bandits via Sketching
by: Yang, Hantao, et al.
Published: (2026)
Exploring Scaling Laws for Local SGD in Large Language Model Training
by: He, Qiaozhi, et al.
Published: (2024)
On the (Generative) Linear Sketching Problem
by: Yuan, Xinyu, et al.
Published: (2026)
Bayesian Data Sketching for Varying Coefficient Regression Models
by: Guhaniyogi, Rajarshi, et al.
Published: (2025)
Debiasing Mini-Batch Quadratics for Applications in Deep Learning
by: Tatzel, Lukas, et al.
Published: (2024)
Towards Scaling Laws for Symbolic Regression
by: Otte, David, et al.
Published: (2025)
From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD
by: Lampert, Christoph H., et al.
Published: (2026)
Mini-Batch Kernel $k$-means
by: Jourdan, Ben, et al.
Published: (2024)
Scaling Law for Language Models Training Considering Batch Size
by: Shuai, Xian, et al.
Published: (2024)
Mini-Batch Class Composition Bias in Link Prediction
by: Maguire, Kieran, et al.
Published: (2026)
Batch List-Decodable Linear Regression via Higher Moments
by: Diakonikolas, Ilias, et al.
Published: (2025)
From Continual Learning to SGD and Back: Better Rates for Continual Linear Models
by: Evron, Itay, et al.
Published: (2025)
Generalized Linear Bandits: Almost Optimal Regret with One-Pass Update
by: Zhang, Yu-Jie, et al.
Published: (2025)
SGD for Variational Inference: Tackling Unbounded Variance via Preconditioning and Dynamic Batching
by: Labarrière, Hippolyte, et al.
Published: (2026)
Can Microcanonical Langevin Dynamics Leverage Mini-Batch Gradient Noise?
by: Sommer, Emanuel, et al.
Published: (2026)
Probability Passing for Graph Neural Networks: Graph Structure and Representations Joint Learning
by: Wang, Ziyan, et al.
Published: (2024)
From Message-Passing to Linearized Graph Sequence Models
by: Mathys, Joël, et al.
Published: (2026)
Neural Scaling Laws for Deep Regression
by: Cadez, Tilen, et al.
Published: (2025)
Hierarchical Rectified Flow Matching with Mini-Batch Couplings
by: Zhang, Yichi, et al.
Published: (2025)
Optimal Growth Schedules for Batch Size and Learning Rate in SGD that Reduce SFO Complexity
by: Umeda, Hikaru, et al.
Published: (2025)
Batches Stabilize the Minimum Norm Risk in High Dimensional Overparameterized Linear Regression
by: Ioushua, Shahar Stein, et al.
Published: (2023)
Exact Mean Square Linear Stability Analysis for SGD
by: Mulayoff, Rotem, et al.
Published: (2023)
The Effect of Mini-Batch Noise on the Implicit Bias of Adam
by: Cattaneo, Matias D., et al.
Published: (2026)