| Main Authors: | Kong, Boao, Jia, Weichen, Zhang, Engao, Li, Guohong, Dong, Yonghan, Wang, Yao, Wang, Yaoyuan, Peng, Yunke, Yuan, Kun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2606.00539 |
Similar Items
BROS: Bias-Corrected Randomized Subspaces for Memory-Efficient Single-Loop Bilevel Optimization
by: Zhang, Hengrui, et al.
Published: (2026)
On the Convergence of Stochastic Gradient Descent with Perturbed Forward-Backward Passes
by: Kong, Boao, et al.
Published: (2026)
Clapping: Removing Per-sample Storage for Pipeline Parallel Distributed Optimization with Communication Compression
by: Kong, Boao, et al.
Published: (2025)
SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization
by: Zhu, Shuchen, et al.
Published: (2024)
Decentralized Bilevel Optimization: A Perspective from Transient Iteration Complexity
by: Kong, Boao, et al.
Published: (2024)
Greedy Low-Rank Gradient Compression for Distributed Learning with Convergence Guarantees
by: Chen, Chuyan, et al.
Published: (2025)
Stabilizing Rate of Stochastic Control Systems
by: Jia, Hui, et al.
Published: (2025)
SUDA-Muon: Structural Design Principles and Boundaries for Fully Decentralized Muon
by: Zhang, Hengrui, et al.
Published: (2026)
Compressing Large Language Models using Low Rank and Low Precision Decomposition
by: Saha, Rajarshi, et al.
Published: (2024)
An Overview of Low-Rank Structures in the Training and Adaptation of Large Models
by: Balzano, Laura, et al.
Published: (2025)
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
by: Kong, Boao, et al.
Published: (2025)
Learning to Control Stabilization in Column Generation
by: Wang, Olivia, et al.
Published: (2026)
Subspace Optimization for Large Language Models with Convergence Guarantees
by: He, Yutong, et al.
Published: (2024)
GradPower: Powering Gradients for Faster Language Model Pre-Training
by: Wang, Jinbo, et al.
Published: (2025)
Nonlinear Optimal Guidance for Impact Time Control with Field-of-View Constraint
by: Lu, Fangmin, et al.
Published: (2025)
Optimal Complexity in Byzantine-Robust Distributed Stochastic Optimization with Data Heterogeneity
by: Shi, Qiankun, et al.
Published: (2025)
Safety on the Fly: Constructing Robust Safety Filters via Policy Control Barrier Functions at Runtime
by: Knoedler, Luzia, et al.
Published: (2024)
A Multi-scale Perimeter Control and Route Guidance System for Large-scale Road Networks
by: Peng, Xianyue, et al.
Published: (2025)
From PowerSGD to PowerSGD+: Low-Rank Gradient Compression for Distributed Optimization with Convergence Guarantees
by: Xie, Shengping, et al.
Published: (2025)
Stochastic Model Predictive Control for Sub-Gaussian Noise
by: Ao, Yunke, et al.
Published: (2025)
A Proximal DC Algorithm for Sample Average Approximation of Chance Constrained Programming
by: Wang, Peng, et al.
Published: (2023)
Empirical Asymptotic Runtime Analysis of Linear Programming Algorithms
by: Rothberg, Edward
Published: (2026)
A Multi-objective Sequential Quadratic Programming Algorithm Based on Low-order Smooth Penalty Function
by: Kong, Zanyang
Published: (2025)
Boundary Stabilization for the Rayleigh Beam System under Event-triggered Controls
by: Wang, Siwen, et al.
Published: (2026)
Reinforcement Learning for Distributed Transient Frequency Control with Stability and Safety Guarantees
by: Yuan, Zhenyi, et al.
Published: (2022)
Initial Error Tolerant Distributed Mean Field Control under Partial and Discrete Information
by: Jin, Yuxin, et al.
Published: (2025)
LoCo: Low-Bit Communication Adaptor for Large-scale Model Training
by: Xie, Xingyu, et al.
Published: (2024)
Frictionless Hamiltonian Descent and Coordinate Hamiltonian Descent for Strongly Convex Quadratic Problems
by: Wang, Jun-Kun
Published: (2024)
Latent Representations for Control Design with Provable Stability and Safety Guarantees
by: Lutkus, Paul, et al.
Published: (2025)
A Precise Characterization of SGD Stability Using Loss Surface Geometry
by: Dexter, Gregory, et al.
Published: (2024)
Stability, Contraction, and Controllers for Affine Systems
by: Wieringa, L. P., et al.
Published: (2026)
Exact Controllability and Stabilization of the Wave Equation
by: Zuazua, Enrique
Published: (2024)
Analysis of Stability and Performance of Economic Model Predictive Control with State-Independent Costs
by: Arastou, Alireza, et al.
Published: (2026)
MARS: Unleashing the Power of Variance Reduction for Training Large Models
by: Yuan, Huizhuo, et al.
Published: (2024)
Perturbation-Tolerant Structural Controllability for Linear Systems
by: Zhang, Yuan, et al.
Published: (2021)
Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks
by: Xu, Xianliang, et al.
Published: (2024)
Momentum Stability and Adaptive Control in Stochastic Reconfiguration
by: Wang, Yuyang, et al.
Published: (2026)
Does Weight Decay Enhance Training Stability?
by: Saether, Marius, et al.
Published: (2026)
On the Exponential Stability of Koopman Model Predictive Control
by: Shang, Xu, et al.
Published: (2025)
Low-Order Explicit Hessian Imitation Method for Large-Scale Supervised Machine Learning
by: Zhu, Yunlang, et al.
Published: (2026)