Guardado en:
| Autores principales: | Compagnoni, Enea Monzio, Islamov, Rustem, Proske, Frank Norbert, Lucchi, Aurelien, Orvieto, Antonio, Gorbunov, Eduard |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2506.00181 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
por: Compagnoni, Enea Monzio, et al.
Publicado: (2025)
por: Compagnoni, Enea Monzio, et al.
Publicado: (2025)
Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024)
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024)
Adaptive Methods Are Preferable in High Privacy Settings: An SDE Perspective
por: Compagnoni, Enea Monzio, et al.
Publicado: (2026)
por: Compagnoni, Enea Monzio, et al.
Publicado: (2026)
SDEs for Minimax Optimization
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024)
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024)
On the Role of Batch Size in Stochastic Conditional Gradient Methods
por: Islamov, Rustem, et al.
Publicado: (2026)
por: Islamov, Rustem, et al.
Publicado: (2026)
Loss Landscape Characterization of Neural Networks without Over-Parametrization
por: Islamov, Rustem, et al.
Publicado: (2024)
por: Islamov, Rustem, et al.
Publicado: (2024)
Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size
por: Islamov, Rustem, et al.
Publicado: (2025)
por: Islamov, Rustem, et al.
Publicado: (2025)
Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy
por: Islamov, Rustem, et al.
Publicado: (2025)
por: Islamov, Rustem, et al.
Publicado: (2025)
Byzantine-Robust and Differentially Private Federated Optimization under Weaker Assumptions
por: Islamov, Rustem, et al.
Publicado: (2026)
por: Islamov, Rustem, et al.
Publicado: (2026)
Why Do We Need Warm-up? A Theoretical Perspective
por: Alimisis, Foivos, et al.
Publicado: (2025)
por: Alimisis, Foivos, et al.
Publicado: (2025)
Byzantine-Robust Optimization under $(L_0, L_1)$-Smoothness
por: Bolatov, Arman, et al.
Publicado: (2026)
por: Bolatov, Arman, et al.
Publicado: (2026)
Convergence of Clipped-SGD for Convex $(L_0,L_1)$-Smooth Optimization with Heavy-Tailed Noise
por: Chezhegov, Savelii, et al.
Publicado: (2025)
por: Chezhegov, Savelii, et al.
Publicado: (2025)
Error Feedback under $(L_0,L_1)$-Smoothness: Normalization and Momentum
por: Khirirat, Sarit, et al.
Publicado: (2024)
por: Khirirat, Sarit, et al.
Publicado: (2024)
Linear Convergence Rate in Convex Setup is Possible! Gradient Descent Method Variants under $(L_0,L_1)$-Smoothness
por: Lobanov, Aleksandr, et al.
Publicado: (2024)
por: Lobanov, Aleksandr, et al.
Publicado: (2024)
Methods for Convex $(L_0,L_1)$-Smooth Optimization: Clipping, Acceleration, and Adaptivity
por: Gorbunov, Eduard, et al.
Publicado: (2024)
por: Gorbunov, Eduard, et al.
Publicado: (2024)
Towards Faster Decentralized Stochastic Optimization with Communication Compression
por: Islamov, Rustem, et al.
Publicado: (2024)
por: Islamov, Rustem, et al.
Publicado: (2024)
Small Noise Perturbations in Multidimensional Case
por: Pilipenko, Andrey, et al.
Publicado: (2021)
por: Pilipenko, Andrey, et al.
Publicado: (2021)
Safe-EF: Error Feedback for Nonsmooth Constrained Optimization
por: Islamov, Rustem, et al.
Publicado: (2025)
por: Islamov, Rustem, et al.
Publicado: (2025)
High Probability Complexity Bounds for Non-Smooth Stochastic Optimization with Heavy-Tailed Noise
por: Gorbunov, Eduard, et al.
Publicado: (2021)
por: Gorbunov, Eduard, et al.
Publicado: (2021)
Smoothness of solutions of hyperbolic stochastic partial differential equations with $L^{\infty}$-vector fields
por: Bogso, Antoine-Marie, et al.
Publicado: (2022)
por: Bogso, Antoine-Marie, et al.
Publicado: (2022)
Non-Euclidean Gradient Descent Operates at the Edge of Stability
por: Islamov, Rustem, et al.
Publicado: (2026)
por: Islamov, Rustem, et al.
Publicado: (2026)
Noise-Induced Equalization in quantum learning models
por: Scala, Francesco, et al.
Publicado: (2025)
por: Scala, Francesco, et al.
Publicado: (2025)
Last Iterate Convergence of AdaGrad-Norm for Convex Non-Smooth Optimization
por: Preobrazhenskaia, Margarita, et al.
Publicado: (2026)
por: Preobrazhenskaia, Margarita, et al.
Publicado: (2026)
An Adaptive Stochastic Gradient Method with Non-negative Gauss-Newton Stepsizes
por: Orvieto, Antonio, et al.
Publicado: (2024)
por: Orvieto, Antonio, et al.
Publicado: (2024)
Breaking the Heavy-Tailed Noise Barrier in Stochastic Optimization Problems
por: Puchkin, Nikita, et al.
Publicado: (2023)
por: Puchkin, Nikita, et al.
Publicado: (2023)
The Effects of the Gravitational Coupling Variation on the Local $H_0$ Estimation
por: Romano, Antonio Enea
Publicado: (2024)
por: Romano, Antonio Enea
Publicado: (2024)
A Theoretical Analysis of the Learning Dynamics under Class Imbalance
por: Francazi, Emanuele, et al.
Publicado: (2022)
por: Francazi, Emanuele, et al.
Publicado: (2022)
On the Analysis of a Singular Stochastic Volterra Differential Equation driven by a Wiener Noise
por: Coffie, Emmanuel, et al.
Publicado: (2025)
por: Coffie, Emmanuel, et al.
Publicado: (2025)
Median Clipping for Zeroth-order Non-Smooth Convex Optimization and Multi-Armed Bandit Problem with Heavy-tailed Symmetric Noise
por: Kornilov, Nikita, et al.
Publicado: (2024)
por: Kornilov, Nikita, et al.
Publicado: (2024)
Communication Compression for Byzantine Robust Learning: New Efficient Algorithms and Improved Rates
por: Rammal, Ahmad, et al.
Publicado: (2023)
por: Rammal, Ahmad, et al.
Publicado: (2023)
Changing the properties of Hf$_{0.5}$Zr$_{0.5}$O$_{2}$ during cyclic repolarization of ferroelectric capacitors with different electrode materials
por: Zalyalov, Timur M., et al.
Publicado: (2022)
por: Zalyalov, Timur M., et al.
Publicado: (2022)
Dynamics of SGD with Stochastic Polyak Stepsizes: Truly Adaptive Variants and Convergence to Exact Solution
por: Orvieto, Antonio, et al.
Publicado: (2022)
por: Orvieto, Antonio, et al.
Publicado: (2022)
Cubic regularized subspace Newton for non-convex optimization
por: Zhao, Jim, et al.
Publicado: (2024)
por: Zhao, Jim, et al.
Publicado: (2024)
Adam Simplified: Bias Correction Debunked
por: Laing, Sam, et al.
Publicado: (2025)
por: Laing, Sam, et al.
Publicado: (2025)
Revisiting associative recall in modern recurrent models
por: Okpekpe, Destiny, et al.
Publicado: (2025)
por: Okpekpe, Destiny, et al.
Publicado: (2025)
Recurrent neural networks: vanishing and exploding gradients are not the end of the story
por: Zucchet, Nicolas, et al.
Publicado: (2024)
por: Zucchet, Nicolas, et al.
Publicado: (2024)
Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness
por: Pethick, Thomas, et al.
Publicado: (2025)
por: Pethick, Thomas, et al.
Publicado: (2025)
Methods with Local Steps and Random Reshuffling for Generally Smooth Non-Convex Federated Optimization
por: Demidovich, Yury, et al.
Publicado: (2024)
por: Demidovich, Yury, et al.
Publicado: (2024)
Unpacking Softmax: How Temperature Drives Representation Collapse, Compression, and Generalization
por: Masarczyk, Wojciech, et al.
Publicado: (2025)
por: Masarczyk, Wojciech, et al.
Publicado: (2025)
Near-Optimal Convergence of Accelerated Gradient Methods under Generalized and $(L_0, L_1)$-Smoothness
por: Tyurin, Alexander
Publicado: (2025)
por: Tyurin, Alexander
Publicado: (2025)
Ejemplares similares
-
Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
por: Compagnoni, Enea Monzio, et al.
Publicado: (2025) -
Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024) -
Adaptive Methods Are Preferable in High Privacy Settings: An SDE Perspective
por: Compagnoni, Enea Monzio, et al.
Publicado: (2026) -
SDEs for Minimax Optimization
por: Compagnoni, Enea Monzio, et al.
Publicado: (2024) -
On the Role of Batch Size in Stochastic Conditional Gradient Methods
por: Islamov, Rustem, et al.
Publicado: (2026)