Saved in:
| Main Authors: | Pooladzandi, Omead, Li, Xi-Lin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.04553 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
by: Pooladzandi, Omead, et al.
Published: (2024)
by: Pooladzandi, Omead, et al.
Published: (2024)
Implicit Bias and Convergence of Matrix Stochastic Mirror Descent
by: Akhtiamov, Danil, et al.
Published: (2026)
by: Akhtiamov, Danil, et al.
Published: (2026)
PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics
by: Bhat, Sunay, et al.
Published: (2024)
by: Bhat, Sunay, et al.
Published: (2024)
Scaling Behavior of Discrete Diffusion Language Models
by: von Rütte, Dimitri, et al.
Published: (2025)
by: von Rütte, Dimitri, et al.
Published: (2025)
Stochastic Hessian Fittings with Lie Groups
by: Li, Xi-Lin
Published: (2024)
by: Li, Xi-Lin
Published: (2024)
Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
by: Scott, Mitchell, et al.
Published: (2025)
by: Scott, Mitchell, et al.
Published: (2025)
SketchySGD: Reliable Stochastic Optimization via Randomized Curvature Estimates
by: Frangella, Zachary, et al.
Published: (2022)
by: Frangella, Zachary, et al.
Published: (2022)
On the Superlinear Relationship between SGD Noise Covariance and Loss Landscape Curvature
by: Zhang, Yikuan, et al.
Published: (2026)
by: Zhang, Yikuan, et al.
Published: (2026)
Correlating Cross-Iteration Noise for DP-SGD using Model Curvature
by: Gu, Xin, et al.
Published: (2025)
by: Gu, Xin, et al.
Published: (2025)
Generative modeling of Sparse Approximate Inverse Preconditioners
by: Li, Mou, et al.
Published: (2024)
by: Li, Mou, et al.
Published: (2024)
Generalization and Optimization of SGD with Lookahead
by: Li, Kangcheng, et al.
Published: (2025)
by: Li, Kangcheng, et al.
Published: (2025)
Stability and Generalization for Decentralized Markov SGD
by: Wang, Jiahuan, et al.
Published: (2026)
by: Wang, Jiahuan, et al.
Published: (2026)
Improving Adaptive Moment Optimization via Preconditioner Diagonalization
by: Nguyen, Son, et al.
Published: (2025)
by: Nguyen, Son, et al.
Published: (2025)
Adaptive Preconditioners Trigger Loss Spikes in Adam
by: Bai, Zhiwei, et al.
Published: (2025)
by: Bai, Zhiwei, et al.
Published: (2025)
Diffusion Generative Modeling on Lie Group Representations
by: Bertolini, Marco, et al.
Published: (2025)
by: Bertolini, Marco, et al.
Published: (2025)
Learning Lie Group Generators from Trajectories
by: Hu, Lifan
Published: (2025)
by: Hu, Lifan
Published: (2025)
Unveiling High-Probability Generalization in Decentralized SGD
by: Wang, Jiahuan, et al.
Published: (2026)
by: Wang, Jiahuan, et al.
Published: (2026)
NeuraLSP: An Efficient and Rigorous Neural Left Singular Subspace Preconditioner for Conjugate Gradient Methods
by: Benanti, Alexander, et al.
Published: (2026)
by: Benanti, Alexander, et al.
Published: (2026)
Topology-aware Generalization of Decentralized SGD
by: Zhu, Tongtian, et al.
Published: (2022)
by: Zhu, Tongtian, et al.
Published: (2022)
Structured Preconditioners in Adaptive Optimization: A Unified Analysis
by: Xie, Shuo, et al.
Published: (2025)
by: Xie, Shuo, et al.
Published: (2025)
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
by: Si, Chongjie, et al.
Published: (2025)
by: Si, Chongjie, et al.
Published: (2025)
Benchmarking General-Purpose In-Context Learning
by: Wang, Fan, et al.
Published: (2024)
by: Wang, Fan, et al.
Published: (2024)
PCDP-SGD: Improving the Convergence of Differentially Private SGD via Projection in Advance
by: Sha, Haichao, et al.
Published: (2023)
by: Sha, Haichao, et al.
Published: (2023)
Tight Group-Level DP Guarantees for DP-SGD with Sampling via Mixture of Gaussians Mechanisms
by: Ganesh, Arun
Published: (2024)
by: Ganesh, Arun
Published: (2024)
A New Perspective on Shampoo's Preconditioner
by: Morwani, Depen, et al.
Published: (2024)
by: Morwani, Depen, et al.
Published: (2024)
Generalized Lie Symmetries in Physics-Informed Neural Operators
by: Wang, Amy Xiang, et al.
Published: (2025)
by: Wang, Amy Xiang, et al.
Published: (2025)
On the Limitations of General Purpose Domain Generalisation Methods
by: Gouk, Henry, et al.
Published: (2022)
by: Gouk, Henry, et al.
Published: (2022)
Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm
by: Bars, Batiste Le, et al.
Published: (2023)
by: Bars, Batiste Le, et al.
Published: (2023)
PUREVQ-GAN: Defending Data Poisoning Attacks through Vector-Quantized Bottlenecks
by: Branch, Alexander, et al.
Published: (2025)
by: Branch, Alexander, et al.
Published: (2025)
Matrix-free Neural Preconditioner for the Dirac Operator in Lattice Gauge Theory
by: Sun, Yixuan, et al.
Published: (2025)
by: Sun, Yixuan, et al.
Published: (2025)
Convex SGD: Generalization Without Early Stopping
by: Hendrickx, Julien, et al.
Published: (2024)
by: Hendrickx, Julien, et al.
Published: (2024)
Diagonalisation SGD: Fast & Convergent SGD for Non-Differentiable Models via Reparameterisation and Smoothing
by: Wagner, Dominik, et al.
Published: (2024)
by: Wagner, Dominik, et al.
Published: (2024)
Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups
by: Zhu, Yuchen, et al.
Published: (2024)
by: Zhu, Yuchen, et al.
Published: (2024)
Preconditioners for the Stochastic Training of Neural Fields
by: Chng, Shin-Fang, et al.
Published: (2024)
by: Chng, Shin-Fang, et al.
Published: (2024)
Sign-SGD via Parameter-Free Optimization
by: Medyakov, Daniil, et al.
Published: (2025)
by: Medyakov, Daniil, et al.
Published: (2025)
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
by: Yang, Zherui, et al.
Published: (2025)
by: Yang, Zherui, et al.
Published: (2025)
Flow Matching on Lie Groups
by: Sherry, Finn M., et al.
Published: (2025)
by: Sherry, Finn M., et al.
Published: (2025)
Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD
by: Peng, Ze, et al.
Published: (2026)
by: Peng, Ze, et al.
Published: (2026)
R-ODE: Ricci Curvature Tells When You Will be Informed
by: Sun, Li, et al.
Published: (2024)
by: Sun, Li, et al.
Published: (2024)
Dynamic Low-rank Approximation of Full-Matrix Preconditioner for Training Generalized Linear Models
by: Matveeva, Tatyana, et al.
Published: (2025)
by: Matveeva, Tatyana, et al.
Published: (2025)
Similar Items
-
PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
by: Pooladzandi, Omead, et al.
Published: (2024) -
Implicit Bias and Convergence of Matrix Stochastic Mirror Descent
by: Akhtiamov, Danil, et al.
Published: (2026) -
PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics
by: Bhat, Sunay, et al.
Published: (2024) -
Scaling Behavior of Discrete Diffusion Language Models
by: von Rütte, Dimitri, et al.
Published: (2025) -
Stochastic Hessian Fittings with Lie Groups
by: Li, Xi-Lin
Published: (2024)