Guardado en:
| Autores principales: | You, Haochen, Liu, Baojing |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2510.01578 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Metric Embedding Initialization-Based Differentially Private and Explainable Graph Clustering
por: You, Haochen, et al.
Publicado: (2025)
por: You, Haochen, et al.
Publicado: (2025)
MCIGLE: Multimodal Exemplar-Free Class-Incremental Graph Learning
por: You, Haochen, et al.
Publicado: (2025)
por: You, Haochen, et al.
Publicado: (2025)
Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling
por: You, Haochen, et al.
Publicado: (2025)
por: You, Haochen, et al.
Publicado: (2025)
Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
por: You, Haochen, et al.
Publicado: (2026)
por: You, Haochen, et al.
Publicado: (2026)
To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
por: Marshall, Noah, et al.
Publicado: (2024)
por: Marshall, Noah, et al.
Publicado: (2024)
ReSSFormer: A Recursive Sparse Structured Transformer for Scalable and Long-Context Reasoning
por: You, Haochen, et al.
Publicado: (2025)
por: You, Haochen, et al.
Publicado: (2025)
MOVER: Multimodal Optimal Transport with Volume-based Embedding Regularization
por: You, Haochen, et al.
Publicado: (2025)
por: You, Haochen, et al.
Publicado: (2025)
Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?
por: Song, Zitao, et al.
Publicado: (2026)
por: Song, Zitao, et al.
Publicado: (2026)
Gradient Clipping Beyond Vector Norms: A Spectral Approach for Matrix-Valued Parameters
por: Yukhimchuk, Alexander, et al.
Publicado: (2026)
por: Yukhimchuk, Alexander, et al.
Publicado: (2026)
MuCon: Clipped Muon Updates for LLM Training
por: Yi, Albert
Publicado: (2026)
por: Yi, Albert
Publicado: (2026)
Adaptive Gradient Clipping for Robust Federated Learning
por: Allouah, Youssef, et al.
Publicado: (2024)
por: Allouah, Youssef, et al.
Publicado: (2024)
AdaDPIGU: Differentially Private SGD with Adaptive Clipping and Importance-Based Gradient Updates for Deep Neural Networks
por: Zhang, Huiqi, et al.
Publicado: (2025)
por: Zhang, Huiqi, et al.
Publicado: (2025)
When Gradient Clipping Becomes a Control Mechanism for Differential Privacy in Deep Learning
por: Partohaghighi, Mohammad, et al.
Publicado: (2026)
por: Partohaghighi, Mohammad, et al.
Publicado: (2026)
Robust Stochastic Optimization via Gradient Quantile Clipping
por: Merad, Ibrahim, et al.
Publicado: (2023)
por: Merad, Ibrahim, et al.
Publicado: (2023)
Exploring the Impact of Parameter Update Magnitude on Forgetting and Generalization of Continual Learning
por: He, JinLi, et al.
Publicado: (2026)
por: He, JinLi, et al.
Publicado: (2026)
From Gradient Clipping to Normalization for Heavy Tailed SGD
por: Hübler, Florian, et al.
Publicado: (2024)
por: Hübler, Florian, et al.
Publicado: (2024)
Parameter-free Clipped Gradient Descent Meets Polyak
por: Takezawa, Yuki, et al.
Publicado: (2024)
por: Takezawa, Yuki, et al.
Publicado: (2024)
Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations
por: Yao, Yuxuan, et al.
Publicado: (2026)
por: Yao, Yuxuan, et al.
Publicado: (2026)
Clipped Gradient Methods for Nonsmooth Convex Optimization under Heavy-Tailed Noise: A Refined Analysis
por: Liu, Zijian
Publicado: (2025)
por: Liu, Zijian
Publicado: (2025)
Beyond Softmax: A New Perspective on Gradient Bandits
por: Melo, Emerson, et al.
Publicado: (2025)
por: Melo, Emerson, et al.
Publicado: (2025)
Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping
por: Liu, Zijian, et al.
Publicado: (2024)
por: Liu, Zijian, et al.
Publicado: (2024)
Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks
por: Tucat, Matteo, et al.
Publicado: (2024)
por: Tucat, Matteo, et al.
Publicado: (2024)
Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
por: Wang, Qizhou, et al.
Publicado: (2025)
por: Wang, Qizhou, et al.
Publicado: (2025)
Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness
por: Pethick, Thomas, et al.
Publicado: (2025)
por: Pethick, Thomas, et al.
Publicado: (2025)
SoftAdaClip: A Smooth Clipping Strategy for Fair and Private Model Training
por: Soleymani, Dorsa, et al.
Publicado: (2025)
por: Soleymani, Dorsa, et al.
Publicado: (2025)
Revisiting Gradient Normalization and Clipping for Nonconvex SGD under Heavy-Tailed Noise: Necessity, Sufficiency, and Acceleration
por: Sun, Tao, et al.
Publicado: (2024)
por: Sun, Tao, et al.
Publicado: (2024)
AGGC: Adaptive Group Gradient Clipping for Stabilizing Large Language Model Training
por: Li, Zhiyuan, et al.
Publicado: (2026)
por: Li, Zhiyuan, et al.
Publicado: (2026)
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
por: Su, Zhenpeng, et al.
Publicado: (2025)
por: Su, Zhenpeng, et al.
Publicado: (2025)
GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters
por: Choudhary, Anand, et al.
Publicado: (2025)
por: Choudhary, Anand, et al.
Publicado: (2025)
HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
por: Zhao, Huaqin, et al.
Publicado: (2024)
por: Zhao, Huaqin, et al.
Publicado: (2024)
Guided AbsoluteGrad: Magnitude of Gradients Matters to Explanation's Localization and Saliency
por: Huang, Jun, et al.
Publicado: (2024)
por: Huang, Jun, et al.
Publicado: (2024)
Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation
por: Wang, Lun
Publicado: (2024)
por: Wang, Lun
Publicado: (2024)
ConfClip: Confidence-Weighted and Clipped Reward for Reinforcement Learning in LLMs
por: Zhang, Bonan, et al.
Publicado: (2025)
por: Zhang, Bonan, et al.
Publicado: (2025)
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
por: Huang, Nai-Chieh, et al.
Publicado: (2023)
por: Huang, Nai-Chieh, et al.
Publicado: (2023)
A Bootstrap Perspective on Stochastic Gradient Descent
por: Lan, Hongjian, et al.
Publicado: (2025)
por: Lan, Hongjian, et al.
Publicado: (2025)
Gradient Transformer: Learning to Generate Updates for LLMs
por: Nguyen, Binh-Nguyen, et al.
Publicado: (2026)
por: Nguyen, Binh-Nguyen, et al.
Publicado: (2026)
CE-GPPO: Coordinating Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
por: Su, Zhenpeng, et al.
Publicado: (2025)
por: Su, Zhenpeng, et al.
Publicado: (2025)
Support Basis: Fast Attention Beyond Bounded Entries
por: Aliakbarpour, Maryam, et al.
Publicado: (2025)
por: Aliakbarpour, Maryam, et al.
Publicado: (2025)
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping
por: Pelikan, Martin, et al.
Publicado: (2023)
por: Pelikan, Martin, et al.
Publicado: (2023)
Beyond Yes or No: Predictive Compliance Monitoring Approaches for Quantifying the Magnitude of Compliance Violations
por: Chen, Qian, et al.
Publicado: (2025)
por: Chen, Qian, et al.
Publicado: (2025)
Ejemplares similares
-
Metric Embedding Initialization-Based Differentially Private and Explainable Graph Clustering
por: You, Haochen, et al.
Publicado: (2025) -
MCIGLE: Multimodal Exemplar-Free Class-Incremental Graph Learning
por: You, Haochen, et al.
Publicado: (2025) -
Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling
por: You, Haochen, et al.
Publicado: (2025) -
Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
por: You, Haochen, et al.
Publicado: (2026) -
To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
por: Marshall, Noah, et al.
Publicado: (2024)