:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	You, Haochen, Liu, Baojing
Formato:	Preprint
Publicado:	2025
Materias:	Machine Learning
Acceso en línea:	https://arxiv.org/abs/2510.01578
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Metric Embedding Initialization-Based Differentially Private and Explainable Graph Clustering
por: You, Haochen, et al.
Publicado: (2025)

MCIGLE: Multimodal Exemplar-Free Class-Incremental Graph Learning
por: You, Haochen, et al.
Publicado: (2025)

Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling
por: You, Haochen, et al.
Publicado: (2025)

Generalized Radius and Integrated Codebook Transforms for Differentiable Vector Quantization
por: You, Haochen, et al.
Publicado: (2026)

To Clip or not to Clip: the Dynamics of SGD with Gradient Clipping in High-Dimensions
por: Marshall, Noah, et al.
Publicado: (2024)

ReSSFormer: A Recursive Sparse Structured Transformer for Scalable and Long-Context Reasoning
por: You, Haochen, et al.
Publicado: (2025)

MOVER: Multimodal Optimal Transport with Volume-based Embedding Regularization
por: You, Haochen, et al.
Publicado: (2025)

Can Entry-Wise Clipping Give Spectral Control of Stochastic Gradients?
por: Song, Zitao, et al.
Publicado: (2026)

Gradient Clipping Beyond Vector Norms: A Spectral Approach for Matrix-Valued Parameters
por: Yukhimchuk, Alexander, et al.
Publicado: (2026)

MuCon: Clipped Muon Updates for LLM Training
por: Yi, Albert
Publicado: (2026)

Adaptive Gradient Clipping for Robust Federated Learning
por: Allouah, Youssef, et al.
Publicado: (2024)

AdaDPIGU: Differentially Private SGD with Adaptive Clipping and Importance-Based Gradient Updates for Deep Neural Networks
por: Zhang, Huiqi, et al.
Publicado: (2025)

When Gradient Clipping Becomes a Control Mechanism for Differential Privacy in Deep Learning
por: Partohaghighi, Mohammad, et al.
Publicado: (2026)

Robust Stochastic Optimization via Gradient Quantile Clipping
por: Merad, Ibrahim, et al.
Publicado: (2023)

Exploring the Impact of Parameter Update Magnitude on Forgetting and Generalization of Continual Learning
por: He, JinLi, et al.
Publicado: (2026)

From Gradient Clipping to Normalization for Heavy Tailed SGD
por: Hübler, Florian, et al.
Publicado: (2024)

Parameter-free Clipped Gradient Descent Meets Polyak
por: Takezawa, Yuki, et al.
Publicado: (2024)

Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations
por: Yao, Yuxuan, et al.
Publicado: (2026)

Clipped Gradient Methods for Nonsmooth Convex Optimization under Heavy-Tailed Noise: A Refined Analysis
por: Liu, Zijian
Publicado: (2025)

Beyond Softmax: A New Perspective on Gradient Bandits
por: Melo, Emerson, et al.
Publicado: (2025)

Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping
por: Liu, Zijian, et al.
Publicado: (2024)

Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks
por: Tucat, Matteo, et al.
Publicado: (2024)

Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
por: Wang, Qizhou, et al.
Publicado: (2025)

Generalized Gradient Norm Clipping & Non-Euclidean $(L_0,L_1)$-Smoothness
por: Pethick, Thomas, et al.
Publicado: (2025)

SoftAdaClip: A Smooth Clipping Strategy for Fair and Private Model Training
por: Soleymani, Dorsa, et al.
Publicado: (2025)

Revisiting Gradient Normalization and Clipping for Nonconvex SGD under Heavy-Tailed Noise: Necessity, Sufficiency, and Acceleration
por: Sun, Tao, et al.
Publicado: (2024)

AGGC: Adaptive Group Gradient Clipping for Stabilizing Large Language Model Training
por: Li, Zhiyuan, et al.
Publicado: (2026)

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
por: Su, Zhenpeng, et al.
Publicado: (2025)

GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters
por: Choudhary, Anand, et al.
Publicado: (2025)

HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
por: Zhao, Huaqin, et al.
Publicado: (2024)

Guided AbsoluteGrad: Magnitude of Gradients Matters to Explanation's Localization and Saliency
por: Huang, Jun, et al.
Publicado: (2024)

Revisit Micro-batch Clipping: Adaptive Data Pruning via Gradient Manipulation
por: Wang, Lun
Publicado: (2024)

ConfClip: Confidence-Weighted and Clipped Reward for Reinforcement Learning in LLMs
por: Zhang, Bonan, et al.
Publicado: (2025)

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
por: Huang, Nai-Chieh, et al.
Publicado: (2023)

A Bootstrap Perspective on Stochastic Gradient Descent
por: Lan, Hongjian, et al.
Publicado: (2025)

Gradient Transformer: Learning to Generate Updates for LLMs
por: Nguyen, Binh-Nguyen, et al.
Publicado: (2026)

CE-GPPO: Coordinating Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
por: Su, Zhenpeng, et al.
Publicado: (2025)

Support Basis: Fast Attention Beyond Bounded Entries
por: Aliakbarpour, Maryam, et al.
Publicado: (2025)

Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping
por: Pelikan, Martin, et al.
Publicado: (2023)

Beyond Yes or No: Predictive Compliance Monitoring Approaches for Quantifying the Magnitude of Compliance Violations
por: Chen, Qian, et al.
Publicado: (2025)