Library Catalog

Bibliographic Details
Main Authors:	AlQuabeh, Hilal; de Vazelhes, William; Gu, Bin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.01146

Similar Items

Uncovering the Spectral Bias in Diagonal State Space Models
by: Solozabal, Ruben, et al.
Published: (2025)

Constrained Adversarial Perturbation
by: Nishad, Virendra, et al.
Published: (2025)

Mechanistic Insights into Grokking from the Embedding Layer
by: AlquBoj, H. V., et al.
Published: (2025)

New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions
by: Yuan, Xinzhe, et al.
Published: (2026)

Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity
by: de Vazelhes, William, et al.
Published: (2022)

Optimization over Sparse Support-Preserving Sets: Two-Step Projection with Global Optimality Guarantees
by: de Vazelhes, William, et al.
Published: (2025)

Emergence of Primacy and Recency Effect in Mamba: A Mechanistic Point of View
by: Airlangga, Muhammad Cendekia, et al.
Published: (2025)

Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
by: Gao, Chengqian, et al.
Published: (2024)

Iterative Regularization with k-support Norm: An Important Complement to Sparse Recovery
by: de Vazelhes, William, et al.
Published: (2023)

The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)

Learning Curves of Stochastic Gradient Descent in Kernel Regression
by: Zhang, Haihan, et al.
Published: (2025)

Learning Associative Memories with Gradient Descent
by: Cabannes, Vivien, et al.
Published: (2024)

Partially Lazy Gradient Descent for Smoothed Online Learning
by: Mhaisen, Naram, et al.
Published: (2026)

Adaptive Kernel Selection for Stein Variational Gradient Descent
by: Melcher, Moritz, et al.
Published: (2025)

Weighted Averaged Stochastic Gradient Descent: Asymptotic Normality and Optimality
by: Wei, Ziyang, et al.
Published: (2023)

Geometrically Inspired Kernel Machines for Collaborative Learning Beyond Gradient Descent
by: Kumar, Mohit, et al.
Published: (2024)

Natural Gradient Descent for Online Continual Learning
by: Khawand, Joe, et al.
Published: (2026)

Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning
by: Wu, Liang, et al.
Published: (2025)

Harmonized Gradient Descent for Class Imbalanced Data Stream Online Learning
by: Zhou, Han, et al.
Published: (2025)

Learning Operators by Regularized Stochastic Gradient Descent with Operator-valued Kernels
by: Yang, Jia-Qi, et al.
Published: (2025)

Quantum Algorithm for Sparse Online Learning with Truncated Gradient Descent
by: Lim, Debbie, et al.
Published: (2024)

Limit Theorems for Stochastic Gradient Descent with Infinite Variance
by: Blanchet, Jose, et al.
Published: (2024)

The Power of Random Features and the Limits of Distribution-Free Gradient Descent
by: Karchmer, Ari, et al.
Published: (2025)

Central Limit Theorems for Stochastic Gradient Descent Quantile Estimators
by: Wei, Ziyang, et al.
Published: (2025)

Comparing Federated Stochastic Gradient Descent and Federated Averaging for Predicting Hospital Length of Stay
by: Balik, Mehmet Yigit
Published: (2024)

Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks
by: Li, Binghui, et al.
Published: (2024)

Functional Central Limit Theorem for Stochastic Gradient Descent
by: Flamand, Kessang, et al.
Published: (2026)

Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge
by: Han, Dong-Sig, et al.
Published: (2025)

Quantum Natural Stochastic Pairwise Coordinate Descent
by: Sohail, Mohammad Aamir, et al.
Published: (2024)

Metric Learning from Limited Pairwise Preference Comparisons
by: Wang, Zhi, et al.
Published: (2024)

Beyond Cross-Validation: Adaptive Parameter Selection for Kernel-Based Gradient Descents
by: Liu, Xiaotong, et al.
Published: (2026)

Unraveling the Gradient Descent Dynamics of Transformers
by: Song, Bingqing, et al.
Published: (2024)

Curl Descent: Non-Gradient Learning Dynamics with Sign-Diverse Plasticity
by: Ninou, Hugo, et al.
Published: (2025)

Truncated Kernel Stochastic Gradient Descent on Spheres
by: Bai, Jinhui, et al.
Published: (2024)

Distributed Gradient Descent for Functional Learning
by: Yu, Zhan, et al.
Published: (2023)

Feedback Descent: Open-Ended Text Optimization via Pairwise Comparison
by: Lee, Yoonho, et al.
Published: (2025)

Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression
by: Jiang, Jiarui, et al.
Published: (2025)

Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent
by: Chang, Xiangyu, et al.
Published: (2022)

The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization
by: Daskalakis, Constantinos, et al.
Published: (2018)

Characterizing Dynamical Stability of Stochastic Gradient Descent in Overparameterized Learning
by: Chemnitz, Dennis, et al.
Published: (2024)