Saved in:
| Main Authors: | AlQuabeh, Hilal, de Vazelhes, William, Gu, Bin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.01146 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Uncovering the Spectral Bias in Diagonal State Space Models
by: Solozabal, Ruben, et al.
Published: (2025)
by: Solozabal, Ruben, et al.
Published: (2025)
Constrained Adversarial Perturbation
by: Nishad, Virendra, et al.
Published: (2025)
by: Nishad, Virendra, et al.
Published: (2025)
Mechanistic Insights into Grokking from the Embedding Layer
by: AlquBoj, H. V., et al.
Published: (2025)
by: AlquBoj, H. V., et al.
Published: (2025)
New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions
by: Yuan, Xinzhe, et al.
Published: (2026)
by: Yuan, Xinzhe, et al.
Published: (2026)
Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity
by: de Vazelhes, William, et al.
Published: (2022)
by: de Vazelhes, William, et al.
Published: (2022)
Optimization over Sparse Support-Preserving Sets: Two-Step Projection with Global Optimality Guarantees
by: de Vazelhes, William, et al.
Published: (2025)
by: de Vazelhes, William, et al.
Published: (2025)
Emergence of Primacy and Recency Effect in Mamba: A Mechanistic Point of View
by: Airlangga, Muhammad Cendekia, et al.
Published: (2025)
by: Airlangga, Muhammad Cendekia, et al.
Published: (2025)
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
by: Gao, Chengqian, et al.
Published: (2024)
by: Gao, Chengqian, et al.
Published: (2024)
Iterative Regularization with k-support Norm: An Important Complement to Sparse Recovery
by: de Vazelhes, William, et al.
Published: (2023)
by: de Vazelhes, William, et al.
Published: (2023)
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)
by: El-Shangiti, Ahmed Oumar, et al.
Published: (2024)
Learning Curves of Stochastic Gradient Descent in Kernel Regression
by: Zhang, Haihan, et al.
Published: (2025)
by: Zhang, Haihan, et al.
Published: (2025)
Learning Associative Memories with Gradient Descent
by: Cabannes, Vivien, et al.
Published: (2024)
by: Cabannes, Vivien, et al.
Published: (2024)
Partially Lazy Gradient Descent for Smoothed Online Learning
by: Mhaisen, Naram, et al.
Published: (2026)
by: Mhaisen, Naram, et al.
Published: (2026)
Adaptive Kernel Selection for Stein Variational Gradient Descent
by: Melcher, Moritz, et al.
Published: (2025)
by: Melcher, Moritz, et al.
Published: (2025)
Weighted Averaged Stochastic Gradient Descent: Asymptotic Normality and Optimality
by: Wei, Ziyang, et al.
Published: (2023)
by: Wei, Ziyang, et al.
Published: (2023)
Geometrically Inspired Kernel Machines for Collaborative Learning Beyond Gradient Descent
by: Kumar, Mohit, et al.
Published: (2024)
by: Kumar, Mohit, et al.
Published: (2024)
Natural Gradient Descent for Online Continual Learning
by: Khawand, Joe, et al.
Published: (2026)
by: Khawand, Joe, et al.
Published: (2026)
Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning
by: Wu, Liang, et al.
Published: (2025)
by: Wu, Liang, et al.
Published: (2025)
Harmonized Gradient Descent for Class Imbalanced Data Stream Online Learning
by: Zhou, Han, et al.
Published: (2025)
by: Zhou, Han, et al.
Published: (2025)
Learning Operators by Regularized Stochastic Gradient Descent with Operator-valued Kernels
by: Yang, Jia-Qi, et al.
Published: (2025)
by: Yang, Jia-Qi, et al.
Published: (2025)
Quantum Algorithm for Sparse Online Learning with Truncated Gradient Descent
by: Lim, Debbie, et al.
Published: (2024)
by: Lim, Debbie, et al.
Published: (2024)
Limit Theorems for Stochastic Gradient Descent with Infinite Variance
by: Blanchet, Jose, et al.
Published: (2024)
by: Blanchet, Jose, et al.
Published: (2024)
The Power of Random Features and the Limits of Distribution-Free Gradient Descent
by: Karchmer, Ari, et al.
Published: (2025)
by: Karchmer, Ari, et al.
Published: (2025)
Central Limit Theorems for Stochastic Gradient Descent Quantile Estimators
by: Wei, Ziyang, et al.
Published: (2025)
by: Wei, Ziyang, et al.
Published: (2025)
Comparing Federated Stochastic Gradient Descent and Federated Averaging for Predicting Hospital Length of Stay
by: Balik, Mehmet Yigit
Published: (2024)
by: Balik, Mehmet Yigit
Published: (2024)
Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networks
by: Li, Binghui, et al.
Published: (2024)
by: Li, Binghui, et al.
Published: (2024)
Functional Central Limit Theorem for Stochastic Gradient Descent
by: Flamand, Kessang, et al.
Published: (2026)
by: Flamand, Kessang, et al.
Published: (2026)
Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge
by: Han, Dong-Sig, et al.
Published: (2025)
by: Han, Dong-Sig, et al.
Published: (2025)
Quantum Natural Stochastic Pairwise Coordinate Descent
by: Sohail, Mohammad Aamir, et al.
Published: (2024)
by: Sohail, Mohammad Aamir, et al.
Published: (2024)
Metric Learning from Limited Pairwise Preference Comparisons
by: Wang, Zhi, et al.
Published: (2024)
by: Wang, Zhi, et al.
Published: (2024)
Beyond Cross-Validation: Adaptive Parameter Selection for Kernel-Based Gradient Descents
by: Liu, Xiaotong, et al.
Published: (2026)
by: Liu, Xiaotong, et al.
Published: (2026)
Unraveling the Gradient Descent Dynamics of Transformers
by: Song, Bingqing, et al.
Published: (2024)
by: Song, Bingqing, et al.
Published: (2024)
Curl Descent: Non-Gradient Learning Dynamics with Sign-Diverse Plasticity
by: Ninou, Hugo, et al.
Published: (2025)
by: Ninou, Hugo, et al.
Published: (2025)
Truncated Kernel Stochastic Gradient Descent on Spheres
by: Bai, Jinhui, et al.
Published: (2024)
by: Bai, Jinhui, et al.
Published: (2024)
Distributed Gradient Descent for Functional Learning
by: Yu, Zhan, et al.
Published: (2023)
by: Yu, Zhan, et al.
Published: (2023)
Feedback Descent: Open-Ended Text Optimization via Pairwise Comparison
by: Lee, Yoonho, et al.
Published: (2025)
by: Lee, Yoonho, et al.
Published: (2025)
Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression
by: Jiang, Jiarui, et al.
Published: (2025)
by: Jiang, Jiarui, et al.
Published: (2025)
Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent
by: Chang, Xiangyu, et al.
Published: (2022)
by: Chang, Xiangyu, et al.
Published: (2022)
The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization
by: Daskalakis, Constantinos, et al.
Published: (2018)
by: Daskalakis, Constantinos, et al.
Published: (2018)
Characterizing Dynamical Stability of Stochastic Gradient Descent in Overparameterized Learning
by: Chemnitz, Dennis, et al.
Published: (2024)
by: Chemnitz, Dennis, et al.
Published: (2024)
Similar Items
-
Uncovering the Spectral Bias in Diagonal State Space Models
by: Solozabal, Ruben, et al.
Published: (2025) -
Constrained Adversarial Perturbation
by: Nishad, Virendra, et al.
Published: (2025) -
Mechanistic Insights into Grokking from the Embedding Layer
by: AlquBoj, H. V., et al.
Published: (2025) -
New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions
by: Yuan, Xinzhe, et al.
Published: (2026) -
Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity
by: de Vazelhes, William, et al.
Published: (2022)