Saved in:
| Main Authors: | Gupta, Kanan, Siegel, Jonathan W., Wojtowytsch, Stephan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2302.05515 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Nesterov acceleration in benignly non-convex landscapes
by: Gupta, Kanan, et al.
Published: (2024)
by: Gupta, Kanan, et al.
Published: (2024)
Input Convex Kolmogorov Arnold Networks
by: Deschatre, Thomas, et al.
Published: (2025)
by: Deschatre, Thomas, et al.
Published: (2025)
Generative modeling of conditional probability distributions on the level-sets of collective variables
by: Akhyar, Fatima-Zahrae, et al.
Published: (2025)
by: Akhyar, Fatima-Zahrae, et al.
Published: (2025)
Exact Sequence Interpolation with Transformers
by: Alcalde, Albert, et al.
Published: (2025)
by: Alcalde, Albert, et al.
Published: (2025)
Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching
by: Denkert, Robert, et al.
Published: (2024)
by: Denkert, Robert, et al.
Published: (2024)
Explicit neural network classifiers for non-separable data
by: Ewald, Patrícia Muñoz
Published: (2025)
by: Ewald, Patrícia Muñoz
Published: (2025)
Diagonal Linear Networks and the Lasso Regularization Path
by: Berthier, Raphaël
Published: (2025)
by: Berthier, Raphaël
Published: (2025)
Supplementary Materials to Graph Convolutional Branch and Bound
by: Sciandra, Lorenzo, et al.
Published: (2024)
by: Sciandra, Lorenzo, et al.
Published: (2024)
Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
by: Patel, Vivak, et al.
Published: (2024)
by: Patel, Vivak, et al.
Published: (2024)
Quantum-Inspired DRL Approach with LSTM and OU Noise for Cut Order Planning Optimization
by: Chrisnanto, Yulison Herry, et al.
Published: (2025)
by: Chrisnanto, Yulison Herry, et al.
Published: (2025)
ADAPT: Lightweight, Long-Range Machine Learning Force Fields Without Graphs
by: Dramko, Evan, et al.
Published: (2025)
by: Dramko, Evan, et al.
Published: (2025)
Tracking the Median of Gradients with a Stochastic Proximal Point Method
by: Schaipp, Fabian, et al.
Published: (2024)
by: Schaipp, Fabian, et al.
Published: (2024)
Representation and Regression Problems in Neural Networks: Relaxation, Generalization, and Numerics
by: Liu, Kang, et al.
Published: (2024)
by: Liu, Kang, et al.
Published: (2024)
Convergence of gradient descent for deep neural networks
by: Chatterjee, Sourav
Published: (2022)
by: Chatterjee, Sourav
Published: (2022)
Optimization Dynamics of Equivariant and Augmented Neural Networks
by: Nordenfors, Oskar, et al.
Published: (2023)
by: Nordenfors, Oskar, et al.
Published: (2023)
Learning time-scales in two-layers neural networks
by: Berthier, Raphaël, et al.
Published: (2023)
by: Berthier, Raphaël, et al.
Published: (2023)
Error Bound Analysis for the Regularized Loss of Deep Linear Neural Networks
by: Chen, Po, et al.
Published: (2025)
by: Chen, Po, et al.
Published: (2025)
BROS: Bias-Corrected Randomized Subspaces for Memory-Efficient Single-Loop Bilevel Optimization
by: Zhang, Hengrui, et al.
Published: (2026)
by: Zhang, Hengrui, et al.
Published: (2026)
Fixed-Point Neural Optimal Transport without Implicit Differentiation
by: Park, Yesom, et al.
Published: (2026)
by: Park, Yesom, et al.
Published: (2026)
A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
by: Liu, Yaru, et al.
Published: (2026)
by: Liu, Yaru, et al.
Published: (2026)
Data Augmentation and Regularization for Learning Group Equivariance
by: Nordenfors, Oskar, et al.
Published: (2025)
by: Nordenfors, Oskar, et al.
Published: (2025)
Constructive Universal Approximation and Finite Sample Memorization by Narrow Deep ReLU Networks
by: Hernández, Martín, et al.
Published: (2024)
by: Hernández, Martín, et al.
Published: (2024)
Terminally constrained flow-based generative models from an optimal control perspective
by: Gao, Weiguo, et al.
Published: (2026)
by: Gao, Weiguo, et al.
Published: (2026)
A Two-Phase Adaptive Balanced Penalty Method for Controllable Pareto Front Learning under Split Feasibility Conditions
by: Hoang, Nguyen Viet, et al.
Published: (2026)
by: Hoang, Nguyen Viet, et al.
Published: (2026)
On the existence of minimizers in shallow residual ReLU neural network optimization landscapes
by: Dereich, Steffen, et al.
Published: (2023)
by: Dereich, Steffen, et al.
Published: (2023)
On the existence of optimal shallow feedforward networks with ReLU activation
by: Dereich, Steffen, et al.
Published: (2023)
by: Dereich, Steffen, et al.
Published: (2023)
Power Homotopy for Zeroth-Order Non-Convex Optimizations
by: Xu, Chen
Published: (2025)
by: Xu, Chen
Published: (2025)
Global Optimization with A Power-Transformed Objective and Gaussian Smoothing
by: Xu, Chen
Published: (2024)
by: Xu, Chen
Published: (2024)
Beyond Discreteness: Sample Complexity Analysis of Straight-Through Estimator for 1-bit Quantization
by: Jeong, Halyun, et al.
Published: (2025)
by: Jeong, Halyun, et al.
Published: (2025)
On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis
by: Li, Zhong, et al.
Published: (2020)
by: Li, Zhong, et al.
Published: (2020)
Resolving gradient pathology in physics-informed epidemiological models
by: Golooba, Nickson, et al.
Published: (2026)
by: Golooba, Nickson, et al.
Published: (2026)
How to beat a Bayesian adversary
by: Ding, Zihan, et al.
Published: (2024)
by: Ding, Zihan, et al.
Published: (2024)
Relu and softplus neural nets as zero-sum turn-based games
by: Gaubert, Stephane, et al.
Published: (2025)
by: Gaubert, Stephane, et al.
Published: (2025)
Progressive Feedforward Collapse of ResNet Training
by: Wang, Sicong, et al.
Published: (2024)
by: Wang, Sicong, et al.
Published: (2024)
Exponential convergence rates for momentum stochastic gradient descent in the overparametrized setting
by: Gess, Benjamin, et al.
Published: (2023)
by: Gess, Benjamin, et al.
Published: (2023)
An alternative formulation of attention pooling function in translation
by: Conti, Eddie
Published: (2024)
by: Conti, Eddie
Published: (2024)
Cluster-based classification with neural ODEs via control
by: Álvarez-López, Antonio, et al.
Published: (2023)
by: Álvarez-López, Antonio, et al.
Published: (2023)
Quantitative Convergence of Wasserstein Gradient Flows of Kernel Mean Discrepancies
by: Chizat, Lénaïc, et al.
Published: (2026)
by: Chizat, Lénaïc, et al.
Published: (2026)
Progressive Power Homotopy for Non-convex Optimization
by: Xu, Chen
Published: (2026)
by: Xu, Chen
Published: (2026)
Iso-Riemannian Optimization on Learned Data Manifolds
by: Diepeveen, Willem, et al.
Published: (2025)
by: Diepeveen, Willem, et al.
Published: (2025)
Similar Items
-
Nesterov acceleration in benignly non-convex landscapes
by: Gupta, Kanan, et al.
Published: (2024) -
Input Convex Kolmogorov Arnold Networks
by: Deschatre, Thomas, et al.
Published: (2025) -
Generative modeling of conditional probability distributions on the level-sets of collective variables
by: Akhyar, Fatima-Zahrae, et al.
Published: (2025) -
Exact Sequence Interpolation with Transformers
by: Alcalde, Albert, et al.
Published: (2025) -
Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching
by: Denkert, Robert, et al.
Published: (2024)