:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gupta, Kanan, Siegel, Jonathan W., Wojtowytsch, Stephan
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Optimization and Control 68T07
Online Access:	https://arxiv.org/abs/2302.05515
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Nesterov acceleration in benignly non-convex landscapes
by: Gupta, Kanan, et al.
Published: (2024)

Input Convex Kolmogorov Arnold Networks
by: Deschatre, Thomas, et al.
Published: (2025)

Generative modeling of conditional probability distributions on the level-sets of collective variables
by: Akhyar, Fatima-Zahrae, et al.
Published: (2025)

Exact Sequence Interpolation with Transformers
by: Alcalde, Albert, et al.
Published: (2025)

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching
by: Denkert, Robert, et al.
Published: (2024)

Explicit neural network classifiers for non-separable data
by: Ewald, Patrícia Muñoz
Published: (2025)

Diagonal Linear Networks and the Lasso Regularization Path
by: Berthier, Raphaël
Published: (2025)

Supplementary Materials to Graph Convolutional Branch and Bound
by: Sciandra, Lorenzo, et al.
Published: (2024)

Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
by: Patel, Vivak, et al.
Published: (2024)

Quantum-Inspired DRL Approach with LSTM and OU Noise for Cut Order Planning Optimization
by: Chrisnanto, Yulison Herry, et al.
Published: (2025)

ADAPT: Lightweight, Long-Range Machine Learning Force Fields Without Graphs
by: Dramko, Evan, et al.
Published: (2025)

Tracking the Median of Gradients with a Stochastic Proximal Point Method
by: Schaipp, Fabian, et al.
Published: (2024)

Representation and Regression Problems in Neural Networks: Relaxation, Generalization, and Numerics
by: Liu, Kang, et al.
Published: (2024)

Convergence of gradient descent for deep neural networks
by: Chatterjee, Sourav
Published: (2022)

Optimization Dynamics of Equivariant and Augmented Neural Networks
by: Nordenfors, Oskar, et al.
Published: (2023)

Learning time-scales in two-layers neural networks
by: Berthier, Raphaël, et al.
Published: (2023)

Error Bound Analysis for the Regularized Loss of Deep Linear Neural Networks
by: Chen, Po, et al.
Published: (2025)

BROS: Bias-Corrected Randomized Subspaces for Memory-Efficient Single-Loop Bilevel Optimization
by: Zhang, Hengrui, et al.
Published: (2026)

Fixed-Point Neural Optimal Transport without Implicit Differentiation
by: Park, Yesom, et al.
Published: (2026)

A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
by: Liu, Yaru, et al.
Published: (2026)

Data Augmentation and Regularization for Learning Group Equivariance
by: Nordenfors, Oskar, et al.
Published: (2025)

Constructive Universal Approximation and Finite Sample Memorization by Narrow Deep ReLU Networks
by: Hernández, Martín, et al.
Published: (2024)

Terminally constrained flow-based generative models from an optimal control perspective
by: Gao, Weiguo, et al.
Published: (2026)

A Two-Phase Adaptive Balanced Penalty Method for Controllable Pareto Front Learning under Split Feasibility Conditions
by: Hoang, Nguyen Viet, et al.
Published: (2026)

On the existence of minimizers in shallow residual ReLU neural network optimization landscapes
by: Dereich, Steffen, et al.
Published: (2023)

On the existence of optimal shallow feedforward networks with ReLU activation
by: Dereich, Steffen, et al.
Published: (2023)

Power Homotopy for Zeroth-Order Non-Convex Optimizations
by: Xu, Chen
Published: (2025)

Global Optimization with A Power-Transformed Objective and Gaussian Smoothing
by: Xu, Chen
Published: (2024)

Beyond Discreteness: Sample Complexity Analysis of Straight-Through Estimator for 1-bit Quantization
by: Jeong, Halyun, et al.
Published: (2025)

On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis
by: Li, Zhong, et al.
Published: (2020)

Resolving gradient pathology in physics-informed epidemiological models
by: Golooba, Nickson, et al.
Published: (2026)

How to beat a Bayesian adversary
by: Ding, Zihan, et al.
Published: (2024)

Relu and softplus neural nets as zero-sum turn-based games
by: Gaubert, Stephane, et al.
Published: (2025)

Progressive Feedforward Collapse of ResNet Training
by: Wang, Sicong, et al.
Published: (2024)

Exponential convergence rates for momentum stochastic gradient descent in the overparametrized setting
by: Gess, Benjamin, et al.
Published: (2023)

An alternative formulation of attention pooling function in translation
by: Conti, Eddie
Published: (2024)

Cluster-based classification with neural ODEs via control
by: Álvarez-López, Antonio, et al.
Published: (2023)

Quantitative Convergence of Wasserstein Gradient Flows of Kernel Mean Discrepancies
by: Chizat, Lénaïc, et al.
Published: (2026)

Progressive Power Homotopy for Non-convex Optimization
by: Xu, Chen
Published: (2026)

Iso-Riemannian Optimization on Learned Data Manifolds
by: Diepeveen, Willem, et al.
Published: (2025)