:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Luo, Shiyi, Liu, Mingshuo, Yu, Yifeng, Ren, Shangping, Bai, Yu
Format:	Preprint
Published:	2024
Subjects:	Machine Learning 68T10, 65K10
Online Access:	https://arxiv.org/abs/2408.01534
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring
by: Antil, Harbir, et al.
Published: (2025)

Approximation and Gradient Descent Training with Neural Networks
by: Welper, G.
Published: (2024)

Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
by: Patel, Vivak, et al.
Published: (2024)

Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a Neural Network
by: Linehan, Kathryn, et al.
Published: (2024)

Deep Unfolding Network for Nonlinear Multi-Frequency Electrical Impedance Tomography
by: Alberti, Giovanni S., et al.
Published: (2025)

An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
by: Xu, Tong, et al.
Published: (2024)

To be or not to be stable, that is the question: understanding neural networks for inverse problems
by: Evangelista, Davide, et al.
Published: (2022)

Error Bound Analysis for the Regularized Loss of Deep Linear Neural Networks
by: Chen, Po, et al.
Published: (2025)

A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
by: Liu, Yaru, et al.
Published: (2026)

A resource-efficient model for deep kernel learning
by: D'Amore, Luisa
Published: (2024)

Neural-network methods for two-dimensional finite-source reflector design
by: Hacking, Roel, et al.
Published: (2026)

Multilevel Training for Kolmogorov Arnold Networks
by: Southworth, Ben S., et al.
Published: (2026)

Derivative-Informed Fourier Neural Operator: Universal Approximation and Applications to PDE-Constrained Optimization
by: Yao, Boyuan, et al.
Published: (2025)

Leveraging joint sparsity in hierarchical Bayesian learning
by: Glaubitz, Jan, et al.
Published: (2023)

Sequential Least-Squares Estimators with Fast Randomized Sketching for Linear Statistical Models
by: Chen, Guan-Yu, et al.
Published: (2025)

On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descent
by: Laumont, Rémi, et al.
Published: (2022)

Fixed-Point Neural Optimal Transport without Implicit Differentiation
by: Park, Yesom, et al.
Published: (2026)

Convergence of Momentum-Based Optimization Algorithms with Time-Varying Parameters
by: Vidyasagar, Mathukumalli
Published: (2025)

Consensus-based optimization for closed-box adversarial attacks and a connection to evolution strategies
by: Roith, Tim, et al.
Published: (2025)

Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
by: Scott, Mitchell, et al.
Published: (2025)

The Pontryagin Maximum Principle for Training Convolutional Neural Networks
by: Hofmann, Sebastian, et al.
Published: (2025)

SDFs from Unoriented Point Clouds using Neural Variational Heat Distances
by: Weidemaier, Samuel, et al.
Published: (2025)

Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)
by: Ilin, Ivan
Published: (2025)

OCTANE -- Optimal Control for Tensor-based Autoencoder Network Emergence: Explicit Case
by: Khatri, Ratna, et al.
Published: (2025)

Self2Seg: Single-Image Self-Supervised Joint Segmentation and Denoising
by: Gruber, Nadja, et al.
Published: (2023)

Holonorm
by: Yongueng, Daryl Noupa, et al.
Published: (2025)

Deceptron: Learned Local Inverses for Fast and Stable Physics Inversion
by: Kachhadiya, Aaditya L.
Published: (2025)

A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network
by: Cai, Zhiqiang, et al.
Published: (2024)

Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition
by: Klawonn, Axel, et al.
Published: (2024)

Neural Network-Based Parameter Estimation for Non-Autonomous Differential Equations with Discontinuous Signals
by: Jo, Hyeontae, et al.
Published: (2025)

Faster Adaptive Optimization via Expected Gradient Outer Product Reparameterization
by: DePavia, Adela, et al.
Published: (2025)

A Unified Framework for Lifted Training and Inversion Approaches
by: Wang, Xiaoyu, et al.
Published: (2025)

Bayesian imaging using Plug & Play priors: when Langevin meets Tweedie
by: Laumont, Rémi, et al.
Published: (2021)

Stochastic Estimation of the Layer-wise Hessian Trace for Monitoring Neural-network Training
by: Bolshim, Maxim, et al.
Published: (2026)

On the Convergence Behavior of Preconditioned Gradient Descent Toward the Rich Learning Regime
by: Jiang, Shuai, et al.
Published: (2026)

SVD-Preconditioned Gradient Descent Method for Solving Nonlinear Least Squares Problems
by: Chang, Zhipeng, et al.
Published: (2026)

Preconditioned subgradient method for composite optimization: overparameterization and fast convergence
by: Díaz, Mateo, et al.
Published: (2025)

Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection
by: Chowdhary, Abhijit, et al.
Published: (2026)

Shape Gradient Based Non-Parametric Mumford-Shah Segmentation Without Level Sets
by: P, Shafeequdheen, et al.
Published: (2025)

Inter-Layer Hessian Analysis of Neural Networks with DAG Architectures
by: Bolshim, Maxim, et al.
Published: (2026)