Saved in:
| Main Authors: | Luo, Shiyi, Liu, Mingshuo, Yu, Yifeng, Ren, Shangping, Bai, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.01534 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring
by: Antil, Harbir, et al.
Published: (2025)
by: Antil, Harbir, et al.
Published: (2025)
Approximation and Gradient Descent Training with Neural Networks
by: Welper, G.
Published: (2024)
by: Welper, G.
Published: (2024)
Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
by: Patel, Vivak, et al.
Published: (2024)
by: Patel, Vivak, et al.
Published: (2024)
Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a Neural Network
by: Linehan, Kathryn, et al.
Published: (2024)
by: Linehan, Kathryn, et al.
Published: (2024)
Deep Unfolding Network for Nonlinear Multi-Frequency Electrical Impedance Tomography
by: Alberti, Giovanni S., et al.
Published: (2025)
by: Alberti, Giovanni S., et al.
Published: (2025)
An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
by: Xu, Tong, et al.
Published: (2024)
by: Xu, Tong, et al.
Published: (2024)
To be or not to be stable, that is the question: understanding neural networks for inverse problems
by: Evangelista, Davide, et al.
Published: (2022)
by: Evangelista, Davide, et al.
Published: (2022)
Error Bound Analysis for the Regularized Loss of Deep Linear Neural Networks
by: Chen, Po, et al.
Published: (2025)
by: Chen, Po, et al.
Published: (2025)
A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
by: Liu, Yaru, et al.
Published: (2026)
by: Liu, Yaru, et al.
Published: (2026)
A resource-efficient model for deep kernel learning
by: D'Amore, Luisa
Published: (2024)
by: D'Amore, Luisa
Published: (2024)
Neural-network methods for two-dimensional finite-source reflector design
by: Hacking, Roel, et al.
Published: (2026)
by: Hacking, Roel, et al.
Published: (2026)
Multilevel Training for Kolmogorov Arnold Networks
by: Southworth, Ben S., et al.
Published: (2026)
by: Southworth, Ben S., et al.
Published: (2026)
Derivative-Informed Fourier Neural Operator: Universal Approximation and Applications to PDE-Constrained Optimization
by: Yao, Boyuan, et al.
Published: (2025)
by: Yao, Boyuan, et al.
Published: (2025)
Leveraging joint sparsity in hierarchical Bayesian learning
by: Glaubitz, Jan, et al.
Published: (2023)
by: Glaubitz, Jan, et al.
Published: (2023)
Sequential Least-Squares Estimators with Fast Randomized Sketching for Linear Statistical Models
by: Chen, Guan-Yu, et al.
Published: (2025)
by: Chen, Guan-Yu, et al.
Published: (2025)
On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descent
by: Laumont, Rémi, et al.
Published: (2022)
by: Laumont, Rémi, et al.
Published: (2022)
Fixed-Point Neural Optimal Transport without Implicit Differentiation
by: Park, Yesom, et al.
Published: (2026)
by: Park, Yesom, et al.
Published: (2026)
Convergence of Momentum-Based Optimization Algorithms with Time-Varying Parameters
by: Vidyasagar, Mathukumalli
Published: (2025)
by: Vidyasagar, Mathukumalli
Published: (2025)
Consensus-based optimization for closed-box adversarial attacks and a connection to evolution strategies
by: Roith, Tim, et al.
Published: (2025)
by: Roith, Tim, et al.
Published: (2025)
Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability
by: Scott, Mitchell, et al.
Published: (2025)
by: Scott, Mitchell, et al.
Published: (2025)
The Pontryagin Maximum Principle for Training Convolutional Neural Networks
by: Hofmann, Sebastian, et al.
Published: (2025)
by: Hofmann, Sebastian, et al.
Published: (2025)
SDFs from Unoriented Point Clouds using Neural Variational Heat Distances
by: Weidemaier, Samuel, et al.
Published: (2025)
by: Weidemaier, Samuel, et al.
Published: (2025)
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)
by: Ilin, Ivan
Published: (2025)
by: Ilin, Ivan
Published: (2025)
OCTANE -- Optimal Control for Tensor-based Autoencoder Network Emergence: Explicit Case
by: Khatri, Ratna, et al.
Published: (2025)
by: Khatri, Ratna, et al.
Published: (2025)
Self2Seg: Single-Image Self-Supervised Joint Segmentation and Denoising
by: Gruber, Nadja, et al.
Published: (2023)
by: Gruber, Nadja, et al.
Published: (2023)
Holonorm
by: Yongueng, Daryl Noupa, et al.
Published: (2025)
by: Yongueng, Daryl Noupa, et al.
Published: (2025)
Deceptron: Learned Local Inverses for Fast and Stable Physics Inversion
by: Kachhadiya, Aaditya L.
Published: (2025)
by: Kachhadiya, Aaditya L.
Published: (2025)
A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network
by: Cai, Zhiqiang, et al.
Published: (2024)
by: Cai, Zhiqiang, et al.
Published: (2024)
Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition
by: Klawonn, Axel, et al.
Published: (2024)
by: Klawonn, Axel, et al.
Published: (2024)
Neural Network-Based Parameter Estimation for Non-Autonomous Differential Equations with Discontinuous Signals
by: Jo, Hyeontae, et al.
Published: (2025)
by: Jo, Hyeontae, et al.
Published: (2025)
Faster Adaptive Optimization via Expected Gradient Outer Product Reparameterization
by: DePavia, Adela, et al.
Published: (2025)
by: DePavia, Adela, et al.
Published: (2025)
A Unified Framework for Lifted Training and Inversion Approaches
by: Wang, Xiaoyu, et al.
Published: (2025)
by: Wang, Xiaoyu, et al.
Published: (2025)
Bayesian imaging using Plug & Play priors: when Langevin meets Tweedie
by: Laumont, Rémi, et al.
Published: (2021)
by: Laumont, Rémi, et al.
Published: (2021)
Stochastic Estimation of the Layer-wise Hessian Trace for Monitoring Neural-network Training
by: Bolshim, Maxim, et al.
Published: (2026)
by: Bolshim, Maxim, et al.
Published: (2026)
On the Convergence Behavior of Preconditioned Gradient Descent Toward the Rich Learning Regime
by: Jiang, Shuai, et al.
Published: (2026)
by: Jiang, Shuai, et al.
Published: (2026)
SVD-Preconditioned Gradient Descent Method for Solving Nonlinear Least Squares Problems
by: Chang, Zhipeng, et al.
Published: (2026)
by: Chang, Zhipeng, et al.
Published: (2026)
Preconditioned subgradient method for composite optimization: overparameterization and fast convergence
by: Díaz, Mateo, et al.
Published: (2025)
by: Díaz, Mateo, et al.
Published: (2025)
Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection
by: Chowdhary, Abhijit, et al.
Published: (2026)
by: Chowdhary, Abhijit, et al.
Published: (2026)
Shape Gradient Based Non-Parametric Mumford-Shah Segmentation Without Level Sets
by: P, Shafeequdheen, et al.
Published: (2025)
by: P, Shafeequdheen, et al.
Published: (2025)
Inter-Layer Hessian Analysis of Neural Networks with DAG Architectures
by: Bolshim, Maxim, et al.
Published: (2026)
by: Bolshim, Maxim, et al.
Published: (2026)
Similar Items
-
Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring
by: Antil, Harbir, et al.
Published: (2025) -
Approximation and Gradient Descent Training with Neural Networks
by: Welper, G.
Published: (2024) -
Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
by: Patel, Vivak, et al.
Published: (2024) -
Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a Neural Network
by: Linehan, Kathryn, et al.
Published: (2024) -
Deep Unfolding Network for Nonlinear Multi-Frequency Electrical Impedance Tomography
by: Alberti, Giovanni S., et al.
Published: (2025)