| Main Authors: | Zhang, Yedi, Saxe, Andrew, Latham, Peter E. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.20607 |
Similar Items
When Are Bias-Free ReLU Networks Effectively Linear Networks?
by: Zhang, Yedi, et al.
Published: (2024)
Understanding Unimodal Bias in Multimodal Deep Linear Networks
by: Zhang, Yedi, et al.
Published: (2023)
Training Dynamics of In-Context Learning in Linear Attention
by: Zhang, Yedi, et al.
Published: (2025)
Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape
by: Bantzis, Ioannis, et al.
Published: (2025)
Neural Network-based High-index Saddle Dynamics Method for Searching Saddle Points and Solution Landscape
by: Liu, Yuankai, et al.
Published: (2024)
Stochastic Gradient Descent in the Saddle-to-Saddle Regime of Deep Linear Networks
by: Corlouer, Guillaume, et al.
Published: (2026)
Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks
by: Zhang, Leyang, et al.
Published: (2024)
Plateaus, Optima, and Overfitting in Multi-Layer Perceptrons: A Saddle-Saddle-Attractor Scenario
by: Maleknia, Alex Alì, et al.
Published: (2026)
Directional Convergence Near Small Initializations and Saddles in Two-Homogeneous Neural Networks
by: Kumar, Akshay, et al.
Published: (2024)
A Theory of Saddle Escape in Deep Nonlinear Networks
by: Rawal, Divit, et al.
Published: (2026)
Series of Hessian-Vector Products for Tractable Saddle-Free Newton Optimisation of Neural Networks
by: Oldewage, Elre T., et al.
Published: (2023)
Federated Composite Saddle Point Optimization
by: Bai, Site, et al.
Published: (2023)
Saddle Networks: Structure-Preserving Architectures for Convex-Concave Functions
by: Warin, Xavier
Published: (2026)
Hierarchical Simplicity Bias of Neural Networks
by: Du, Zhehang
Published: (2023)
Regret Minimization via Saddle Point Optimization
by: Kirschner, Johannes, et al.
Published: (2024)
Dimension-Free Saddle-Point Escape in Muon
by: Long, Yanlin, et al.
Published: (2026)
Enhancing Stability of Physics-Informed Neural Network Training Through Saddle-Point Reformulation
by: Bylinkin, Dmitry, et al.
Published: (2025)
Saddle Hierarchy in Dense Associative Memory
by: Thériault, Robin, et al.
Published: (2025)
Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding
by: Wu, Frank Zhengqing, et al.
Published: (2024)
Quantization Avoids Saddle Points in Distributed Optimization
by: Bo, Yanan, et al.
Published: (2024)
Never Saddle for Reparameterized Steepest Descent as Mirror Flow
by: Jacobs, Tom, et al.
Published: (2026)
Dimer-Enhanced Optimization: A First-Order Approach to Escaping Saddle Points in Neural Network Training
by: Hu, Yue, et al.
Published: (2025)
Optimal Learning Rate Schedule for Balancing Effort and Performance
by: Njaradi, Valentina, et al.
Published: (2026)
Inertial Newton Algorithms Avoiding Strict Saddle Points
by: Castera, Camille
Published: (2021)
Proximal Point Method for Online Saddle Point Problem
by: Meng, Qing-xin, et al.
Published: (2024)
Efficiently Escaping Saddle Points for Policy Optimization
by: Khorasani, Sadegh, et al.
Published: (2023)
Only Strict Saddles in the Energy Landscape of Predictive Coding Networks?
by: Innocenti, Francesco, et al.
Published: (2024)
Do Quantum Neural Networks have Simplicity Bias?
by: Pointing, Jessica
Published: (2024)
Simultaneous Learning and Optimization via Misspecified Saddle Point Problems
by: Ahmadi, Mohammad Mahdi, et al.
Published: (2025)
A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization
by: Gan, Min, et al.
Published: (2025)
Algorithm Development in Neural Networks: Insights from the Streaming Parity Task
by: van Rossem, Loek, et al.
Published: (2025)
Hessian-guided Perturbed Wasserstein Gradient Flows for Escaping Saddle Points
by: Yamamoto, Naoya, et al.
Published: (2025)
Type-II Saddles and Probabilistic Stability of Stochastic Gradient Descent
by: Ziyin, Liu, et al.
Published: (2023)
Bilevel Optimization over Saddle Points of Zero-Sum Markov Games
by: Zheng, Zihao, et al.
Published: (2026)
Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation
by: Beznosikov, Aleksandr, et al.
Published: (2026)
Online Min-Max Optimization: From Individual Regrets to Cumulative Saddle Points
by: Vyas, Abhijeet, et al.
Published: (2026)
A Compression Perspective on Simplicity Bias
by: Marty, Tom, et al.
Published: (2026)
WinQ: Accelerating Quantization-Aware Training of Language Models Around Saddle Points
by: Li, Dongyue, et al.
Published: (2026)
Efficiently Escaping Saddle Points under Generalized Smoothness via Self-Bounding Regularity
by: Cao, Daniel Yiming, et al.
Published: (2025)
Escaping Saddle Points for Nonsmooth Weakly Convex Functions via Perturbed Proximal Algorithms
by: Huang, Minhui, et al.
Published: (2021)