:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Qiao, Dan, Wang, Yu-Xiang
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.01473
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
by: Qiao, Dan, et al.
Published: (2024)

Benign Overfitting for Regression with Trained Two-Layer ReLU Networks
by: Park, Junhyung, et al.
Published: (2024)

N-ReLU: Zero-Mean Stochastic Extension of ReLU
by: Manik, Md Motaleb Hossen, et al.
Published: (2025)

The Geometry of ReLU Networks through the ReLU Transition Graph
by: Dhayalkar, Sahil Rajesh
Published: (2025)

The Resurrection of the ReLU
by: Horuz, Coşku Can, et al.
Published: (2025)

Pathwise Explanation of ReLU Neural Networks
by: Lim, Seongwoo, et al.
Published: (2025)

Activation-Descent Regularization for Input Optimization of ReLU Networks
by: Yu, Hongzhan, et al.
Published: (2024)

ReLU Networks for Exact Generation of Similar Graphs
by: Ghafoor, Mamoona, et al.
Published: (2026)

Three Quantization Regimes for ReLU Networks
by: Ou, Weigutian, et al.
Published: (2024)

Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers
by: Wild, Cody, et al.
Published: (2024)

RePO: Understanding Preference Learning Through ReLU-Based Optimization
by: Wu, Junkang, et al.
Published: (2025)

Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks
by: Schoots, Nandi, et al.
Published: (2025)

Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks
by: Yazdannik, Saman, et al.
Published: (2025)

Is ReLU Adversarially Robust?
by: Sooksatra, Korn, et al.
Published: (2024)

Convergence of Shallow ReLU Networks on Weakly Interacting Data
by: Dana, Léo, et al.
Published: (2025)

Unveiling the Training Dynamics of ReLU Networks through a Linear Lens
by: Ye, Longqing
Published: (2025)

Sufficient Conditions for Stability of Minimum-Norm Interpolating Deep ReLU Networks
by: Harzli, Ouns El, et al.
Published: (2026)

Expressive Power of ReLU and Step Networks under Floating-Point Operations
by: Park, Yeachan, et al.
Published: (2024)

Noisy Interpolation Learning with Shallow Univariate ReLU Networks
by: Joshi, Nirmit, et al.
Published: (2023)

Deep ReLU Networks Have Surprisingly Simple Polytopes
by: Fan, Feng-Lei, et al.
Published: (2023)

Topological Signatures of ReLU Neural Network Activation Patterns
by: Bosca, Vicente, et al.
Published: (2025)

$λ$-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks
by: Pérez-Corral, Cristian, et al.
Published: (2026)

Detecting Invariant Manifolds in ReLU-Based RNNs
by: Eisenmann, Lukas, et al.
Published: (2025)

On the Principles of ReLU Networks with One Hidden Layer
by: Huang, Changcun
Published: (2024)

ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
by: Zhang, Zhengyan, et al.
Published: (2024)

Precise Verification of Transformers through ReLU-Catalyzed Abstraction Refinement
by: Liu, Hengjie, et al.
Published: (2026)

A Lower Bound for the Number of Linear Regions of Ternary ReLU Regression Neural Networks
by: Nakahara, Yuta, et al.
Published: (2025)

Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training
by: Milkert, Max, et al.
Published: (2023)

ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models
by: Jha, Nandan Kumar, et al.
Published: (2024)

Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape
by: Bantzis, Ioannis, et al.
Published: (2025)

The Median is Easier than it Looks: Approximation with a Constant-Depth, Linear-Width ReLU Network
by: Dutta, Abhigyan, et al.
Published: (2026)

Geometry-induced Regularization in Deep ReLU Neural Networks
by: Bona-Pellissier, Joachim, et al.
Published: (2024)

Hidden Minima in Two-Layer ReLU Networks
by: Arjevani, Yossi
Published: (2023)

The Cost of Robustness: Tighter Bounds on Parameter Complexity for Robust Memorization in ReLU Nets
by: Kim, Yujun, et al.
Published: (2025)

Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon
by: Liang, Tongtong, et al.
Published: (2025)

Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network
by: Fukasaku, Ryoya, et al.
Published: (2025)

Uncertainty Quantification with Bayesian Higher Order ReLU KANs
by: Giroux, James, et al.
Published: (2024)

Training a Two Layer ReLU Network Analytically
by: Barbu, Adrian
Published: (2023)

Looped ReLU MLPs May Be All You Need as Practical Programmable Computers
by: Liang, Yingyu, et al.
Published: (2024)

HiQ-Lip: A Hierarchical Quantum-Classical Method for Global Lipschitz Constant Estimation of ReLU Networks
by: He, Haoqi, et al.
Published: (2025)