Saved in:
| Main Authors: | Qiao, Dan, Wang, Yu-Xiang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.01473 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
by: Qiao, Dan, et al.
Published: (2024)
by: Qiao, Dan, et al.
Published: (2024)
Benign Overfitting for Regression with Trained Two-Layer ReLU Networks
by: Park, Junhyung, et al.
Published: (2024)
by: Park, Junhyung, et al.
Published: (2024)
N-ReLU: Zero-Mean Stochastic Extension of ReLU
by: Manik, Md Motaleb Hossen, et al.
Published: (2025)
by: Manik, Md Motaleb Hossen, et al.
Published: (2025)
The Geometry of ReLU Networks through the ReLU Transition Graph
by: Dhayalkar, Sahil Rajesh
Published: (2025)
by: Dhayalkar, Sahil Rajesh
Published: (2025)
The Resurrection of the ReLU
by: Horuz, Coşku Can, et al.
Published: (2025)
by: Horuz, Coşku Can, et al.
Published: (2025)
Pathwise Explanation of ReLU Neural Networks
by: Lim, Seongwoo, et al.
Published: (2025)
by: Lim, Seongwoo, et al.
Published: (2025)
Activation-Descent Regularization for Input Optimization of ReLU Networks
by: Yu, Hongzhan, et al.
Published: (2024)
by: Yu, Hongzhan, et al.
Published: (2024)
ReLU Networks for Exact Generation of Similar Graphs
by: Ghafoor, Mamoona, et al.
Published: (2026)
by: Ghafoor, Mamoona, et al.
Published: (2026)
Three Quantization Regimes for ReLU Networks
by: Ou, Weigutian, et al.
Published: (2024)
by: Ou, Weigutian, et al.
Published: (2024)
Uncovering Layer-Dependent Activation Sparsity Patterns in ReLU Transformers
by: Wild, Cody, et al.
Published: (2024)
by: Wild, Cody, et al.
Published: (2024)
RePO: Understanding Preference Learning Through ReLU-Based Optimization
by: Wu, Junkang, et al.
Published: (2025)
by: Wu, Junkang, et al.
Published: (2025)
Relating Piecewise Linear Kolmogorov Arnold Networks to ReLU Networks
by: Schoots, Nandi, et al.
Published: (2025)
by: Schoots, Nandi, et al.
Published: (2025)
Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks
by: Yazdannik, Saman, et al.
Published: (2025)
by: Yazdannik, Saman, et al.
Published: (2025)
Is ReLU Adversarially Robust?
by: Sooksatra, Korn, et al.
Published: (2024)
by: Sooksatra, Korn, et al.
Published: (2024)
Convergence of Shallow ReLU Networks on Weakly Interacting Data
by: Dana, Léo, et al.
Published: (2025)
by: Dana, Léo, et al.
Published: (2025)
Unveiling the Training Dynamics of ReLU Networks through a Linear Lens
by: Ye, Longqing
Published: (2025)
by: Ye, Longqing
Published: (2025)
Sufficient Conditions for Stability of Minimum-Norm Interpolating Deep ReLU Networks
by: Harzli, Ouns El, et al.
Published: (2026)
by: Harzli, Ouns El, et al.
Published: (2026)
Expressive Power of ReLU and Step Networks under Floating-Point Operations
by: Park, Yeachan, et al.
Published: (2024)
by: Park, Yeachan, et al.
Published: (2024)
Noisy Interpolation Learning with Shallow Univariate ReLU Networks
by: Joshi, Nirmit, et al.
Published: (2023)
by: Joshi, Nirmit, et al.
Published: (2023)
Deep ReLU Networks Have Surprisingly Simple Polytopes
by: Fan, Feng-Lei, et al.
Published: (2023)
by: Fan, Feng-Lei, et al.
Published: (2023)
Topological Signatures of ReLU Neural Network Activation Patterns
by: Bosca, Vicente, et al.
Published: (2025)
by: Bosca, Vicente, et al.
Published: (2025)
$λ$-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks
by: Pérez-Corral, Cristian, et al.
Published: (2026)
by: Pérez-Corral, Cristian, et al.
Published: (2026)
Detecting Invariant Manifolds in ReLU-Based RNNs
by: Eisenmann, Lukas, et al.
Published: (2025)
by: Eisenmann, Lukas, et al.
Published: (2025)
On the Principles of ReLU Networks with One Hidden Layer
by: Huang, Changcun
Published: (2024)
by: Huang, Changcun
Published: (2024)
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
by: Zhang, Zhengyan, et al.
Published: (2024)
by: Zhang, Zhengyan, et al.
Published: (2024)
Precise Verification of Transformers through ReLU-Catalyzed Abstraction Refinement
by: Liu, Hengjie, et al.
Published: (2026)
by: Liu, Hengjie, et al.
Published: (2026)
A Lower Bound for the Number of Linear Regions of Ternary ReLU Regression Neural Networks
by: Nakahara, Yuta, et al.
Published: (2025)
by: Nakahara, Yuta, et al.
Published: (2025)
Compelling ReLU Networks to Exhibit Exponentially Many Linear Regions at Initialization and During Training
by: Milkert, Max, et al.
Published: (2023)
by: Milkert, Max, et al.
Published: (2023)
ReLU's Revival: On the Entropic Overload in Normalization-Free Large Language Models
by: Jha, Nandan Kumar, et al.
Published: (2024)
by: Jha, Nandan Kumar, et al.
Published: (2024)
Saddle-To-Saddle Dynamics in Deep ReLU Networks: Low-Rank Bias in the First Saddle Escape
by: Bantzis, Ioannis, et al.
Published: (2025)
by: Bantzis, Ioannis, et al.
Published: (2025)
The Median is Easier than it Looks: Approximation with a Constant-Depth, Linear-Width ReLU Network
by: Dutta, Abhigyan, et al.
Published: (2026)
by: Dutta, Abhigyan, et al.
Published: (2026)
Geometry-induced Regularization in Deep ReLU Neural Networks
by: Bona-Pellissier, Joachim, et al.
Published: (2024)
by: Bona-Pellissier, Joachim, et al.
Published: (2024)
Hidden Minima in Two-Layer ReLU Networks
by: Arjevani, Yossi
Published: (2023)
by: Arjevani, Yossi
Published: (2023)
The Cost of Robustness: Tighter Bounds on Parameter Complexity for Robust Memorization in ReLU Nets
by: Kim, Yujun, et al.
Published: (2025)
by: Kim, Yujun, et al.
Published: (2025)
Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon
by: Liang, Tongtong, et al.
Published: (2025)
by: Liang, Tongtong, et al.
Published: (2025)
Algebraic Approach to Ridge-Regularized Mean Squared Error Minimization in Minimal ReLU Neural Network
by: Fukasaku, Ryoya, et al.
Published: (2025)
by: Fukasaku, Ryoya, et al.
Published: (2025)
Uncertainty Quantification with Bayesian Higher Order ReLU KANs
by: Giroux, James, et al.
Published: (2024)
by: Giroux, James, et al.
Published: (2024)
Training a Two Layer ReLU Network Analytically
by: Barbu, Adrian
Published: (2023)
by: Barbu, Adrian
Published: (2023)
Looped ReLU MLPs May Be All You Need as Practical Programmable Computers
by: Liang, Yingyu, et al.
Published: (2024)
by: Liang, Yingyu, et al.
Published: (2024)
HiQ-Lip: A Hierarchical Quantum-Classical Method for Global Lipschitz Constant Estimation of ReLU Networks
by: He, Haoqi, et al.
Published: (2025)
by: He, Haoqi, et al.
Published: (2025)
Similar Items
-
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
by: Qiao, Dan, et al.
Published: (2024) -
Benign Overfitting for Regression with Trained Two-Layer ReLU Networks
by: Park, Junhyung, et al.
Published: (2024) -
N-ReLU: Zero-Mean Stochastic Extension of ReLU
by: Manik, Md Motaleb Hossen, et al.
Published: (2025) -
The Geometry of ReLU Networks through the ReLU Transition Graph
by: Dhayalkar, Sahil Rajesh
Published: (2025) -
The Resurrection of the ReLU
by: Horuz, Coşku Can, et al.
Published: (2025)