| Main Authors: | Liang, Tongtong, Singh, Esha, Parhi, Rahul, Cloninger, Alexander, Wang, Yu-Xiang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04807 |
Similar Items
Generalization Below the Edge of Stability: The Role of Data Geometry
by: Liang, Tongtong, et al.
Published: (2025)
Stable Minima of ReLU Neural Networks Suffer from the Curse of Dimensionality: The Neural Shattering Phenomenon
by: Liang, Tongtong, et al.
Published: (2025)
On the Loss Landscape Geometry of Regularized Deep Matrix Factorization: Uniqueness and Sharpness
by: Kamber, Anil, et al.
Published: (2026)
Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
by: Parhi, Rahul, et al.
Published: (2023)
LoLA: Low-Rank Linear Attention With Sparse Caching
by: McDermott, Luke, et al.
Published: (2025)
Sharpness of Minima in Deep Matrix Factorization
by: Kamber, Anil, et al.
Published: (2025)
A Gap Between the Gaussian RKHS and Neural Networks: An Infinite-Center Asymptotic Analysis
by: Kumar, Akash, et al.
Published: (2025)
Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes
by: Qiao, Dan, et al.
Published: (2024)
Finding Stable Subnetworks at Initialization with Dataset Distillation
by: McDermott, Luke, et al.
Published: (2025)
Nonasymptotic Convergence Rates for Plug-and-Play Methods With MMSE Denoisers
by: Pritchard, Henry, et al.
Published: (2025)
Training Guarantees of Neural Network Classification Two-Sample Tests by Kernel Analysis
by: Khurana, Varun, et al.
Published: (2024)
Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression
by: Shenouda, Joseph, et al.
Published: (2023)
Towards Sharp Minimax Risk Bounds for Operator Learning
by: Adcock, Ben, et al.
Published: (2025)
Point Cloud Classification via Deep Set Linearized Optimal Transport
by: Mahan, Scott, et al.
Published: (2024)
Random ReLU Neural Networks as Non-Gaussian Processes
by: Parhi, Rahul, et al.
Published: (2024)
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
by: Meng, Xiang, et al.
Published: (2024)
LayerPipe2: Multistage Pipelining and Weight Recompute via Improved Exponential Moving Average for Training Neural Networks
by: Unnikrishnan, Nanda K., et al.
Published: (2025)
CW-CNN & CW-AN: Convolutional Networks and Attention Networks for CW-Complexes
by: Khorana, Rahul
Published: (2024)
Generalization Bound for Diffusion Models using Random Features
by: Saha, Esha, et al.
Published: (2023)
Does Flatness imply Generalization for Logistic Loss in Univariate Two-Layer ReLU Network?
by: Qiao, Dan, et al.
Published: (2025)
Risk Prediction of Cardiovascular Disease for Diabetic Patients with Machine Learning and Deep Learning Techniques
by: Chowdhury, Esha
Published: (2025)
Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights
by: Shen, Zhaiming, et al.
Published: (2025)
Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks
by: Francy, Samer, et al.
Published: (2024)
When Data Falls Short: Grokking Below the Critical Threshold
by: Singh, Vaibhav, et al.
Published: (2025)
Linearized Optimal Transport pyLOT Library: A Toolkit for Machine Learning on Point Clouds
by: Linwu, Jun, et al.
Published: (2025)
Learning Coupled System Dynamics under Incomplete Physical Constraints and Missing Data
by: Saha, Esha, et al.
Published: (2025)
GTAGCN: Generalized Topology Adaptive Graph Convolutional Networks
by: Singh, Sukhdeep, et al.
Published: (2024)
Weighted variation spaces and approximation by shallow ReLU networks
by: DeVore, Ronald, et al.
Published: (2023)
Robust Tangent Space Estimation via Laplacian Eigenvector Gradient Orthogonalization
by: Kohli, Dhruv, et al.
Published: (2025)
Hierarchical Multiple Kernel K-Means Algorithm Based on Sparse Connectivity
by: Wang, Lei, et al.
Published: (2024)
The Transferability of Downsamped Sparse Graph Convolutional Networks
by: Shu, Qinji, et al.
Published: (2024)
Deeper Insights into Deep Graph Convolutional Networks: Stability and Generalization
by: Yang, Guangrui, et al.
Published: (2024)
OT Score: An OT based Confidence Score for Prototype-Assisted Source Free Unsupervised Domain Adaptation
by: Zhang, Yiming, et al.
Published: (2025)
Linearized Optimal Transport for Analysis of High-Dimensional Point-Cloud and Single-Cell Data
by: Wang, Tianxiang, et al.
Published: (2025)
Semi-Supervised Laplace Learning on Stiefel Manifolds
by: Holtz, Chester, et al.
Published: (2023)
3BASiL: An Algorithmic Framework for Sparse plus Low-Rank Compression of LLMs
by: Makni, Mehdi, et al.
Published: (2026)
High-Order Tensor Regression in Sparse Convolutional Neural Networks
by: Algarte, Roberto Dias
Published: (2025)
Channel-Wise MLPs Improve the Generalization of Recurrent Convolutional Networks
by: Breslow, Nathan
Published: (2025)
Generalized Linear Mode Connectivity for Transformers
by: Theus, Alexander, et al.
Published: (2025)
Linear Mode Connectivity in Sparse Neural Networks
by: McDermott, Luke, et al.
Published: (2023)