Saved in:
| Main Authors: | Wu, Diyuan, Mondelli, Marco |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.19104 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improved Scaling Laws via Weak-to-Strong Generalization in Random Feature Ridge Regression
by: Wu, Diyuan, et al.
Published: (2026)
by: Wu, Diyuan, et al.
Published: (2026)
Attention with Trained Embeddings Provably Selects Important Tokens
by: Wu, Diyuan, et al.
Published: (2025)
by: Wu, Diyuan, et al.
Published: (2025)
Beyond Unconstrained Features: Neural Collapse for Shallow Neural Networks with General Data
by: Hong, Wanli, et al.
Published: (2024)
by: Hong, Wanli, et al.
Published: (2024)
Neural Collapse versus Low-rank Bias: Is Deep Neural Collapse Really Optimal?
by: Súkeník, Peter, et al.
Published: (2024)
by: Súkeník, Peter, et al.
Published: (2024)
Privacy for Free in the Overparameterized Regime
by: Bombari, Simone, et al.
Published: (2024)
by: Bombari, Simone, et al.
Published: (2024)
Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers
by: Súkeník, Peter, et al.
Published: (2025)
by: Súkeník, Peter, et al.
Published: (2025)
Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse
by: Jacot, Arthur, et al.
Published: (2024)
by: Jacot, Arthur, et al.
Published: (2024)
How Spurious Features Are Memorized: Precise Analysis for Random and NTK Features
by: Bombari, Simone, et al.
Published: (2023)
by: Bombari, Simone, et al.
Published: (2023)
A Law of Data Reconstruction for Random Features (and Beyond)
by: Iurada, Leonardo, et al.
Published: (2025)
by: Iurada, Leonardo, et al.
Published: (2025)
Neural Collapse in Cumulative Link Models for Ordinal Regression: An Analysis with Unconstrained Feature Model
by: Ma, Chuang, et al.
Published: (2025)
by: Ma, Chuang, et al.
Published: (2025)
Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model
by: Dang, Hien, et al.
Published: (2024)
by: Dang, Hien, et al.
Published: (2024)
Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random Features
by: Bombari, Simone, et al.
Published: (2024)
by: Bombari, Simone, et al.
Published: (2024)
Supervised Contrastive Representation Learning: Landscape Analysis with Unconstrained Features
by: Behnia, Tina, et al.
Published: (2024)
by: Behnia, Tina, et al.
Published: (2024)
Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models
by: Zhang, Yihan, et al.
Published: (2022)
by: Zhang, Yihan, et al.
Published: (2022)
Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
by: Bombari, Simone, et al.
Published: (2025)
by: Bombari, Simone, et al.
Published: (2025)
Mode Collapse of Mean-Field Variational Inference
by: Sheng, Shunan, et al.
Published: (2025)
by: Sheng, Shunan, et al.
Published: (2025)
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
by: Andriopoulos, George, et al.
Published: (2025)
by: Andriopoulos, George, et al.
Published: (2025)
Generalization of Scaled Deep ResNets in the Mean-Field Regime
by: Chen, Yihang, et al.
Published: (2024)
by: Chen, Yihang, et al.
Published: (2024)
Improved Convergence of Score-Based Diffusion Models via Prediction-Correction
by: Pedrotti, Francesco, et al.
Published: (2023)
by: Pedrotti, Francesco, et al.
Published: (2023)
Matrix Denoising with Doubly Heteroscedastic Noise: Fundamental Limits and Optimal Spectral Methods
by: Zhang, Yihan, et al.
Published: (2024)
by: Zhang, Yihan, et al.
Published: (2024)
Optimal Regularization for Performative Learning
by: Cyffers, Edwige, et al.
Published: (2025)
by: Cyffers, Edwige, et al.
Published: (2025)
Geometric Analysis of Unconstrained Feature Models with $d=K$
by: Shen, Yi, et al.
Published: (2024)
by: Shen, Yi, et al.
Published: (2024)
Spectral Estimators for Multi-Index Models: Precise Asymptotics and Optimal Weak Recovery
by: Kovačević, Filip, et al.
Published: (2025)
by: Kovačević, Filip, et al.
Published: (2025)
Generalization Error of Graph Neural Networks in the Mean-field Regime
by: Aminian, Gholamali, et al.
Published: (2024)
by: Aminian, Gholamali, et al.
Published: (2024)
Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics
by: Mousavi-Hosseini, Alireza, et al.
Published: (2024)
by: Mousavi-Hosseini, Alireza, et al.
Published: (2024)
Spectral Estimators for Structured Generalized Linear Models via Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2023)
by: Zhang, Yihan, et al.
Published: (2023)
Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape
by: Kim, Juno, et al.
Published: (2024)
by: Kim, Juno, et al.
Published: (2024)
Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold
by: Rupa, Anamika Paul
Published: (2026)
by: Rupa, Anamika Paul
Published: (2026)
Optimal Estimation in Orthogonally Invariant Generalized Linear Models: Spectral Initialization and Approximate Message Passing
by: Zhang, Yihan, et al.
Published: (2026)
by: Zhang, Yihan, et al.
Published: (2026)
High-Dimensional Private Linear Regression with Optimal Rates
by: Bombari, Simone, et al.
Published: (2025)
by: Bombari, Simone, et al.
Published: (2025)
High-dimensional Analysis of Synthetic Data Selection
by: Rezaei, Parham, et al.
Published: (2025)
by: Rezaei, Parham, et al.
Published: (2025)
Average gradient outer product as a mechanism for deep neural collapse
by: Beaglehole, Daniel, et al.
Published: (2024)
by: Beaglehole, Daniel, et al.
Published: (2024)
Anti Mode-Collapse in Mean-Field Transformer via Auxiliary Variables
by: Imaizumi, Masaaki, et al.
Published: (2026)
by: Imaizumi, Masaaki, et al.
Published: (2026)
Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning
by: Kovačević, Filip, et al.
Published: (2026)
by: Kovačević, Filip, et al.
Published: (2026)
Why Loss Re-weighting Works If You Stop Early: Training Dynamics of Unconstrained Features
by: Zhao, Yize, et al.
Published: (2026)
by: Zhao, Yize, et al.
Published: (2026)
Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)
by: Nam, Yoonsoo, et al.
Published: (2025)
by: Nam, Yoonsoo, et al.
Published: (2025)
Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth
by: Kögler, Kevin, et al.
Published: (2024)
by: Kögler, Kevin, et al.
Published: (2024)
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
by: Ildiz, M. Emrullah, et al.
Published: (2024)
by: Ildiz, M. Emrullah, et al.
Published: (2024)
Asymptotics of Random Feature Regression Beyond the Linear Scaling Regime
by: Hu, Hong, et al.
Published: (2024)
by: Hu, Hong, et al.
Published: (2024)
Optimal Representation Size: High-Dimensional Analysis of Pretraining and Linear Probing
by: Njaradi, Valentina, et al.
Published: (2026)
by: Njaradi, Valentina, et al.
Published: (2026)
Similar Items
-
Improved Scaling Laws via Weak-to-Strong Generalization in Random Feature Ridge Regression
by: Wu, Diyuan, et al.
Published: (2026) -
Attention with Trained Embeddings Provably Selects Important Tokens
by: Wu, Diyuan, et al.
Published: (2025) -
Beyond Unconstrained Features: Neural Collapse for Shallow Neural Networks with General Data
by: Hong, Wanli, et al.
Published: (2024) -
Neural Collapse versus Low-rank Bias: Is Deep Neural Collapse Really Optimal?
by: Súkeník, Peter, et al.
Published: (2024) -
Privacy for Free in the Overparameterized Regime
by: Bombari, Simone, et al.
Published: (2024)