Saved in:
| Main Authors: | Wadia, Neha S., Dandi, Yatin, Jordan, Michael I. |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.04877 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Computational Advantage of Depth: Learning High-Dimensional Hierarchical Functions with Gradient Descent
by: Dandi, Yatin, et al.
Published: (2025)
by: Dandi, Yatin, et al.
Published: (2025)
A mixing time bound for Gibbs sampling from log-smooth log-concave distributions
by: Wadia, Neha S.
Published: (2024)
by: Wadia, Neha S.
Published: (2024)
Deep Learning of Compositional Targets with Hierarchical Spectral Methods
by: Tabanelli, Hugo, et al.
Published: (2026)
by: Tabanelli, Hugo, et al.
Published: (2026)
Provable Learning of Random Hierarchy Models and Hierarchical Shallow-to-Deep Chaining
by: Ren, Yunwei, et al.
Published: (2026)
by: Ren, Yunwei, et al.
Published: (2026)
The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
by: Dandi, Yatin, et al.
Published: (2024)
by: Dandi, Yatin, et al.
Published: (2024)
How Two-Layer Neural Networks Learn, One (Giant) Step at a Time
by: Dandi, Yatin, et al.
Published: (2023)
by: Dandi, Yatin, et al.
Published: (2023)
Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
by: Arnaboldi, Luca, et al.
Published: (2024)
by: Arnaboldi, Luca, et al.
Published: (2024)
Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning
by: Dandi, Yatin, et al.
Published: (2026)
by: Dandi, Yatin, et al.
Published: (2026)
Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective
by: Ghio, Davide, et al.
Published: (2023)
by: Ghio, Davide, et al.
Published: (2023)
Rigorous Asymptotics for First-Order Algorithms Through the Dynamical Cavity Method
by: Dandi, Yatin, et al.
Published: (2026)
by: Dandi, Yatin, et al.
Published: (2026)
Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
by: Arnaboldi, Luca, et al.
Published: (2024)
by: Arnaboldi, Luca, et al.
Published: (2024)
Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula
by: Vilucchio, Matteo, et al.
Published: (2025)
by: Vilucchio, Matteo, et al.
Published: (2025)
Universality laws for Gaussian mixtures in generalized linear models
by: Dandi, Yatin, et al.
Published: (2023)
by: Dandi, Yatin, et al.
Published: (2023)
A Random Matrix Theory Perspective on the Spectrum of Learned Features and Asymptotic Generalization Capabilities
by: Dandi, Yatin, et al.
Published: (2024)
by: Dandi, Yatin, et al.
Published: (2024)
Scaling Laws from Sequential Feature Recovery: A Solvable Hierarchical Model
by: Wortsman-Zurich, Arie, et al.
Published: (2026)
by: Wortsman-Zurich, Arie, et al.
Published: (2026)
Fundamental limits of learning in sequence multi-index models and deep attention networks: High-dimensional asymptotics and sharp thresholds
by: Troiani, Emanuele, et al.
Published: (2025)
by: Troiani, Emanuele, et al.
Published: (2025)
Optimal Spectral Transitions in High-Dimensional Multi-Index Models
by: Defilippis, Leonardo, et al.
Published: (2025)
by: Defilippis, Leonardo, et al.
Published: (2025)
Perseus: A Simple and Optimal High-Order Method for Variational Inequalities
by: Lin, Tianyi, et al.
Published: (2022)
by: Lin, Tianyi, et al.
Published: (2022)
A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
by: Chavdarova, Tatjana, et al.
Published: (2022)
by: Chavdarova, Tatjana, et al.
Published: (2022)
A Gentle Introduction to Conformal Time Series Forecasting
by: Stocker, M., et al.
Published: (2025)
by: Stocker, M., et al.
Published: (2025)
Fundamental computational limits of weak learnability in high-dimensional multi-index models
by: Troiani, Emanuele, et al.
Published: (2024)
by: Troiani, Emanuele, et al.
Published: (2024)
A Gentle Introduction and Tutorial on Deep Generative Models in Transportation Research
by: Choi, Seongjin, et al.
Published: (2024)
by: Choi, Seongjin, et al.
Published: (2024)
Learning Variational Inequalities from Data: Fast Generalization Rates under Strong Monotonicity
by: Zhao, Eric, et al.
Published: (2024)
by: Zhao, Eric, et al.
Published: (2024)
Asymptotics of feature learning in two-layer networks after one gradient-step
by: Cui, Hugo, et al.
Published: (2024)
by: Cui, Hugo, et al.
Published: (2024)
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)
by: Marconato, Emanuele, et al.
Published: (2025)
Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization
by: Lin, Tianyi, et al.
Published: (2024)
by: Lin, Tianyi, et al.
Published: (2024)
Introduction to Machine Learning
by: Younes, Laurent
Published: (2024)
by: Younes, Laurent
Published: (2024)
Delegating Data Collection in Decentralized Machine Learning
by: Ananthakrishnan, Nivasini, et al.
Published: (2023)
by: Ananthakrishnan, Nivasini, et al.
Published: (2023)
Reduced-Rank Multi-objective Policy Learning and Optimization
by: Nwankwo, Ezinne, et al.
Published: (2024)
by: Nwankwo, Ezinne, et al.
Published: (2024)
An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints
by: Lekeufack, Jordan, et al.
Published: (2024)
by: Lekeufack, Jordan, et al.
Published: (2024)
Gentle Local Robustness implies Generalization
by: Than, Khoat, et al.
Published: (2024)
by: Than, Khoat, et al.
Published: (2024)
Gradient Equilibrium in Online Learning: Theory and Applications
by: Angelopoulos, Anastasios N., et al.
Published: (2025)
by: Angelopoulos, Anastasios N., et al.
Published: (2025)
An Integrative Genome-Scale Metabolic Modeling and Machine Learning Framework for Predicting and Optimizing Single-Cell Protein Production in Saccharomyces cerevisiae
by: Nair, Neha K., et al.
Published: (2026)
by: Nair, Neha K., et al.
Published: (2026)
Maximally-stable Local Optima in Random Graphs and Spin Glasses: Phase Transitions and Universality
by: Dandi, Yatin, et al.
Published: (2023)
by: Dandi, Yatin, et al.
Published: (2023)
Minimum Number of Monochromatic Subgraphs of a Random Graph
by: Dandi, Yatin, et al.
Published: (2026)
by: Dandi, Yatin, et al.
Published: (2026)
On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems
by: Lin, Tianyi, et al.
Published: (2019)
by: Lin, Tianyi, et al.
Published: (2019)
A Variational Estimator for $L_p$ Calibration Errors
by: Berta, Eugène, et al.
Published: (2026)
by: Berta, Eugène, et al.
Published: (2026)
Learning Gentle Grasping from Human-Free Force Control Demonstration
by: Li, Mingxuan, et al.
Published: (2024)
by: Li, Mingxuan, et al.
Published: (2024)
Stopping Rules for Stochastic Gradient Descent via Anytime-Valid Confidence Sequences
by: Aolaritei, Liviu, et al.
Published: (2025)
by: Aolaritei, Liviu, et al.
Published: (2025)
Data-Driven Influence Functions for Optimization-Based Causal Inference
by: Jordan, Michael I., et al.
Published: (2022)
by: Jordan, Michael I., et al.
Published: (2022)
Similar Items
-
The Computational Advantage of Depth: Learning High-Dimensional Hierarchical Functions with Gradient Descent
by: Dandi, Yatin, et al.
Published: (2025) -
A mixing time bound for Gibbs sampling from log-smooth log-concave distributions
by: Wadia, Neha S.
Published: (2024) -
Deep Learning of Compositional Targets with Hierarchical Spectral Methods
by: Tabanelli, Hugo, et al.
Published: (2026) -
Provable Learning of Random Hierarchy Models and Hierarchical Shallow-to-Deep Chaining
by: Ren, Yunwei, et al.
Published: (2026) -
The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
by: Dandi, Yatin, et al.
Published: (2024)