Saved in:
| Main Authors: | Ranabhat, Nishan, Javanparast, Behnam, Goerz, David, Inack, Estelle |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.07159 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Variational Neural Annealing
by: Hibat-Allah, Mohamed, et al.
Published: (2021)
by: Hibat-Allah, Mohamed, et al.
Published: (2021)
Sharp feature-learning transitions and Bayes-optimal neural scaling laws in extensive-width networks
by: Nguyen, Minh-Toan, et al.
Published: (2026)
by: Nguyen, Minh-Toan, et al.
Published: (2026)
Coding schemes in neural networks learning classification tasks
by: van Meegen, Alexander, et al.
Published: (2024)
by: van Meegen, Alexander, et al.
Published: (2024)
Spring-block theory of feature learning in deep neural networks
by: Shi, Cheng, et al.
Published: (2024)
by: Shi, Cheng, et al.
Published: (2024)
On the role of non-linear latent features in bipartite generative neural networks
by: Bonnaire, Tony, et al.
Published: (2025)
by: Bonnaire, Tony, et al.
Published: (2025)
The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems
by: Biazzo, Indaco
Published: (2023)
by: Biazzo, Indaco
Published: (2023)
Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective
by: Ghio, Davide, et al.
Published: (2023)
by: Ghio, Davide, et al.
Published: (2023)
Dynamic neuron approach to deep neural networks: Decoupling neurons for renormalization group analysis
by: Lee, Donghee, et al.
Published: (2024)
by: Lee, Donghee, et al.
Published: (2024)
Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation
by: Gross, Markus, et al.
Published: (2023)
by: Gross, Markus, et al.
Published: (2023)
A theoretical perspective on mode collapse in variational inference
by: Soletskyi, Roman, et al.
Published: (2024)
by: Soletskyi, Roman, et al.
Published: (2024)
Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems
by: Angelini, Maria Chiara, et al.
Published: (2023)
by: Angelini, Maria Chiara, et al.
Published: (2023)
Exact finite-size scaling of the maximum likelihood spectra in the quenched and annealed Sherrington-Kirkpatrick spin glass
by: Wang, Ding, et al.
Published: (2024)
by: Wang, Ding, et al.
Published: (2024)
Rare Event Analysis of Large Language Models
by: Dorman, Jake McAllister, et al.
Published: (2026)
by: Dorman, Jake McAllister, et al.
Published: (2026)
The geometry and dynamics of annealed optimization in the coherent Ising machine with hidden and planted solutions
by: Ghimenti, Federico, et al.
Published: (2025)
by: Ghimenti, Federico, et al.
Published: (2025)
A Fourier perspective on the learning dynamics of neural networks: from sample complexities to mechanistic insights
by: Ricci, Fabiola, et al.
Published: (2026)
by: Ricci, Fabiola, et al.
Published: (2026)
Generalization vs. Specialization under Concept Shift
by: Nguyen, Alex, et al.
Published: (2024)
by: Nguyen, Alex, et al.
Published: (2024)
Geometry and universal scaling of Pareto-optimal signal compression
by: Berx, Jonas
Published: (2025)
by: Berx, Jonas
Published: (2025)
Sampling diverse near-optimal solutions via algorithmic quantum annealing
by: Mohseni, Masoud, et al.
Published: (2021)
by: Mohseni, Masoud, et al.
Published: (2021)
Exact solution of Dynamical Mean-Field Theory for a linear system with annealed disorder
by: Ferraro, Francesco, et al.
Published: (2024)
by: Ferraro, Francesco, et al.
Published: (2024)
Statistical mechanics of extensive-width Bayesian neural networks near interpolation
by: Barbier, Jean, et al.
Published: (2025)
by: Barbier, Jean, et al.
Published: (2025)
A statistical physics framework for optimal learning
by: Mignacco, Francesca, et al.
Published: (2025)
by: Mignacco, Francesca, et al.
Published: (2025)
Drift-Diffusion Matching: Embedding dynamics in latent manifolds of asymmetric neural networks
by: Nartallo-Kaluarachchi, Ramón, et al.
Published: (2026)
by: Nartallo-Kaluarachchi, Ramón, et al.
Published: (2026)
Analytic theory of dropout regularization
by: Mori, Francesco, et al.
Published: (2025)
by: Mori, Francesco, et al.
Published: (2025)
The Copycat Perceptron: Smashing Barriers Through Collective Learning
by: Catania, Giovanni, et al.
Published: (2023)
by: Catania, Giovanni, et al.
Published: (2023)
Fundamental operating regimes, hyper-parameter fine-tuning and glassiness: towards an interpretable replica-theory for trained restricted Boltzmann machines
by: Fachechi, Alberto, et al.
Published: (2024)
by: Fachechi, Alberto, et al.
Published: (2024)
Distinct mechanisms underlying in-context learning in transformers
by: Gibson, Cole, et al.
Published: (2026)
by: Gibson, Cole, et al.
Published: (2026)
Interpreting the Synchronization Gap: The Hidden Mechanism Inside Diffusion Transformers
by: Albrychiewicz, Emil, et al.
Published: (2026)
by: Albrychiewicz, Emil, et al.
Published: (2026)
Optimal Protocols for Continual Learning via Statistical Physics and Control Theory
by: Mori, Francesco, et al.
Published: (2024)
by: Mori, Francesco, et al.
Published: (2024)
Universal Scaling Laws of Absorbing Phase Transitions in Artificial Deep Neural Networks
by: Tamai, Keiichi, et al.
Published: (2023)
by: Tamai, Keiichi, et al.
Published: (2023)
Two failure modes of deep transformers and how to avoid them: a unified theory of signal propagation at initialisation
by: Giorlandino, Alessio, et al.
Published: (2025)
by: Giorlandino, Alessio, et al.
Published: (2025)
Inferring Higher-Order Couplings with Neural Networks
by: Decelle, Aurélien, et al.
Published: (2025)
by: Decelle, Aurélien, et al.
Published: (2025)
Explaining the effects of non-convergent sampling in the training of Energy-Based Models
by: Agoritsas, Elisabeth, et al.
Published: (2023)
by: Agoritsas, Elisabeth, et al.
Published: (2023)
Discrete generative diffusion models without stochastic differential equations: a tensor network approach
by: Causer, Luke, et al.
Published: (2024)
by: Causer, Luke, et al.
Published: (2024)
Factual recall in linear associative memories: sharp asymptotics and mechanistic insights
by: Giorlandino, Alessio, et al.
Published: (2026)
by: Giorlandino, Alessio, et al.
Published: (2026)
The Physics of Data and Tasks: Theories of Locality and Compositionality in Deep Learning
by: Favero, Alessandro
Published: (2025)
by: Favero, Alessandro
Published: (2025)
Escape dynamics and implicit bias of one-pass SGD in overparameterized quadratic networks
by: Bocchi, Dario, et al.
Published: (2026)
by: Bocchi, Dario, et al.
Published: (2026)
Inferring effective couplings with Restricted Boltzmann Machines
by: Decelle, Aurélien, et al.
Published: (2023)
by: Decelle, Aurélien, et al.
Published: (2023)
From Classical to Quantum: Extending Prometheus for Unsupervised Discovery of Phase Transitions in Three Dimensions and Quantum Systems
by: Yee, Brandon, et al.
Published: (2026)
by: Yee, Brandon, et al.
Published: (2026)
Transfer Learning in $\ell_1$ Regularized Regression: Hyperparameter Selection Strategy based on Sharp Asymptotic Analysis
by: Okajima, Koki, et al.
Published: (2024)
by: Okajima, Koki, et al.
Published: (2024)
A theoretical framework for overfitting in energy-based modeling
by: Catania, Giovanni, et al.
Published: (2025)
by: Catania, Giovanni, et al.
Published: (2025)
Similar Items
-
Variational Neural Annealing
by: Hibat-Allah, Mohamed, et al.
Published: (2021) -
Sharp feature-learning transitions and Bayes-optimal neural scaling laws in extensive-width networks
by: Nguyen, Minh-Toan, et al.
Published: (2026) -
Coding schemes in neural networks learning classification tasks
by: van Meegen, Alexander, et al.
Published: (2024) -
Spring-block theory of feature learning in deep neural networks
by: Shi, Cheng, et al.
Published: (2024) -
On the role of non-linear latent features in bipartite generative neural networks
by: Bonnaire, Tony, et al.
Published: (2025)