| Main Authors: | Yang, Xiguang, Arora, Krish, Bachmann, Michael |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.08341 |
Similar Items
Thermodynamics-inspired Explanations of Artificial Intelligence
by: Mehdi, Shams, et al.
Published: (2022)
Universal Scaling Laws of Absorbing Phase Transitions in Artificial Deep Neural Networks
by: Tamai, Keiichi, et al.
Published: (2023)
Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers
by: Tiberi, Lorenzo, et al.
Published: (2024)
Inference in Spreading Processes with Neural-Network Priors
by: Ghio, Davide, et al.
Published: (2025)
How do Probabilistic Graphical Models and Graph Neural Networks Look at Network Data?
by: Lapenna, Michela, et al.
Published: (2025)
Inferring Higher-Order Couplings with Neural Networks
by: Decelle, Aurélien, et al.
Published: (2025)
Statistical Mechanics and Artificial Neural Networks: Principles, Models, and Applications
by: Böttcher, Lucas, et al.
Published: (2024)
An Analytical Characterization of Sloppiness in Neural Networks: Insights from Linear Models
by: Mao, Jialin, et al.
Published: (2025)
Gaussian Universality in Neural Network Dynamics with Generalized Structured Input Distributions
by: Bae, Jaeyong, et al.
Published: (2024)
The committee machine: Computational to statistical gaps in learning a two-layers neural network
by: Aubin, Benjamin, et al.
Published: (2018)
Universal Approximation of Mean-Field Models via Transformers
by: Biswal, Shiba, et al.
Published: (2024)
Variational Neural Annealing
by: Hibat-Allah, Mohamed, et al.
Published: (2021)
Nearest-Neighbours Neural Network architecture for efficient sampling of statistical physics models
by: Del Bono, Luca Maria, et al.
Published: (2024)
Message Passing Variational Autoregressive Network for Solving Intractable Ising Models
by: Ma, Qunlong, et al.
Published: (2024)
Neural Network Matrix Product Operator: A Multi-Dimensionally Integrable Machine Learning Potential
by: Hino, Kentaro, et al.
Published: (2024)
Dataset-learning duality and emergent criticality
by: Kukleva, Ekaterina, et al.
Published: (2024)
Thermodynamics of bidirectional associative memories
by: Barra, Adriano, et al.
Published: (2022)
Amorphous Solid Model of Vectorial Hopfield Neural Networks
by: Gallavotti, F., et al.
Published: (2025)
Spectral Architecture Search for Neural Network Models
by: Peri, Gianluca, et al.
Published: (2025)
From latent dynamics to meaningful representations
by: Wang, Dedi, et al.
Published: (2022)
Quantum Next Generation Reservoir Computing: An Efficient Quantum Algorithm for Forecasting Quantum Dynamics
by: Sornsaeng, Apimuk, et al.
Published: (2023)
Entropic Confinement and Mode Connectivity in Overparameterized Neural Networks
by: Di Carlo, Luca, et al.
Published: (2025)
Generative artificial intelligence for computational chemistry: a roadmap to predicting emergent phenomena
by: Tiwary, Pratyush, et al.
Published: (2024)
A Generative Neural Annealer for Black-Box Combinatorial Optimization
by: Zhang, Yuan-Hang, et al.
Published: (2025)
Sparse Autoregressive Neural Networks for Classical Spin Systems
by: Biazzo, Indaco, et al.
Published: (2024)
The Symmetric Perceptron: a Teacher-Student Scenario
by: Catania, Giovanni, et al.
Published: (2026)
Neural population geometry and optimal coding of tasks with shared latent structure
by: Wakhloo, Albert J., et al.
Published: (2024)
Overparametrization bends the landscape: BBP transitions at initialization in simple Neural Networks
by: Annesi, Brandon Livio, et al.
Published: (2025)
Discrete generative diffusion models without stochastic differential equations: a tensor network approach
by: Causer, Luke, et al.
Published: (2024)
Weight fluctuations in (deep) linear neural networks and a derivation of the inverse-variance flatness relation
by: Gross, Markus, et al.
Published: (2023)
Estimating Global Input Relevance and Enforcing Sparse Representations with a Scalable Spectral Neural Network Approach
by: Chicchi, Lorenzo, et al.
Published: (2024)
Two failure modes of deep transformers and how to avoid them: a unified theory of signal propagation at initialisation
by: Giorlandino, Alessio, et al.
Published: (2025)
Mapping of attention mechanisms to a generalized Potts model
by: Rende, Riccardo, et al.
Published: (2023)
Reinforced Disentanglers on Random Unitary Circuits
by: Bao, Ning, et al.
Published: (2024)
Maximum Likelihood Decoding of Quantum Error Correction Codes
by: Cao, Hanyan, et al.
Published: (2026)
Dissipation alters modes of information encoding in small quantum reservoirs near criticality
by: Cheamsawat, Krai, et al.
Published: (2024)
Analytic theory of dropout regularization
by: Mori, Francesco, et al.
Published: (2025)
The Physics of Data and Tasks: Theories of Locality and Compositionality in Deep Learning
by: Favero, Alessandro
Published: (2025)
A theoretical framework for overfitting in energy-based modeling
by: Catania, Giovanni, et al.
Published: (2025)
Uncertainty in AI-driven Monte Carlo simulations
by: Tzivrailis, Dimitrios, et al.
Published: (2025)