Saved in:
| Main Authors: | Chen, Zhongtian, Murfet, Daniel |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.18048 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient
by: Wang, George, et al.
Published: (2024)
by: Wang, George, et al.
Published: (2024)
Linear Response Estimators for Singular Statistical Models
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
The Local Learning Coefficient: A Singularity-Aware Complexity Measure
by: Lau, Edmund, et al.
Published: (2023)
by: Lau, Edmund, et al.
Published: (2023)
Susceptibilities and Patterning: A Primer on Linear Response in Bayesian Learning
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
Patterning: The Dual of Interpretability
by: Wang, George, et al.
Published: (2026)
by: Wang, George, et al.
Published: (2026)
Programs as Singularities
by: Murfet, Daniel, et al.
Published: (2025)
by: Murfet, Daniel, et al.
Published: (2025)
Interpreting Reinforcement Learning Agents with Susceptibilities
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
Embryology of a Language Model
by: Wang, George, et al.
Published: (2025)
by: Wang, George, et al.
Published: (2025)
Structural Inference: Interpreting Small Language Models with Susceptibilities
by: Baker, Garrett, et al.
Published: (2025)
by: Baker, Garrett, et al.
Published: (2025)
Stagewise Reinforcement Learning and the Geometry of the Regret Landscape
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
Compressibility Measures Complexity: Minimum Description Length Meets Singular Learning Theory
by: Urdshals, Einar, et al.
Published: (2025)
by: Urdshals, Einar, et al.
Published: (2025)
Dynamics of Transient Structure in In-Context Linear Regression Transformers
by: Carroll, Liam, et al.
Published: (2025)
by: Carroll, Liam, et al.
Published: (2025)
Causal Spherical Hypergraph Networks for Modelling Social Uncertainty
by: Harit, Anoushka, et al.
Published: (2025)
by: Harit, Anoushka, et al.
Published: (2025)
Towards Spectroscopy: Susceptibility Clusters in Language Models
by: Gordon, Andrew, et al.
Published: (2026)
by: Gordon, Andrew, et al.
Published: (2026)
Design Principles for Sequence Models via Coefficient Dynamics
by: Sieber, Jerome, et al.
Published: (2025)
by: Sieber, Jerome, et al.
Published: (2025)
Actionable Interpretability via Causal Hypergraphs: Unravelling Batch Size Effects in Deep Learning
by: Sun, Zhongtian, et al.
Published: (2025)
by: Sun, Zhongtian, et al.
Published: (2025)
RicciFlowRec: A Geometric Root Cause Recommender Using Ricci Curvature on Financial Graphs
by: Sun, Zhongtian, et al.
Published: (2025)
by: Sun, Zhongtian, et al.
Published: (2025)
From News to Returns: A Granger-Causal Hypergraph Transformer on the Sphere
by: Harit, Anoushka, et al.
Published: (2025)
by: Harit, Anoushka, et al.
Published: (2025)
Probabilistic Learning and Generation in Deep Sequence Models
by: Chen, Wenlong
Published: (2026)
by: Chen, Wenlong
Published: (2026)
Loss Landscape Degeneracy and Stagewise Development in Transformers
by: Hoogland, Jesse, et al.
Published: (2024)
by: Hoogland, Jesse, et al.
Published: (2024)
Misclassification Rate and Privacy-Utility Trade-offs in Graph Convolutional Networks via Subsampling Stability
by: Zhang, Yexin, et al.
Published: (2026)
by: Zhang, Yexin, et al.
Published: (2026)
ManifoldMind: Dynamic Hyperbolic Reasoning for Trustworthy Recommendations
by: Harit, Anoushka, et al.
Published: (2025)
by: Harit, Anoushka, et al.
Published: (2025)
Matrix Completion with Hypergraphs:Sharp Thresholds and Efficient Algorithms
by: Ma, Zhongtian, et al.
Published: (2024)
by: Ma, Zhongtian, et al.
Published: (2024)
Conditional Sequence Modeling for Safe Reinforcement Learning
by: Bai, Wensong, et al.
Published: (2026)
by: Bai, Wensong, et al.
Published: (2026)
GLANCE: Graph Logic Attention Network with Cluster Enhancement for Heterophilous Graph Representation Learning
by: Sun, Zhongtian, et al.
Published: (2025)
by: Sun, Zhongtian, et al.
Published: (2025)
Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models
by: Ma, Zhongtian, et al.
Published: (2024)
by: Ma, Zhongtian, et al.
Published: (2024)
You Are What You Eat -- AI Alignment Requires Understanding How Data Shapes Structure and Generalisation
by: Lehalleur, Simon Pepin, et al.
Published: (2025)
by: Lehalleur, Simon Pepin, et al.
Published: (2025)
Estimation of the Learning Coefficient Using Empirical Loss
by: Takio, Tatsuyoshi, et al.
Published: (2025)
by: Takio, Tatsuyoshi, et al.
Published: (2025)
An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs
by: Liu, Haolin, et al.
Published: (2025)
by: Liu, Haolin, et al.
Published: (2025)
Learning Operators through Coefficient Mappings in Fixed Basis Spaces
by: Chen, Chuqi, et al.
Published: (2025)
by: Chen, Chuqi, et al.
Published: (2025)
Gated Graph Attention Networks with Learnable Temperature
by: Ma, Zhongtian, et al.
Published: (2026)
by: Ma, Zhongtian, et al.
Published: (2026)
Varying-Coefficient Mixture of Experts Model
by: Zhao, Qicheng, et al.
Published: (2026)
by: Zhao, Qicheng, et al.
Published: (2026)
Poolformer: Recurrent Networks with Pooling for Long-Sequence Modeling
by: Fernández, Daniel Gallo
Published: (2025)
by: Fernández, Daniel Gallo
Published: (2025)
Breaking Down Financial News Impact: A Novel AI Approach with Geometric Hypergraphs
by: Harit, Anoushka, et al.
Published: (2024)
by: Harit, Anoushka, et al.
Published: (2024)
Interpretable Machine Learning for Kronecker Coefficients
by: Butbaia, Giorgi, et al.
Published: (2025)
by: Butbaia, Giorgi, et al.
Published: (2025)
Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration
by: Wang, Zili, et al.
Published: (2026)
by: Wang, Zili, et al.
Published: (2026)
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
by: Cundy, Chris, et al.
Published: (2023)
by: Cundy, Chris, et al.
Published: (2023)
Coefficient Shape Transfer Learning for Functional Linear Regression
by: Jiao, Shuhao, et al.
Published: (2025)
by: Jiao, Shuhao, et al.
Published: (2025)
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
by: Huang, Sili, et al.
Published: (2024)
by: Huang, Sili, et al.
Published: (2024)
MetaEformer: Unveiling and Leveraging Meta-patterns for Complex and Dynamic Systems Load Forecasting
by: Huang, Shaoyuan, et al.
Published: (2025)
by: Huang, Shaoyuan, et al.
Published: (2025)
Similar Items
-
Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient
by: Wang, George, et al.
Published: (2024) -
Linear Response Estimators for Singular Statistical Models
by: Elliott, Chris, et al.
Published: (2026) -
The Local Learning Coefficient: A Singularity-Aware Complexity Measure
by: Lau, Edmund, et al.
Published: (2023) -
Susceptibilities and Patterning: A Primer on Linear Response in Bayesian Learning
by: Elliott, Chris, et al.
Published: (2026) -
Patterning: The Dual of Interpretability
by: Wang, George, et al.
Published: (2026)