:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Zhongtian, Murfet, Daniel
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2504.18048
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Differentiation and Specialization of Attention Heads via the Refined Local Learning Coefficient
by: Wang, George, et al.
Published: (2024)

Linear Response Estimators for Singular Statistical Models
by: Elliott, Chris, et al.
Published: (2026)

The Local Learning Coefficient: A Singularity-Aware Complexity Measure
by: Lau, Edmund, et al.
Published: (2023)

Susceptibilities and Patterning: A Primer on Linear Response in Bayesian Learning
by: Elliott, Chris, et al.
Published: (2026)

Patterning: The Dual of Interpretability
by: Wang, George, et al.
Published: (2026)

Programs as Singularities
by: Murfet, Daniel, et al.
Published: (2025)

Interpreting Reinforcement Learning Agents with Susceptibilities
by: Elliott, Chris, et al.
Published: (2026)

Embryology of a Language Model
by: Wang, George, et al.
Published: (2025)

Structural Inference: Interpreting Small Language Models with Susceptibilities
by: Baker, Garrett, et al.
Published: (2025)

Stagewise Reinforcement Learning and the Geometry of the Regret Landscape
by: Elliott, Chris, et al.
Published: (2026)

Compressibility Measures Complexity: Minimum Description Length Meets Singular Learning Theory
by: Urdshals, Einar, et al.
Published: (2025)

Dynamics of Transient Structure in In-Context Linear Regression Transformers
by: Carroll, Liam, et al.
Published: (2025)

Causal Spherical Hypergraph Networks for Modelling Social Uncertainty
by: Harit, Anoushka, et al.
Published: (2025)

Towards Spectroscopy: Susceptibility Clusters in Language Models
by: Gordon, Andrew, et al.
Published: (2026)

Design Principles for Sequence Models via Coefficient Dynamics
by: Sieber, Jerome, et al.
Published: (2025)

Actionable Interpretability via Causal Hypergraphs: Unravelling Batch Size Effects in Deep Learning
by: Sun, Zhongtian, et al.
Published: (2025)

RicciFlowRec: A Geometric Root Cause Recommender Using Ricci Curvature on Financial Graphs
by: Sun, Zhongtian, et al.
Published: (2025)

From News to Returns: A Granger-Causal Hypergraph Transformer on the Sphere
by: Harit, Anoushka, et al.
Published: (2025)

Probabilistic Learning and Generation in Deep Sequence Models
by: Chen, Wenlong
Published: (2026)

Loss Landscape Degeneracy and Stagewise Development in Transformers
by: Hoogland, Jesse, et al.
Published: (2024)

Misclassification Rate and Privacy-Utility Trade-offs in Graph Convolutional Networks via Subsampling Stability
by: Zhang, Yexin, et al.
Published: (2026)

ManifoldMind: Dynamic Hyperbolic Reasoning for Trustworthy Recommendations
by: Harit, Anoushka, et al.
Published: (2025)

Matrix Completion with Hypergraphs:Sharp Thresholds and Efficient Algorithms
by: Ma, Zhongtian, et al.
Published: (2024)

Conditional Sequence Modeling for Safe Reinforcement Learning
by: Bai, Wensong, et al.
Published: (2026)

GLANCE: Graph Logic Attention Network with Cluster Enhancement for Heterophilous Graph Representation Learning
by: Sun, Zhongtian, et al.
Published: (2025)

Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models
by: Ma, Zhongtian, et al.
Published: (2024)

You Are What You Eat -- AI Alignment Requires Understanding How Data Shapes Structure and Generalisation
by: Lehalleur, Simon Pepin, et al.
Published: (2025)

Estimation of the Learning Coefficient Using Empirical Loss
by: Takio, Tatsuyoshi, et al.
Published: (2025)

An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs
by: Liu, Haolin, et al.
Published: (2025)

Learning Operators through Coefficient Mappings in Fixed Basis Spaces
by: Chen, Chuqi, et al.
Published: (2025)

Gated Graph Attention Networks with Learnable Temperature
by: Ma, Zhongtian, et al.
Published: (2026)

Varying-Coefficient Mixture of Experts Model
by: Zhao, Qicheng, et al.
Published: (2026)

Poolformer: Recurrent Networks with Pooling for Long-Sequence Modeling
by: Fernández, Daniel Gallo
Published: (2025)

Breaking Down Financial News Impact: A Novel AI Approach with Geometric Hypergraphs
by: Harit, Anoushka, et al.
Published: (2024)

Interpretable Machine Learning for Kronecker Coefficients
by: Butbaia, Giorgi, et al.
Published: (2025)

Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration
by: Wang, Zili, et al.
Published: (2026)

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
by: Cundy, Chris, et al.
Published: (2023)

Coefficient Shape Transfer Learning for Functional Linear Regression
by: Jiao, Shuhao, et al.
Published: (2025)

Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
by: Huang, Sili, et al.
Published: (2024)

MetaEformer: Unveiling and Leveraging Meta-patterns for Complex and Dynamic Systems Load Forecasting
by: Huang, Shaoyuan, et al.
Published: (2025)