:: Library Catalog

Bibliographic Details
Main Authors:	Baek, David D., Liu, Ziming, Tegmark, Max
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.05916

Similar Items

Towards Understanding Distilled Reasoning Models: A Representational Approach
by: Baek, David D., et al.
Published: (2025)

Harmonic Loss Trains Interpretable AI Models
by: Baek, David D., et al.
Published: (2025)

A Neural Scaling Law from Lottery Ticket Ensembling
by: Liu, Ziming, et al.
Published: (2023)

OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration
by: Kantamneni, Subhash, et al.
Published: (2024)

Do Two AI Scientists Agree?
by: Fu, Xinghong, et al.
Published: (2025)

Investigating Representation Universality: Case Study on Genealogical Representations
by: Baek, David D., et al.
Published: (2024)

How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
by: Kantamneni, Subhash, et al.
Published: (2024)

Neural Thermodynamic Laws for Large Language Model Training
by: Liu, Ziming, et al.
Published: (2025)

A Resource Model For Neural Scaling Law
by: Song, Jinyeop, et al.
Published: (2024)

The Quantization Model of Neural Scaling
by: Michaud, Eric J., et al.
Published: (2023)

Scaling Laws For Scalable Oversight
by: Engels, Joshua, et al.
Published: (2025)

Language Models Represent Space and Time
by: Gurnee, Wes, et al.
Published: (2023)

Language Models Use Trigonometry to Do Addition
by: Kantamneni, Subhash, et al.
Published: (2025)

Physics of Skill Learning
by: Liu, Ziming, et al.
Published: (2025)

Survival of the Fittest Representation: A Case Study with Modular Addition
by: Ding, Xiaoman Delores, et al.
Published: (2024)

KAN 2.0: Kolmogorov-Arnold Networks Meet Science
by: Liu, Ziming, et al.
Published: (2024)

Low-Rank Adapting Models for Sparse Autoencoders
by: Chen, Matthew, et al.
Published: (2025)

The Geometry of Concepts: Sparse Autoencoder Feature Structure
by: Li, Yuxiao, et al.
Published: (2024)

Decomposing The Dark Matter of Sparse Autoencoders
by: Engels, Joshua, et al.
Published: (2024)

On the creation of narrow AI: hierarchy and nonlocality of neural network skills
by: Michaud, Eric J., et al.
Published: (2025)

Opening the AI black box: program synthesis via mechanistic interpretability
by: Michaud, Eric J., et al.
Published: (2024)

Not All Language Model Features Are One-Dimensionally Linear
by: Engels, Joshua, et al.
Published: (2024)

KAN: Kolmogorov-Arnold Networks
by: Liu, Ziming, et al.
Published: (2024)

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
by: Kantamneni, Subhash, et al.
Published: (2025)

The Remarkable Robustness of LLMs: Stages of Inference?
by: Lad, Vedang, et al.
Published: (2024)

H-EFT-VA: An Effective-Field-Theory Variational Ansatz with Provable Barren Plateau Avoidance
by: Hamid, Eyad I. B
Published: (2026)

Efficient Dictionary Learning with Switch Sparse Autoencoders
by: Mudide, Anish, et al.
Published: (2024)

Understanding Generative AI Content with Embedding Models
by: Vargas, Max, et al.
Published: (2024)

Understanding Generalization in Role-Playing Models via Information Theory
by: Li, Yongqi, et al.
Published: (2025)

Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models
by: Ren, Zirui, et al.
Published: (2026)

Numerical Fragility in Transformers: A Layer-wise Theory for Explaining, Forecasting, and Mitigating Instability
by: Baek, Jinwoo
Published: (2025)

Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning
by: Jin, Joobin, et al.
Published: (2025)

TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits
by: Wei, Ziming, et al.
Published: (2025)

Optimal Control of Probabilistic Dynamics Models via Mean Hamiltonian Minimization
by: Leeftink, David, et al.
Published: (2025)

Dense SAE Latents Are Features, Not Bugs
by: Sun, Xiaoqing, et al.
Published: (2025)

KoopGen: Koopman Generator Networks for Representing and Predicting Dynamical Systems with Continuous Spectra
by: Su, Liangyu, et al.
Published: (2026)

Static and Dynamic Approaches to Computing Barycenters of Probability Measures on Graphs
by: Gentile, David, et al.
Published: (2026)

GenTL: A General Transfer Learning Model for Building Thermal Dynamics
by: Raisch, Fabian, et al.
Published: (2025)

SynCoGen: Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling
by: Rekesh, Andrei, et al.
Published: (2025)

Stock Prediction via a Dual Relation Fusion Network incorporating Static and Dynamic Relations
by: Chen, Long, et al.
Published: (2025)