Saved in:
| Main Authors: | Baek, David D., Liu, Ziming, Tegmark, Max |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.05916 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Understanding Distilled Reasoning Models: A Representational Approach
by: Baek, David D., et al.
Published: (2025)
by: Baek, David D., et al.
Published: (2025)
Harmonic Loss Trains Interpretable AI Models
by: Baek, David D., et al.
Published: (2025)
by: Baek, David D., et al.
Published: (2025)
A Neural Scaling Law from Lottery Ticket Ensembling
by: Liu, Ziming, et al.
Published: (2023)
by: Liu, Ziming, et al.
Published: (2023)
OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration
by: Kantamneni, Subhash, et al.
Published: (2024)
by: Kantamneni, Subhash, et al.
Published: (2024)
Do Two AI Scientists Agree?
by: Fu, Xinghong, et al.
Published: (2025)
by: Fu, Xinghong, et al.
Published: (2025)
Investigating Representation Universality: Case Study on Genealogical Representations
by: Baek, David D., et al.
Published: (2024)
by: Baek, David D., et al.
Published: (2024)
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator
by: Kantamneni, Subhash, et al.
Published: (2024)
by: Kantamneni, Subhash, et al.
Published: (2024)
Neural Thermodynamic Laws for Large Language Model Training
by: Liu, Ziming, et al.
Published: (2025)
by: Liu, Ziming, et al.
Published: (2025)
A Resource Model For Neural Scaling Law
by: Song, Jinyeop, et al.
Published: (2024)
by: Song, Jinyeop, et al.
Published: (2024)
The Quantization Model of Neural Scaling
by: Michaud, Eric J., et al.
Published: (2023)
by: Michaud, Eric J., et al.
Published: (2023)
Scaling Laws For Scalable Oversight
by: Engels, Joshua, et al.
Published: (2025)
by: Engels, Joshua, et al.
Published: (2025)
Language Models Represent Space and Time
by: Gurnee, Wes, et al.
Published: (2023)
by: Gurnee, Wes, et al.
Published: (2023)
Language Models Use Trigonometry to Do Addition
by: Kantamneni, Subhash, et al.
Published: (2025)
by: Kantamneni, Subhash, et al.
Published: (2025)
Physics of Skill Learning
by: Liu, Ziming, et al.
Published: (2025)
by: Liu, Ziming, et al.
Published: (2025)
Survival of the Fittest Representation: A Case Study with Modular Addition
by: Ding, Xiaoman Delores, et al.
Published: (2024)
by: Ding, Xiaoman Delores, et al.
Published: (2024)
KAN 2.0: Kolmogorov-Arnold Networks Meet Science
by: Liu, Ziming, et al.
Published: (2024)
by: Liu, Ziming, et al.
Published: (2024)
Low-Rank Adapting Models for Sparse Autoencoders
by: Chen, Matthew, et al.
Published: (2025)
by: Chen, Matthew, et al.
Published: (2025)
The Geometry of Concepts: Sparse Autoencoder Feature Structure
by: Li, Yuxiao, et al.
Published: (2024)
by: Li, Yuxiao, et al.
Published: (2024)
Decomposing The Dark Matter of Sparse Autoencoders
by: Engels, Joshua, et al.
Published: (2024)
by: Engels, Joshua, et al.
Published: (2024)
On the creation of narrow AI: hierarchy and nonlocality of neural network skills
by: Michaud, Eric J., et al.
Published: (2025)
by: Michaud, Eric J., et al.
Published: (2025)
Opening the AI black box: program synthesis via mechanistic interpretability
by: Michaud, Eric J., et al.
Published: (2024)
by: Michaud, Eric J., et al.
Published: (2024)
Not All Language Model Features Are One-Dimensionally Linear
by: Engels, Joshua, et al.
Published: (2024)
by: Engels, Joshua, et al.
Published: (2024)
KAN: Kolmogorov-Arnold Networks
by: Liu, Ziming, et al.
Published: (2024)
by: Liu, Ziming, et al.
Published: (2024)
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
by: Kantamneni, Subhash, et al.
Published: (2025)
by: Kantamneni, Subhash, et al.
Published: (2025)
The Remarkable Robustness of LLMs: Stages of Inference?
by: Lad, Vedang, et al.
Published: (2024)
by: Lad, Vedang, et al.
Published: (2024)
H-EFT-VA: An Effective-Field-Theory Variational Ansatz with Provable Barren Plateau Avoidance
by: Hamid, Eyad I. B
Published: (2026)
by: Hamid, Eyad I. B
Published: (2026)
Efficient Dictionary Learning with Switch Sparse Autoencoders
by: Mudide, Anish, et al.
Published: (2024)
by: Mudide, Anish, et al.
Published: (2024)
Understanding Generative AI Content with Embedding Models
by: Vargas, Max, et al.
Published: (2024)
by: Vargas, Max, et al.
Published: (2024)
Understanding Generalization in Role-Playing Models via Information Theory
by: Li, Yongqi, et al.
Published: (2025)
by: Li, Yongqi, et al.
Published: (2025)
Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models
by: Ren, Zirui, et al.
Published: (2026)
by: Ren, Zirui, et al.
Published: (2026)
Numerical Fragility in Transformers: A Layer-wise Theory for Explaining, Forecasting, and Mitigating Instability
by: Baek, Jinwoo
Published: (2025)
by: Baek, Jinwoo
Published: (2025)
Ctx2TrajGen: Traffic Context-Aware Microscale Vehicle Trajectories using Generative Adversarial Imitation Learning
by: Jin, Joobin, et al.
Published: (2025)
by: Jin, Joobin, et al.
Published: (2025)
TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits
by: Wei, Ziming, et al.
Published: (2025)
by: Wei, Ziming, et al.
Published: (2025)
Optimal Control of Probabilistic Dynamics Models via Mean Hamiltonian Minimization
by: Leeftink, David, et al.
Published: (2025)
by: Leeftink, David, et al.
Published: (2025)
Dense SAE Latents Are Features, Not Bugs
by: Sun, Xiaoqing, et al.
Published: (2025)
by: Sun, Xiaoqing, et al.
Published: (2025)
KoopGen: Koopman Generator Networks for Representing and Predicting Dynamical Systems with Continuous Spectra
by: Su, Liangyu, et al.
Published: (2026)
by: Su, Liangyu, et al.
Published: (2026)
Static and Dynamic Approaches to Computing Barycenters of Probability Measures on Graphs
by: Gentile, David, et al.
Published: (2026)
by: Gentile, David, et al.
Published: (2026)
GenTL: A General Transfer Learning Model for Building Thermal Dynamics
by: Raisch, Fabian, et al.
Published: (2025)
by: Raisch, Fabian, et al.
Published: (2025)
SynCoGen: Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling
by: Rekesh, Andrei, et al.
Published: (2025)
by: Rekesh, Andrei, et al.
Published: (2025)
Stock Prediction via a Dual Relation Fusion Network incorporating Static and Dynamic Relations
by: Chen, Long, et al.
Published: (2025)
by: Chen, Long, et al.
Published: (2025)
Similar Items
-
Towards Understanding Distilled Reasoning Models: A Representational Approach
by: Baek, David D., et al.
Published: (2025) -
Harmonic Loss Trains Interpretable AI Models
by: Baek, David D., et al.
Published: (2025) -
A Neural Scaling Law from Lottery Ticket Ensembling
by: Liu, Ziming, et al.
Published: (2023) -
OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration
by: Kantamneni, Subhash, et al.
Published: (2024) -
Do Two AI Scientists Agree?
by: Fu, Xinghong, et al.
Published: (2025)