Saved in:
| Main Authors: | Gurnee, Wes, Ameisen, Emmanuel, Kauvar, Isaac, Tarng, Julius, Pearce, Adam, Olah, Chris, Batson, Joshua |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.04480 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Emotion Concepts and their Function in a Large Language Model
by: Sofroniew, Nicholas, et al.
Published: (2026)
by: Sofroniew, Nicholas, et al.
Published: (2026)
Language Models Represent Space and Time
by: Gurnee, Wes, et al.
Published: (2023)
by: Gurnee, Wes, et al.
Published: (2023)
Not All Language Model Features Are One-Dimensionally Linear
by: Engels, Joshua, et al.
Published: (2024)
by: Engels, Joshua, et al.
Published: (2024)
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
by: Hutson, Miles, et al.
Published: (2024)
by: Hutson, Miles, et al.
Published: (2024)
The Remarkable Robustness of LLMs: Stages of Inference?
by: Lad, Vedang, et al.
Published: (2024)
by: Lad, Vedang, et al.
Published: (2024)
Refusal in Language Models Is Mediated by a Single Direction
by: Arditi, Andy, et al.
Published: (2024)
by: Arditi, Andy, et al.
Published: (2024)
Confidence Regulation Neurons in Language Models
by: Stolfo, Alessandro, et al.
Published: (2024)
by: Stolfo, Alessandro, et al.
Published: (2024)
Universal Neurons in GPT2 Language Models
by: Gurnee, Wes, et al.
Published: (2024)
by: Gurnee, Wes, et al.
Published: (2024)
Mechanisms of Introspective Awareness
by: Macar, Uzay, et al.
Published: (2026)
by: Macar, Uzay, et al.
Published: (2026)
When Scores Learn Geometry: Rate Separations under the Manifold Hypothesis
by: Li, Xiang, et al.
Published: (2025)
by: Li, Xiang, et al.
Published: (2025)
The Geometry of Self-Verification in a Task-Specific Reasoning Model
by: Lee, Andrew, et al.
Published: (2025)
by: Lee, Andrew, et al.
Published: (2025)
Guided Manifold Alignment with Geometry-Regularized Twin Autoencoders
by: Rhodes, Jake S., et al.
Published: (2025)
by: Rhodes, Jake S., et al.
Published: (2025)
GeoERM: Geometry-Aware Multi-Task Representation Learning on Riemannian Manifolds
by: Chen, Aoran, et al.
Published: (2025)
by: Chen, Aoran, et al.
Published: (2025)
Auditing language models for hidden objectives
by: Marks, Samuel, et al.
Published: (2025)
by: Marks, Samuel, et al.
Published: (2025)
Exploring the Manifold of Neural Networks Using Diffusion Geometry
by: Abel, Elliott, et al.
Published: (2024)
by: Abel, Elliott, et al.
Published: (2024)
Combatting Gerrymandering with Ranked Choice Voting: An Experimental Analysis of Multi-member Districts in the United States
by: Garg, Nikhil, et al.
Published: (2021)
by: Garg, Nikhil, et al.
Published: (2021)
Staying on the Manifold: Geometry-Aware Noise Injection
by: Jacobsen, Albert Kjøller, et al.
Published: (2025)
by: Jacobsen, Albert Kjøller, et al.
Published: (2025)
Manifold Diffusion Fields
by: Elhag, Ahmed A., et al.
Published: (2023)
by: Elhag, Ahmed A., et al.
Published: (2023)
Latent Planning Emerges with Scale
by: Hanna, Michael, et al.
Published: (2026)
by: Hanna, Michael, et al.
Published: (2026)
Geometry-Aware Uncertainty Quantification via Conformal Prediction on Manifolds
by: Shahbazi, Marzieh Amiri, et al.
Published: (2026)
by: Shahbazi, Marzieh Amiri, et al.
Published: (2026)
Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry
by: Wang, Rui, et al.
Published: (2025)
by: Wang, Rui, et al.
Published: (2025)
Random Forest-Supervised Manifold Alignment
by: Rhodes, Jake S., et al.
Published: (2024)
by: Rhodes, Jake S., et al.
Published: (2024)
The Geometry of Grokking: Norm Minimization on the Zero-Loss Manifold
by: Musat, Tiberiu
Published: (2025)
by: Musat, Tiberiu
Published: (2025)
When Can Transformers Count to n?
by: Yehudai, Gilad, et al.
Published: (2024)
by: Yehudai, Gilad, et al.
Published: (2024)
Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior
by: Wurgaft, Daniel, et al.
Published: (2026)
by: Wurgaft, Daniel, et al.
Published: (2026)
Geometry-Preserving Neural Architectures on Manifolds with Boundary
by: Elamvazhuthi, Karthik, et al.
Published: (2026)
by: Elamvazhuthi, Karthik, et al.
Published: (2026)
Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive
by: Farghly, Tyler, et al.
Published: (2025)
by: Farghly, Tyler, et al.
Published: (2025)
Large Language-Geometry Model: When LLM meets Equivariance
by: Li, Zongzhao, et al.
Published: (2025)
by: Li, Zongzhao, et al.
Published: (2025)
Graph Integration for Diffusion-Based Manifold Alignment
by: Rhodes, Jake S., et al.
Published: (2024)
by: Rhodes, Jake S., et al.
Published: (2024)
Vector Symbolic Algebras for the Abstraction and Reasoning Corpus
by: Joffe, Isaac, et al.
Published: (2025)
by: Joffe, Isaac, et al.
Published: (2025)
When can transformers reason with abstract symbols?
by: Boix-Adsera, Enric, et al.
Published: (2023)
by: Boix-Adsera, Enric, et al.
Published: (2023)
Score-based Pullback Riemannian Geometry: Extracting the Data Manifold Geometry using Anisotropic Flows
by: Diepeveen, Willem, et al.
Published: (2024)
by: Diepeveen, Willem, et al.
Published: (2024)
Learning When to Switch: Adaptive Policy Selection via Reinforcement Learning
by: Tava, Chris
Published: (2025)
by: Tava, Chris
Published: (2025)
Discovering Data Manifold Geometry via Non-Contracting Flows
by: Vigouroux, David, et al.
Published: (2026)
by: Vigouroux, David, et al.
Published: (2026)
Manifold-Matching Autoencoders
by: Cheret, Laurent, et al.
Published: (2026)
by: Cheret, Laurent, et al.
Published: (2026)
Understanding When Poisson Log-Normal Models Outperform Penalized Poisson Regression for Microbiome Count Data
by: Agyapong, Daniel, et al.
Published: (2026)
by: Agyapong, Daniel, et al.
Published: (2026)
Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks
by: Piland, Jacob, et al.
Published: (2025)
by: Piland, Jacob, et al.
Published: (2025)
Manifold Matching using Shortest-Path Distance and Joint Neighborhood Selection
by: Shen, Cencheng, et al.
Published: (2014)
by: Shen, Cencheng, et al.
Published: (2014)
Diffusion Processes on Implicit Manifolds
by: Kawasaki-Borruat, Victor, et al.
Published: (2026)
by: Kawasaki-Borruat, Victor, et al.
Published: (2026)
Geometry-Aware Generative Autoencoders for Warped Riemannian Metric Learning and Generative Modeling on Data Manifolds
by: Sun, Xingzhi, et al.
Published: (2024)
by: Sun, Xingzhi, et al.
Published: (2024)
Similar Items
-
Emotion Concepts and their Function in a Large Language Model
by: Sofroniew, Nicholas, et al.
Published: (2026) -
Language Models Represent Space and Time
by: Gurnee, Wes, et al.
Published: (2023) -
Not All Language Model Features Are One-Dimensionally Linear
by: Engels, Joshua, et al.
Published: (2024) -
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
by: Hutson, Miles, et al.
Published: (2024) -
The Remarkable Robustness of LLMs: Stages of Inference?
by: Lad, Vedang, et al.
Published: (2024)