Saved in:
| Main Authors: | Saanum, Tankred, Demircan, Can, Gershman, Samuel J., Schulz, Eric |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.21534 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models
by: Demircan, Can, et al.
Published: (2024)
by: Demircan, Can, et al.
Published: (2024)
Simplifying Latent Dynamics with Softly State-Invariant World Models
by: Saanum, Tankred, et al.
Published: (2024)
by: Saanum, Tankred, et al.
Published: (2024)
Next state prediction gives rise to entangled, yet compositional representations of objects
by: Saanum, Tankred, et al.
Published: (2024)
by: Saanum, Tankred, et al.
Published: (2024)
Evaluating alignment between humans and neural network representations in image-based learning tasks
by: Demircan, Can, et al.
Published: (2023)
by: Demircan, Can, et al.
Published: (2023)
Reference-Free Rating of LLM Responses via Latent Information
by: Girrbach, Leander, et al.
Published: (2025)
by: Girrbach, Leander, et al.
Published: (2025)
Can Vision Language Models Learn Intuitive Physics from Interaction?
by: Buschoff, Luca M. Schulze, et al.
Published: (2026)
by: Buschoff, Luca M. Schulze, et al.
Published: (2026)
Fast weight programming and linear transformers: from machine learning to neurobiology
by: Irie, Kazuki, et al.
Published: (2025)
by: Irie, Kazuki, et al.
Published: (2025)
A Variational Manifold Embedding Framework for Nonlinear Dimensionality Reduction
by: Vastola, John J., et al.
Published: (2025)
by: Vastola, John J., et al.
Published: (2025)
Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers
by: Irie, Kazuki, et al.
Published: (2025)
by: Irie, Kazuki, et al.
Published: (2025)
Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules
by: Vastola, John J., et al.
Published: (2025)
by: Vastola, John J., et al.
Published: (2025)
Artificial intelligence for science: The easy and hard problems
by: Battleday, Ruairidh M., et al.
Published: (2024)
by: Battleday, Ruairidh M., et al.
Published: (2024)
Why Depth Matters in Parallelizable Sequence Models: A Lie Algebraic View
by: Heo, Gyuryang, et al.
Published: (2026)
by: Heo, Gyuryang, et al.
Published: (2026)
Key-value memory in the brain
by: Gershman, Samuel J., et al.
Published: (2025)
by: Gershman, Samuel J., et al.
Published: (2025)
General Intelligence Requires Reward-based Pretraining
by: Han, Seungwook, et al.
Published: (2025)
by: Han, Seungwook, et al.
Published: (2025)
Successor-Predecessor Intrinsic Exploration
by: Yu, Changmin, et al.
Published: (2023)
by: Yu, Changmin, et al.
Published: (2023)
In-Context Function Learning in Large Language Models
by: Akata, Elif, et al.
Published: (2026)
by: Akata, Elif, et al.
Published: (2026)
Grokking as the Transition from Lazy to Rich Training Dynamics
by: Kumar, Tanishq, et al.
Published: (2023)
by: Kumar, Tanishq, et al.
Published: (2023)
Generating Computational Cognitive Models using Large Language Models
by: Rmus, Milena, et al.
Published: (2025)
by: Rmus, Milena, et al.
Published: (2025)
In-context learning agents are asymmetric belief updaters
by: Schubert, Johannes A., et al.
Published: (2024)
by: Schubert, Johannes A., et al.
Published: (2024)
Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines
by: Carvalho, Wilka, et al.
Published: (2025)
by: Carvalho, Wilka, et al.
Published: (2025)
metabench -- A Sparse Benchmark of Reasoning and Knowledge in Large Language Models
by: Kipnis, Alex, et al.
Published: (2024)
by: Kipnis, Alex, et al.
Published: (2024)
Predictive representations: building blocks of intelligence
by: Carvalho, Wilka, et al.
Published: (2024)
by: Carvalho, Wilka, et al.
Published: (2024)
Do Mice Grok? Glimpses of Hidden Progress During Overtraining in Sensory Cortex
by: Kumar, Tanishq, et al.
Published: (2024)
by: Kumar, Tanishq, et al.
Published: (2024)
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
by: Marks, Samuel, et al.
Published: (2024)
by: Marks, Samuel, et al.
Published: (2024)
Pre-trained Large Language Models Learn Hidden Markov Models In-context
by: Dai, Yijia, et al.
Published: (2025)
by: Dai, Yijia, et al.
Published: (2025)
Task diversity produces systematic transfer but inhibits continual reinforcement learning
by: Seth, Purab, et al.
Published: (2026)
by: Seth, Purab, et al.
Published: (2026)
Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models
by: Qiu, Yifu, et al.
Published: (2025)
by: Qiu, Yifu, et al.
Published: (2025)
Online learning in bandits with predicted context
by: Guo, Yongyi, et al.
Published: (2023)
by: Guo, Yongyi, et al.
Published: (2023)
IC-Cache: Efficient Large Language Model Serving via In-context Caching
by: Yu, Yifan, et al.
Published: (2025)
by: Yu, Yifan, et al.
Published: (2025)
Scaling sparse feature circuit finding for in-context learning
by: Kharlapenko, Dmitrii, et al.
Published: (2025)
by: Kharlapenko, Dmitrii, et al.
Published: (2025)
Centaur: a foundation model of human cognition
by: Binz, Marcel, et al.
Published: (2024)
by: Binz, Marcel, et al.
Published: (2024)
Fractal interpolation in the context of prediction accuracy optimization
by: Baicoianu, Alexandra, et al.
Published: (2024)
by: Baicoianu, Alexandra, et al.
Published: (2024)
Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
by: Jagadish, Akshay K., et al.
Published: (2024)
by: Jagadish, Akshay K., et al.
Published: (2024)
Improving the forecast accuracy of wind power by leveraging multiple hierarchical structure
by: English, Lucas, et al.
Published: (2023)
by: English, Lucas, et al.
Published: (2023)
Probing the Decision Boundaries of In-context Learning in Large Language Models
by: Zhao, Siyan, et al.
Published: (2024)
by: Zhao, Siyan, et al.
Published: (2024)
Revisiting In-context Learning Inference Circuit in Large Language Models
by: Cho, Hakaze, et al.
Published: (2024)
by: Cho, Hakaze, et al.
Published: (2024)
In-context Autoencoder for Context Compression in a Large Language Model
by: Ge, Tao, et al.
Published: (2023)
by: Ge, Tao, et al.
Published: (2023)
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
by: Singh, Aaditya K., et al.
Published: (2024)
by: Singh, Aaditya K., et al.
Published: (2024)
Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective
by: Ghosh, Bishwamittra, et al.
Published: (2026)
by: Ghosh, Bishwamittra, et al.
Published: (2026)
Parallel In-context Learning for Large Vision Language Models
by: Yamaguchi, Shin'ya, et al.
Published: (2026)
by: Yamaguchi, Shin'ya, et al.
Published: (2026)
Similar Items
-
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models
by: Demircan, Can, et al.
Published: (2024) -
Simplifying Latent Dynamics with Softly State-Invariant World Models
by: Saanum, Tankred, et al.
Published: (2024) -
Next state prediction gives rise to entangled, yet compositional representations of objects
by: Saanum, Tankred, et al.
Published: (2024) -
Evaluating alignment between humans and neural network representations in image-based learning tasks
by: Demircan, Can, et al.
Published: (2023) -
Reference-Free Rating of LLM Responses via Latent Information
by: Girrbach, Leander, et al.
Published: (2025)