:: Library Catalog

Bibliographic Details
Main Authors:	Saanum, Tankred; Demircan, Can; Gershman, Samuel J.; Schulz, Eric
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2509.21534

Similar Items

Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models
by: Demircan, Can, et al.
Published: (2024)

Simplifying Latent Dynamics with Softly State-Invariant World Models
by: Saanum, Tankred, et al.
Published: (2024)

Next state prediction gives rise to entangled, yet compositional representations of objects
by: Saanum, Tankred, et al.
Published: (2024)

Evaluating alignment between humans and neural network representations in image-based learning tasks
by: Demircan, Can, et al.
Published: (2023)

Reference-Free Rating of LLM Responses via Latent Information
by: Girrbach, Leander, et al.
Published: (2025)

Can Vision Language Models Learn Intuitive Physics from Interaction?
by: Buschoff, Luca M. Schulze, et al.
Published: (2026)

Fast weight programming and linear transformers: from machine learning to neurobiology
by: Irie, Kazuki, et al.
Published: (2025)

A Variational Manifold Embedding Framework for Nonlinear Dimensionality Reduction
by: Vastola, John J., et al.
Published: (2025)

Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers
by: Irie, Kazuki, et al.
Published: (2025)

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules
by: Vastola, John J., et al.
Published: (2025)

Artificial intelligence for science: The easy and hard problems
by: Battleday, Ruairidh M., et al.
Published: (2024)

Why Depth Matters in Parallelizable Sequence Models: A Lie Algebraic View
by: Heo, Gyuryang, et al.
Published: (2026)

Key-value memory in the brain
by: Gershman, Samuel J., et al.
Published: (2025)

General Intelligence Requires Reward-based Pretraining
by: Han, Seungwook, et al.
Published: (2025)

Successor-Predecessor Intrinsic Exploration
by: Yu, Changmin, et al.
Published: (2023)

In-Context Function Learning in Large Language Models
by: Akata, Elif, et al.
Published: (2026)

Grokking as the Transition from Lazy to Rich Training Dynamics
by: Kumar, Tanishq, et al.
Published: (2023)

Generating Computational Cognitive Models using Large Language Models
by: Rmus, Milena, et al.
Published: (2025)

In-context learning agents are asymmetric belief updaters
by: Schubert, Johannes A., et al.
Published: (2024)

Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines
by: Carvalho, Wilka, et al.
Published: (2025)

metabench -- A Sparse Benchmark of Reasoning and Knowledge in Large Language Models
by: Kipnis, Alex, et al.
Published: (2024)

Predictive representations: building blocks of intelligence
by: Carvalho, Wilka, et al.
Published: (2024)

Do Mice Grok? Glimpses of Hidden Progress During Overtraining in Sensory Cortex
by: Kumar, Tanishq, et al.
Published: (2024)

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models
by: Marks, Samuel, et al.
Published: (2024)

Pre-trained Large Language Models Learn Hidden Markov Models In-context
by: Dai, Yijia, et al.
Published: (2025)

Task diversity produces systematic transfer but inhibits continual reinforcement learning
by: Seth, Purab, et al.
Published: (2026)

Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models
by: Qiu, Yifu, et al.
Published: (2025)

Online learning in bandits with predicted context
by: Guo, Yongyi, et al.
Published: (2023)

IC-Cache: Efficient Large Language Model Serving via In-context Caching
by: Yu, Yifan, et al.
Published: (2025)

Scaling sparse feature circuit finding for in-context learning
by: Kharlapenko, Dmitrii, et al.
Published: (2025)

Centaur: a foundation model of human cognition
by: Binz, Marcel, et al.
Published: (2024)

Fractal interpolation in the context of prediction accuracy optimization
by: Baicoianu, Alexandra, et al.
Published: (2024)

Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks
by: Jagadish, Akshay K., et al.
Published: (2024)

Improving the forecast accuracy of wind power by leveraging multiple hierarchical structure
by: English, Lucas, et al.
Published: (2023)

Probing the Decision Boundaries of In-context Learning in Large Language Models
by: Zhao, Siyan, et al.
Published: (2024)

Revisiting In-context Learning Inference Circuit in Large Language Models
by: Cho, Hakaze, et al.
Published: (2024)

In-context Autoencoder for Context Compression in a Large Language Model
by: Ge, Tao, et al.
Published: (2023)

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
by: Singh, Aaditya K., et al.
Published: (2024)

Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective
by: Ghosh, Bishwamittra, et al.
Published: (2026)

Parallel In-context Learning for Large Vision Language Models
by: Yamaguchi, Shin'ya, et al.
Published: (2026)