Saved in:
| Main Authors: | Griffiths, Thomas L., Lake, Brenden M., McCoy, R. Thomas, Pavlick, Ellie, Webb, Taylor W. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.05776 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Collocational bootstrapping: A hypothesis about the learning of subject-verb agreement in humans and neural networks
by: Hobbs, Claire, et al.
Published: (2026)
by: Hobbs, Claire, et al.
Published: (2026)
Investigating Concept Alignment Using Implausible Category Members
by: Rane, Sunayana, et al.
Published: (2026)
by: Rane, Sunayana, et al.
Published: (2026)
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
by: Prabhakar, Akshara, et al.
Published: (2024)
by: Prabhakar, Akshara, et al.
Published: (2024)
Distilling Symbolic Priors for Concept Learning into Neural Networks
by: Marinescu, Ioana, et al.
Published: (2024)
by: Marinescu, Ioana, et al.
Published: (2024)
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)
by: Lepori, Michael A., et al.
Published: (2024)
From Prediction to Understanding: Will AI Foundation Models Transform Brain Science?
by: Serre, Thomas, et al.
Published: (2025)
by: Serre, Thomas, et al.
Published: (2025)
Overcoming classic challenges for artificial neural networks by providing incentives and practice
by: Irie, Kazuki, et al.
Published: (2024)
by: Irie, Kazuki, et al.
Published: (2024)
Not-So-Strange Love: Language Models and Generative Linguistic Theories are More Compatible than They Appear
by: McCoy, R. Thomas
Published: (2026)
by: McCoy, R. Thomas
Published: (2026)
Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models
by: Zhang, Liyi, et al.
Published: (2025)
by: Zhang, Liyi, et al.
Published: (2025)
Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks
by: Bencomo, Gianluca, et al.
Published: (2025)
by: Bencomo, Gianluca, et al.
Published: (2025)
Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations
by: Tartaglini, Alexa R., et al.
Published: (2023)
by: Tartaglini, Alexa R., et al.
Published: (2023)
When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1
by: McCoy, R. Thomas, et al.
Published: (2024)
by: McCoy, R. Thomas, et al.
Published: (2024)
How Do Language Models Compose Functions?
by: Khandelwal, Apoorv, et al.
Published: (2025)
by: Khandelwal, Apoorv, et al.
Published: (2025)
Does Training on Synthetic Data Make Models Less Robust?
by: Zhang, Lingze, et al.
Published: (2025)
by: Zhang, Lingze, et al.
Published: (2025)
LLMs model how humans induce logically structured rules
by: Loo, Alyssa, et al.
Published: (2025)
by: Loo, Alyssa, et al.
Published: (2025)
Are they human? Detecting large language models by probing human memory constraints
by: Schug, Simon, et al.
Published: (2026)
by: Schug, Simon, et al.
Published: (2026)
What is an "Abstract Reasoner"? Revisiting Experiments and Arguments about Large Language Models
by: Yun, Tian, et al.
Published: (2025)
by: Yun, Tian, et al.
Published: (2025)
mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?
by: Hua, Tianze, et al.
Published: (2024)
by: Hua, Tianze, et al.
Published: (2024)
Handling and Interpreting Missing Modalities in Patient Clinical Trajectories via Autoregressive Sequence Modeling
by: Wang, Andrew, et al.
Published: (2026)
by: Wang, Andrew, et al.
Published: (2026)
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)
by: Merullo, Jack, et al.
Published: (2024)
Minimization of Boolean Complexity in In-Context Concept Learning
by: Wang, Leroy Z., et al.
Published: (2024)
by: Wang, Leroy Z., et al.
Published: (2024)
Instilling Inductive Biases with Subnetworks
by: Zhang, Enyan, et al.
Published: (2023)
by: Zhang, Enyan, et al.
Published: (2023)
What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions
by: Zhang, Liyi, et al.
Published: (2024)
by: Zhang, Liyi, et al.
Published: (2024)
How Do Vision-Language Models Process Conflicting Information Across Modalities?
by: Hua, Tianze, et al.
Published: (2025)
by: Hua, Tianze, et al.
Published: (2025)
CoLLEGe: Concept Embedding Generation for Large Language Models
by: Teehan, Ryan, et al.
Published: (2024)
by: Teehan, Ryan, et al.
Published: (2024)
SAGE-Eval: Evaluating LLMs for Systematic Generalizations of Safety Facts
by: Yueh-Han, Chen, et al.
Published: (2025)
by: Yueh-Han, Chen, et al.
Published: (2025)
GPU-Accelerated ANNS: Quantized for Speed, Built for Change
by: McCoy, Hunter, et al.
Published: (2026)
by: McCoy, Hunter, et al.
Published: (2026)
Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
by: Kumar, Sreejan, et al.
Published: (2024)
by: Kumar, Sreejan, et al.
Published: (2024)
Video Finetuning Improves Reasoning Between Frames
by: Yang, Ruiqi, et al.
Published: (2025)
by: Yang, Ruiqi, et al.
Published: (2025)
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)
by: Tang, Cheng, et al.
Published: (2025)
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)
by: Lepori, Michael A., et al.
Published: (2025)
Compositional learning of functions in humans and machines
by: Zhou, Yanli, et al.
Published: (2024)
by: Zhou, Yanli, et al.
Published: (2024)
Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks
by: Traylor, Aaron, et al.
Published: (2024)
by: Traylor, Aaron, et al.
Published: (2024)
Dominion: A New Frontier for AI Research
by: Halawi, Danny, et al.
Published: (2024)
by: Halawi, Danny, et al.
Published: (2024)
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
by: Zhang, Ruochen, et al.
Published: (2024)
by: Zhang, Ruochen, et al.
Published: (2024)
Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025)
by: Wang, Wentao, et al.
Published: (2025)
Do Large Language Models Reason Causally Like Us? Even Better?
by: Dettki, Hanna M., et al.
Published: (2025)
by: Dettki, Hanna M., et al.
Published: (2025)
Goals as Reward-Producing Programs
by: Davidson, Guy, et al.
Published: (2024)
by: Davidson, Guy, et al.
Published: (2024)
H-ARC: A Robust Estimate of Human Performance on the Abstraction and Reasoning Corpus Benchmark
by: LeGris, Solim, et al.
Published: (2024)
by: LeGris, Solim, et al.
Published: (2024)
Levels of Analysis for Large Language Models
by: Ku, Alexander Y., et al.
Published: (2025)
by: Ku, Alexander Y., et al.
Published: (2025)
Similar Items
-
Collocational bootstrapping: A hypothesis about the learning of subject-verb agreement in humans and neural networks
by: Hobbs, Claire, et al.
Published: (2026) -
Investigating Concept Alignment Using Implausible Category Members
by: Rane, Sunayana, et al.
Published: (2026) -
Deciphering the Factors Influencing the Efficacy of Chain-of-Thought: Probability, Memorization, and Noisy Reasoning
by: Prabhakar, Akshara, et al.
Published: (2024) -
Distilling Symbolic Priors for Concept Learning into Neural Networks
by: Marinescu, Ioana, et al.
Published: (2024) -
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)