Saved in:
| Main Authors: | Lepori, Michael A., Serre, Thomas, Pavlick, Ellie |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.04354 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)
by: Lepori, Michael A., et al.
Published: (2025)
Circuit Component Reuse Across Tasks in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2023)
by: Merullo, Jack, et al.
Published: (2023)
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
by: Anand, Suraj, et al.
Published: (2024)
by: Anand, Suraj, et al.
Published: (2024)
From Prediction to Understanding: Will AI Foundation Models Transform Brain Science?
by: Serre, Thomas, et al.
Published: (2025)
by: Serre, Thomas, et al.
Published: (2025)
Are LLMs Models of Distributional Semantics? A Case Study on Quantifiers
by: Enyan, Zhang, et al.
Published: (2024)
by: Enyan, Zhang, et al.
Published: (2024)
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)
by: Lepori, Michael A., et al.
Published: (2024)
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)
by: Merullo, Jack, et al.
Published: (2024)
How Do Language Models Compose Functions?
by: Khandelwal, Apoorv, et al.
Published: (2025)
by: Khandelwal, Apoorv, et al.
Published: (2025)
Does Training on Synthetic Data Make Models Less Robust?
by: Zhang, Lingze, et al.
Published: (2025)
by: Zhang, Lingze, et al.
Published: (2025)
Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)
by: Merullo, Jack, et al.
Published: (2023)
mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?
by: Hua, Tianze, et al.
Published: (2024)
by: Hua, Tianze, et al.
Published: (2024)
What is an "Abstract Reasoner"? Revisiting Experiments and Arguments about Large Language Models
by: Yun, Tian, et al.
Published: (2025)
by: Yun, Tian, et al.
Published: (2025)
Source-Modality Monitoring in Vision-Language Models
by: Hua, Etha Tianze, et al.
Published: (2026)
by: Hua, Etha Tianze, et al.
Published: (2026)
How Do Vision-Language Models Process Conflicting Information Across Modalities?
by: Hua, Tianze, et al.
Published: (2025)
by: Hua, Tianze, et al.
Published: (2025)
A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
by: Duchnowski, Alex, et al.
Published: (2025)
by: Duchnowski, Alex, et al.
Published: (2025)
LLMs as Models for Analogical Reasoning
by: Musker, Sam, et al.
Published: (2024)
by: Musker, Sam, et al.
Published: (2024)
Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline
by: Lu, Meng, et al.
Published: (2025)
by: Lu, Meng, et al.
Published: (2025)
Born a Transformer -- Always a Transformer? On the Effect of Pretraining on Architectural Abilities
by: Jobanputra, Mayank, et al.
Published: (2025)
by: Jobanputra, Mayank, et al.
Published: (2025)
Transferring Linear Features Across Language Models With Model Stitching
by: Chen, Alan, et al.
Published: (2025)
by: Chen, Alan, et al.
Published: (2025)
Instilling Inductive Biases with Subnetworks
by: Zhang, Enyan, et al.
Published: (2023)
by: Zhang, Enyan, et al.
Published: (2023)
Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
by: Lewis, Martha, et al.
Published: (2022)
by: Lewis, Martha, et al.
Published: (2022)
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
by: Zhang, Ruochen, et al.
Published: (2024)
by: Zhang, Ruochen, et al.
Published: (2024)
Shared Lexical Task Representations Explain Behavioral Variability In LLMs
by: Yang, Zhuonan, et al.
Published: (2026)
by: Yang, Zhuonan, et al.
Published: (2026)
Signatures of human-like processing in Transformer forward passes
by: Hu, Jennifer, et al.
Published: (2025)
by: Hu, Jennifer, et al.
Published: (2025)
Bayesian Preference Elicitation with Language Models
by: Handa, Kunal, et al.
Published: (2024)
by: Handa, Kunal, et al.
Published: (2024)
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
by: Khandelwal, Apoorv, et al.
Published: (2024)
by: Khandelwal, Apoorv, et al.
Published: (2024)
The dynamic interplay between in-context and in-weight learning in humans and neural networks
by: Russin, Jacob, et al.
Published: (2024)
by: Russin, Jacob, et al.
Published: (2024)
Racing Thoughts: Explaining Contextualization Errors in Large Language Models
by: Lepori, Michael A., et al.
Published: (2024)
by: Lepori, Michael A., et al.
Published: (2024)
Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations
by: Tartaglini, Alexa R., et al.
Published: (2023)
by: Tartaglini, Alexa R., et al.
Published: (2023)
Task-Specific Knowledge Distillation via Intermediate Probes
by: Brown, Ryan, et al.
Published: (2026)
by: Brown, Ryan, et al.
Published: (2026)
Can LLMs subtract numbers?
by: Jobanputra, Mayank, et al.
Published: (2025)
by: Jobanputra, Mayank, et al.
Published: (2025)
Language Models Struggle to Use Representations Learned In-Context
by: Lepori, Michael A., et al.
Published: (2026)
by: Lepori, Michael A., et al.
Published: (2026)
A Latent-Variable Model for Intrinsic Probing
by: Stańczak, Karolina, et al.
Published: (2022)
by: Stańczak, Karolina, et al.
Published: (2022)
Observable Propagation: Uncovering Feature Vectors in Transformers
by: Dunefsky, Jacob, et al.
Published: (2023)
by: Dunefsky, Jacob, et al.
Published: (2023)
Video Finetuning Improves Reasoning Between Frames
by: Yang, Ruiqi, et al.
Published: (2025)
by: Yang, Ruiqi, et al.
Published: (2025)
Finding Transformer Circuits with Edge Pruning
by: Bhaskar, Adithya, et al.
Published: (2024)
by: Bhaskar, Adithya, et al.
Published: (2024)
From Sequence to Structure: Uncovering Substructure Reasoning in Transformers
by: Dai, Xinnan, et al.
Published: (2025)
by: Dai, Xinnan, et al.
Published: (2025)
Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning
by: Hellwig, Philipp, et al.
Published: (2026)
by: Hellwig, Philipp, et al.
Published: (2026)
Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition
by: Hsu, Aliyah R., et al.
Published: (2024)
by: Hsu, Aliyah R., et al.
Published: (2024)
Embedded Named Entity Recognition using Probing Classifiers
by: Popovič, Nicholas, et al.
Published: (2024)
by: Popovič, Nicholas, et al.
Published: (2024)
Similar Items
-
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025) -
Circuit Component Reuse Across Tasks in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2023) -
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
by: Anand, Suraj, et al.
Published: (2024) -
From Prediction to Understanding: Will AI Foundation Models Transform Brain Science?
by: Serre, Thomas, et al.
Published: (2025) -
Are LLMs Models of Distributional Semantics? A Case Study on Quantifiers
by: Enyan, Zhang, et al.
Published: (2024)