:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lepori, Michael A., Serre, Thomas, Pavlick, Ellie
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2311.04354
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)

Circuit Component Reuse Across Tasks in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2023)

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
by: Anand, Suraj, et al.
Published: (2024)

From Prediction to Understanding: Will AI Foundation Models Transform Brain Science?
by: Serre, Thomas, et al.
Published: (2025)

Are LLMs Models of Distributional Semantics? A Case Study on Quantifiers
by: Enyan, Zhang, et al.
Published: (2024)

Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)

How Do Language Models Compose Functions?
by: Khandelwal, Apoorv, et al.
Published: (2025)

Does Training on Synthetic Data Make Models Less Robust?
by: Zhang, Lingze, et al.
Published: (2025)

Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)

mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?
by: Hua, Tianze, et al.
Published: (2024)

What is an "Abstract Reasoner"? Revisiting Experiments and Arguments about Large Language Models
by: Yun, Tian, et al.
Published: (2025)

Source-Modality Monitoring in Vision-Language Models
by: Hua, Etha Tianze, et al.
Published: (2026)

How Do Vision-Language Models Process Conflicting Information Across Modalities?
by: Hua, Tianze, et al.
Published: (2025)

A Knapsack by Any Other Name: Presentation impacts LLM performance on NP-hard problems
by: Duchnowski, Alex, et al.
Published: (2025)

LLMs as Models for Analogical Reasoning
by: Musker, Sam, et al.
Published: (2024)

Paths Not Taken: Understanding and Mending the Multilingual Factual Recall Pipeline
by: Lu, Meng, et al.
Published: (2025)

Born a Transformer -- Always a Transformer? On the Effect of Pretraining on Architectural Abilities
by: Jobanputra, Mayank, et al.
Published: (2025)

Transferring Linear Features Across Language Models With Model Stitching
by: Chen, Alan, et al.
Published: (2025)

Instilling Inductive Biases with Subnetworks
by: Zhang, Enyan, et al.
Published: (2023)

Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
by: Lewis, Martha, et al.
Published: (2022)

The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling
by: Zhang, Ruochen, et al.
Published: (2024)

Shared Lexical Task Representations Explain Behavioral Variability In LLMs
by: Yang, Zhuonan, et al.
Published: (2026)

Signatures of human-like processing in Transformer forward passes
by: Hu, Jennifer, et al.
Published: (2025)

Bayesian Preference Elicitation with Language Models
by: Handa, Kunal, et al.
Published: (2024)

$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
by: Khandelwal, Apoorv, et al.
Published: (2024)

The dynamic interplay between in-context and in-weight learning in humans and neural networks
by: Russin, Jacob, et al.
Published: (2024)

Racing Thoughts: Explaining Contextualization Errors in Large Language Models
by: Lepori, Michael A., et al.
Published: (2024)

Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations
by: Tartaglini, Alexa R., et al.
Published: (2023)

Task-Specific Knowledge Distillation via Intermediate Probes
by: Brown, Ryan, et al.
Published: (2026)

Can LLMs subtract numbers?
by: Jobanputra, Mayank, et al.
Published: (2025)

Language Models Struggle to Use Representations Learned In-Context
by: Lepori, Michael A., et al.
Published: (2026)

A Latent-Variable Model for Intrinsic Probing
by: Stańczak, Karolina, et al.
Published: (2022)

Observable Propagation: Uncovering Feature Vectors in Transformers
by: Dunefsky, Jacob, et al.
Published: (2023)

Video Finetuning Improves Reasoning Between Frames
by: Yang, Ruiqi, et al.
Published: (2025)

Finding Transformer Circuits with Edge Pruning
by: Bhaskar, Adithya, et al.
Published: (2024)

From Sequence to Structure: Uncovering Substructure Reasoning in Transformers
by: Dai, Xinnan, et al.
Published: (2025)

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning
by: Hellwig, Philipp, et al.
Published: (2026)

Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition
by: Hsu, Aliyah R., et al.
Published: (2024)

Embedded Named Entity Recognition using Probing Classifiers
by: Popovič, Nicholas, et al.
Published: (2024)