Saved in:
| Main Authors: | Paape, Dario, Linzen, Tal, Vasishth, Shravan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04489 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis
by: Timkey, William, et al.
Published: (2026)
by: Timkey, William, et al.
Published: (2026)
Assessing effect sizes, variability, and power in the on-line study of language production
by: Audrey, Bürki, et al.
Published: (2024)
by: Audrey, Bürki, et al.
Published: (2024)
What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps
by: Paape, Dario
Published: (2026)
by: Paape, Dario
Published: (2026)
Do Language Models' Words Refer?
by: Mandelkern, Matthew, et al.
Published: (2023)
by: Mandelkern, Matthew, et al.
Published: (2023)
SPAWNing Structural Priming Predictions from a Cognitively Motivated Parser
by: Prasad, Grusha, et al.
Published: (2024)
by: Prasad, Grusha, et al.
Published: (2024)
Manipulating language models' training data to study syntactic constraint learning: the case of English passivization
by: Leong, Cara Su-Yi, et al.
Published: (2024)
by: Leong, Cara Su-Yi, et al.
Published: (2024)
Multilingual Prompting for Improving LLM Generation Diversity
by: Wang, Qihan, et al.
Published: (2025)
by: Wang, Qihan, et al.
Published: (2025)
Escaping the sentence-level paradigm in machine translation
by: Post, Matt, et al.
Published: (2023)
by: Post, Matt, et al.
Published: (2023)
Entailment Semantics Can Be Extracted from an Ideal Language Model
by: Merrill, William, et al.
Published: (2022)
by: Merrill, William, et al.
Published: (2022)
Evaluating In-Context Translation with Synchronous Context-Free Grammar Transduction
by: Petty, Jackson, et al.
Published: (2026)
by: Petty, Jackson, et al.
Published: (2026)
How Does Code Pretraining Affect Language Model Task Performance?
by: Petty, Jackson, et al.
Published: (2024)
by: Petty, Jackson, et al.
Published: (2024)
What Goes Into a LM Acceptability Judgment? Rethinking the Impact of Frequency and Length
by: Tjuatja, Lindia, et al.
Published: (2024)
by: Tjuatja, Lindia, et al.
Published: (2024)
In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax
by: Mueller, Aaron, et al.
Published: (2023)
by: Mueller, Aaron, et al.
Published: (2023)
Attention-aware semantic relevance predicting Chinese sentence reading
by: Sun, Kun
Published: (2024)
by: Sun, Kun
Published: (2024)
Large language models can disambiguate opioid slang on social media
by: Carpenter, Kristy A., et al.
Published: (2026)
by: Carpenter, Kristy A., et al.
Published: (2026)
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
by: Merrill, William, et al.
Published: (2024)
by: Merrill, William, et al.
Published: (2024)
Emergence of Linear Truth Encodings in Language Models
by: Ravfogel, Shauli, et al.
Published: (2025)
by: Ravfogel, Shauli, et al.
Published: (2025)
Language Models Struggle to Use Representations Learned In-Context
by: Lepori, Michael A., et al.
Published: (2026)
by: Lepori, Michael A., et al.
Published: (2026)
Power in Numbers: Robust reading comprehension by finetuning with four adversarial sentences per example
by: Marcus, Ariel
Published: (2024)
by: Marcus, Ariel
Published: (2024)
Rapid Word Learning Through Meta In-Context Learning
by: Wang, Wentao, et al.
Published: (2025)
by: Wang, Wentao, et al.
Published: (2025)
RELIC: Evaluating Complex Reasoning via the Recognition of Languages In-Context
by: Petty, Jackson, et al.
Published: (2025)
by: Petty, Jackson, et al.
Published: (2025)
The Impact of Depth on Compositional Generalization in Transformer Language Models
by: Petty, Jackson, et al.
Published: (2023)
by: Petty, Jackson, et al.
Published: (2023)
Always Learning, Always Mixing: Efficient and Simple Data Mixing All The Time
by: Hu, Michael Y., et al.
Published: (2026)
by: Hu, Michael Y., et al.
Published: (2026)
Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases
by: Hu, Michael Y., et al.
Published: (2025)
by: Hu, Michael Y., et al.
Published: (2025)
Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models
by: Qiu, Linlu, et al.
Published: (2025)
by: Qiu, Linlu, et al.
Published: (2025)
Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"?
by: Liu, Tong, et al.
Published: (2023)
by: Liu, Tong, et al.
Published: (2023)
Subword models struggle with word learning, but surprisal hides it
by: Bunzeck, Bastian, et al.
Published: (2025)
by: Bunzeck, Bastian, et al.
Published: (2025)
Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions
by: Ji, Yongyi, et al.
Published: (2024)
by: Ji, Yongyi, et al.
Published: (2024)
A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models
by: Eisape, Tiwalayo, et al.
Published: (2023)
by: Eisape, Tiwalayo, et al.
Published: (2023)
Variation of sentence length across time and genre
by: Rudnicka, Karolina
Published: (2025)
by: Rudnicka, Karolina
Published: (2025)
POS-tagging to highlight the skeletal structure of sentences
by: Churakov, Grigorii
Published: (2024)
by: Churakov, Grigorii
Published: (2024)
Recovering document annotations for sentence-level bitext
by: Wicks, Rachel, et al.
Published: (2024)
by: Wicks, Rachel, et al.
Published: (2024)
Iti-Validator: A Guardrail Framework for Validating and Correcting LLM-Generated Itineraries
by: Gadbail, Shravan, et al.
Published: (2025)
by: Gadbail, Shravan, et al.
Published: (2025)
Generating bilingual example sentences with large language models as lexicography assistants
by: Merx, Raphael, et al.
Published: (2024)
by: Merx, Raphael, et al.
Published: (2024)
Thinking beyond the anthropomorphic paradigm benefits LLM research
by: Ibrahim, Lujain, et al.
Published: (2025)
by: Ibrahim, Lujain, et al.
Published: (2025)
How accurate are Bayes factor-based null hypothesis tests? A simulation study
by: Schad, Daniel J., et al.
Published: (2024)
by: Schad, Daniel J., et al.
Published: (2024)
Early Transformers: A study on Efficient Training of Transformer Models through Early-Bird Lottery Tickets
by: Cheekati, Shravan
Published: (2024)
by: Cheekati, Shravan
Published: (2024)
Are LLM-based methods good enough for detecting unfair terms of service?
by: Frasheri, Mirgita, et al.
Published: (2024)
by: Frasheri, Mirgita, et al.
Published: (2024)
Decomposition of surprisal: Unified computational model of ERP components in language processing
by: Li, Jiaxuan, et al.
Published: (2024)
by: Li, Jiaxuan, et al.
Published: (2024)
Neural paraphrasing by automatically crawled and aligned sentence pairs
by: Globo, Achille, et al.
Published: (2024)
by: Globo, Achille, et al.
Published: (2024)
Similar Items
-
Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis
by: Timkey, William, et al.
Published: (2026) -
Assessing effect sizes, variability, and power in the on-line study of language production
by: Audrey, Bürki, et al.
Published: (2024) -
What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps
by: Paape, Dario
Published: (2026) -
Do Language Models' Words Refer?
by: Mandelkern, Matthew, et al.
Published: (2023) -
SPAWNing Structural Priming Predictions from a Cognitively Motivated Parser
by: Prasad, Grusha, et al.
Published: (2024)