Saved in:
Bibliographic Details
Main Authors: Olsen, Katrina, Padó, Sebastian
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.11699
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913139225788416
author Olsen, Katrina
Padó, Sebastian
author_facet Olsen, Katrina
Padó, Sebastian
contents Nonsensical and anomalous sentences have been instrumental in the development of computational models of semantic interpretation. A core challenge is to distinguish between what is merely anomalous (but can be interpreted given a supporting context) and what is truly nonsensical. However, it is unclear (a) how nonsensical, rather than merely anomalous, existing datasets are; and (b) how well LLMs can make this distinction. In this paper, we answer both questions by collecting sensicality judgments from human raters and LLMs on sentences from five semantically deviant datasets: both context-free and when providing a context. We find that raters consider most sentences at most anomalous, and only a few as properly nonsensical. We also show that LLMs are substantially skilled in generating plausible contexts for anomalous cases.
format Preprint
id arxiv_https___arxiv_org_abs_2602_11699
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models
Olsen, Katrina
Padó, Sebastian
Computation and Language
Nonsensical and anomalous sentences have been instrumental in the development of computational models of semantic interpretation. A core challenge is to distinguish between what is merely anomalous (but can be interpreted given a supporting context) and what is truly nonsensical. However, it is unclear (a) how nonsensical, rather than merely anomalous, existing datasets are; and (b) how well LLMs can make this distinction. In this paper, we answer both questions by collecting sensicality judgments from human raters and LLMs on sentences from five semantically deviant datasets: both context-free and when providing a context. We find that raters consider most sentences at most anomalous, and only a few as properly nonsensical. We also show that LLMs are substantially skilled in generating plausible contexts for anomalous cases.
title Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models
topic Computation and Language
url https://arxiv.org/abs/2602.11699