Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Olsen, Katrina, Padó, Sebastian
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.11699
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913139225788416
author	Olsen, Katrina Padó, Sebastian
author_facet	Olsen, Katrina Padó, Sebastian
contents	Nonsensical and anomalous sentences have been instrumental in the development of computational models of semantic interpretation. A core challenge is to distinguish between what is merely anomalous (but can be interpreted given a supporting context) and what is truly nonsensical. However, it is unclear (a) how nonsensical, rather than merely anomalous, existing datasets are; and (b) how well LLMs can make this distinction. In this paper, we answer both questions by collecting sensicality judgments from human raters and LLMs on sentences from five semantically deviant datasets: both context-free and when providing a context. We find that raters consider most sentences at most anomalous, and only a few as properly nonsensical. We also show that LLMs are substantially skilled in generating plausible contexts for anomalous cases.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_11699
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models Olsen, Katrina Padó, Sebastian Computation and Language Nonsensical and anomalous sentences have been instrumental in the development of computational models of semantic interpretation. A core challenge is to distinguish between what is merely anomalous (but can be interpreted given a supporting context) and what is truly nonsensical. However, it is unclear (a) how nonsensical, rather than merely anomalous, existing datasets are; and (b) how well LLMs can make this distinction. In this paper, we answer both questions by collecting sensicality judgments from human raters and LLMs on sentences from five semantically deviant datasets: both context-free and when providing a context. We find that raters consider most sentences at most anomalous, and only a few as properly nonsensical. We also show that LLMs are substantially skilled in generating plausible contexts for anomalous cases.
title	Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models
topic	Computation and Language
url	https://arxiv.org/abs/2602.11699

Similar Items