Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Fang, Shaohua, Li, Yue, Cong, Yan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2509.10860
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909786287636480
author	Fang, Shaohua Li, Yue Cong, Yan
author_facet	Fang, Shaohua Li, Yue Cong, Yan
contents	Sentences with multiple quantifiers often lead to interpretive ambiguities, which can vary across languages. This study adopts a cross-linguistic approach to examine how large language models (LLMs) handle quantifier scope interpretation in English and Chinese, using probabilities to assess interpretive likelihood. Human similarity (HS) scores were used to quantify the extent to which LLMs emulate human performance across language groups. Results reveal that most LLMs prefer the surface scope interpretations, aligning with human tendencies, while only some differentiate between English and Chinese in the inverse scope preferences, reflecting human-similar patterns. HS scores highlight variability in LLMs' approximation of human behavior, but their overall potential to align with humans is notable. Differences in model architecture, scale, and particularly models' pre-training data language background, significantly influence how closely LLMs approximate human quantifier scope interpretations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_10860
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Quantifier Scope Interpretation in Language Learners and LLMs Fang, Shaohua Li, Yue Cong, Yan Computation and Language Sentences with multiple quantifiers often lead to interpretive ambiguities, which can vary across languages. This study adopts a cross-linguistic approach to examine how large language models (LLMs) handle quantifier scope interpretation in English and Chinese, using probabilities to assess interpretive likelihood. Human similarity (HS) scores were used to quantify the extent to which LLMs emulate human performance across language groups. Results reveal that most LLMs prefer the surface scope interpretations, aligning with human tendencies, while only some differentiate between English and Chinese in the inverse scope preferences, reflecting human-similar patterns. HS scores highlight variability in LLMs' approximation of human behavior, but their overall potential to align with humans is notable. Differences in model architecture, scale, and particularly models' pre-training data language background, significantly influence how closely LLMs approximate human quantifier scope interpretations.
title	Quantifier Scope Interpretation in Language Learners and LLMs
topic	Computation and Language
url	https://arxiv.org/abs/2509.10860

Similar Items