Saved in:
Bibliographic Details
Main Authors: Fang, Shaohua, Li, Yue, Cong, Yan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.10860
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909786287636480
author Fang, Shaohua
Li, Yue
Cong, Yan
author_facet Fang, Shaohua
Li, Yue
Cong, Yan
contents Sentences with multiple quantifiers often lead to interpretive ambiguities, which can vary across languages. This study adopts a cross-linguistic approach to examine how large language models (LLMs) handle quantifier scope interpretation in English and Chinese, using probabilities to assess interpretive likelihood. Human similarity (HS) scores were used to quantify the extent to which LLMs emulate human performance across language groups. Results reveal that most LLMs prefer the surface scope interpretations, aligning with human tendencies, while only some differentiate between English and Chinese in the inverse scope preferences, reflecting human-similar patterns. HS scores highlight variability in LLMs' approximation of human behavior, but their overall potential to align with humans is notable. Differences in model architecture, scale, and particularly models' pre-training data language background, significantly influence how closely LLMs approximate human quantifier scope interpretations.
format Preprint
id arxiv_https___arxiv_org_abs_2509_10860
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Quantifier Scope Interpretation in Language Learners and LLMs
Fang, Shaohua
Li, Yue
Cong, Yan
Computation and Language
Sentences with multiple quantifiers often lead to interpretive ambiguities, which can vary across languages. This study adopts a cross-linguistic approach to examine how large language models (LLMs) handle quantifier scope interpretation in English and Chinese, using probabilities to assess interpretive likelihood. Human similarity (HS) scores were used to quantify the extent to which LLMs emulate human performance across language groups. Results reveal that most LLMs prefer the surface scope interpretations, aligning with human tendencies, while only some differentiate between English and Chinese in the inverse scope preferences, reflecting human-similar patterns. HS scores highlight variability in LLMs' approximation of human behavior, but their overall potential to align with humans is notable. Differences in model architecture, scale, and particularly models' pre-training data language background, significantly influence how closely LLMs approximate human quantifier scope interpretations.
title Quantifier Scope Interpretation in Language Learners and LLMs
topic Computation and Language
url https://arxiv.org/abs/2509.10860