Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Morocho, Erika Elizabeth Taday, Cima, Lorenzo, Fagni, Tiziano, Avvenuti, Marco, Cresci, Stefano
Format:	Preprint
Published:	2026
Subjects:	Computers and Society Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.18462
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914341468504064
author	Morocho, Erika Elizabeth Taday Cima, Lorenzo Fagni, Tiziano Avvenuti, Marco Cresci, Stefano
author_facet	Morocho, Erika Elizabeth Taday Cima, Lorenzo Fagni, Tiziano Avvenuti, Marco Cresci, Stefano
contents	Using persona-conditioned LLMs as synthetic survey respondents has become a common practice in computational social science and agent-based simulations. Yet, it remains unclear whether multi-attribute persona prompting improves LLM reliability or instead introduces distortions. Here we contribute to this assessment by leveraging a large dataset of U.S. microdata from the World Values Survey. Concretely, we evaluate two open-weight chat models and a random-guesser baseline across more than 70K respondent-item instances. We find that persona prompting does not yield a clear aggregate improvement in survey alignment and, in many cases, significantly degrades performance. Persona effects are highly heterogeneous as most items exhibit minimal change, while a small subset of questions and underrepresented subgroups experience disproportionate distortions. Our findings highlight a key adverse impact of current persona-based simulation practices: demographic conditioning can redistribute error in ways that undermine subgroup fidelity and risk misleading downstream analyses.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_18462
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents Morocho, Erika Elizabeth Taday Cima, Lorenzo Fagni, Tiziano Avvenuti, Marco Cresci, Stefano Computers and Society Artificial Intelligence Using persona-conditioned LLMs as synthetic survey respondents has become a common practice in computational social science and agent-based simulations. Yet, it remains unclear whether multi-attribute persona prompting improves LLM reliability or instead introduces distortions. Here we contribute to this assessment by leveraging a large dataset of U.S. microdata from the World Values Survey. Concretely, we evaluate two open-weight chat models and a random-guesser baseline across more than 70K respondent-item instances. We find that persona prompting does not yield a clear aggregate improvement in survey alignment and, in many cases, significantly degrades performance. Persona effects are highly heterogeneous as most items exhibit minimal change, while a small subset of questions and underrepresented subgroups experience disproportionate distortions. Our findings highlight a key adverse impact of current persona-based simulation practices: demographic conditioning can redistribute error in ways that undermine subgroup fidelity and risk misleading downstream analyses.
title	Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents
topic	Computers and Society Artificial Intelligence
url	https://arxiv.org/abs/2602.18462

Similar Items