Staff View: Communicate-Predict-Act: Evaluating Social Intelligence of Agents :: Library Catalog

Bibliographic Details
Main Authors:	Shoresh, David; Kraus, Sarit; Loewenstein, Yonatan
Format:	Preprint
Published:	2026
Subjects:	Computers and Society
Online Access:	https://arxiv.org/abs/2604.08727

_version_	1866913020814295040
author	Shoresh, David; Kraus, Sarit; Loewenstein, Yonatan
author_facet	Shoresh, David; Kraus, Sarit; Loewenstein, Yonatan
contents	As large language model (LLM) agents become more prevalent in real-world social settings, social intelligence will play an increasingly critical role. But social intelligence is still a poorly defined construct, for humans and artificial agents alike. We introduce a multiplayer arena of mixed cooperative and competitive social games to study LLM social intelligence. The controllability of LLM-based agents enables systematic evaluation, which also supports broader inferences about social intelligence per se. We evaluated eight diverse LLMs (24B to 1T parameters) using a Communicate-Predict-Act (COMPACT) interaction protocol and fine-grained probing of social dynamics. Elo-style ratings reveal consistent performance differences across models, but this scalar measure provides only a partial characterization of social intelligence. To address this limitation, we analyze gameplay traces to extract sociocognitive metrics capturing action prediction, communicative influence, strategic reasoning, and tradeoffs under conflicting interests. These sociocognitive metrics exhibit strong intra-model consistency, and they reliably predict pairwise agent advantage in game outcomes (AUC-ROC = 0.82). Feature importance analysis indicates that, surprisingly, influence, transparency, and adaptability are more predictive of success than Theory of Mind inference or deep planning. Together, our results advance a testable, multidimensional conception of social intelligence and provide empirical insights into the capacities that underpin it.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_08727
institution	arXiv
publishDate	2026
record_format	arxiv
title	Communicate-Predict-Act: Evaluating Social Intelligence of Agents
topic	Computers and Society
url	https://arxiv.org/abs/2604.08727
