Saved in:
Bibliographic Details
Main Authors: Long, Olivia, Teplica, Carter
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.18467
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914005461762048
author Long, Olivia
Teplica, Carter
author_facet Long, Olivia
Teplica, Carter
contents As AI agents become increasingly capable of tool use and long-horizon tasks, they have begun to be deployed in settings where multiple agents can interact. However, whereas prior work has mostly focused on human-AI interactions, there is an increasing need to understand AI-AI interactions. In this paper, we adapt the iterated public goods game, a classic behavioral economics game, to analyze the behavior of four reasoning and non-reasoning models across two conditions: models are either told they are playing against "another AI agent" or told their opponents are themselves. We find that, across different settings, telling LLMs that they are playing against themselves significantly changes their tendency to cooperate. While our study is conducted in a toy environment, our results may provide insights into multi-agent settings where agents "unconsciously" discriminating against each other could inexplicably increase or decrease cooperation.
format Preprint
id arxiv_https___arxiv_org_abs_2508_18467
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle The AI in the Mirror: LLM Self-Recognition in an Iterated Public Goods Game
Long, Olivia
Teplica, Carter
Artificial Intelligence
As AI agents become increasingly capable of tool use and long-horizon tasks, they have begun to be deployed in settings where multiple agents can interact. However, whereas prior work has mostly focused on human-AI interactions, there is an increasing need to understand AI-AI interactions. In this paper, we adapt the iterated public goods game, a classic behavioral economics game, to analyze the behavior of four reasoning and non-reasoning models across two conditions: models are either told they are playing against "another AI agent" or told their opponents are themselves. We find that, across different settings, telling LLMs that they are playing against themselves significantly changes their tendency to cooperate. While our study is conducted in a toy environment, our results may provide insights into multi-agent settings where agents "unconsciously" discriminating against each other could inexplicably increase or decrease cooperation.
title The AI in the Mirror: LLM Self-Recognition in an Iterated Public Goods Game
topic Artificial Intelligence
url https://arxiv.org/abs/2508.18467