Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Calvano, Emilio, Possnig, Clemens, Tolvanen, Juha
Format:	Preprint
Published:	2026
Subjects:	Theoretical Economics
Online Access:	https://arxiv.org/abs/2602.12035
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914326015639552
author	Calvano, Emilio Possnig, Clemens Tolvanen, Juha
author_facet	Calvano, Emilio Possnig, Clemens Tolvanen, Juha
contents	We analyze strategic communication when advice is generated by a reinforcement-learning algorithm rather than by a fully rational sender. Building on the cheap-talk framework of Crawford and Sobel (1982), an advisor adapts its messages based on payoff feedback, while a decision maker best-responds. We provide a theoretical analysis of the long-run communication outcomes induced by such reward-driven adaptation. With aligned preferences, we establish that learning robustly leads to informative communication even from uninformative initial policies. With misaligned preferences, no stable outcome exists; instead, learning generates cycles that sustain highly informative communication and payoffs exceeding those of any static equilibrium.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_12035
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication Calvano, Emilio Possnig, Clemens Tolvanen, Juha Theoretical Economics We analyze strategic communication when advice is generated by a reinforcement-learning algorithm rather than by a fully rational sender. Building on the cheap-talk framework of Crawford and Sobel (1982), an advisor adapts its messages based on payoff feedback, while a decision maker best-responds. We provide a theoretical analysis of the long-run communication outcomes induced by such reward-driven adaptation. With aligned preferences, we establish that learning robustly leads to informative communication even from uninformative initial policies. With misaligned preferences, no stable outcome exists; instead, learning generates cycles that sustain highly informative communication and payoffs exceeding those of any static equilibrium.
title	The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication
topic	Theoretical Economics
url	https://arxiv.org/abs/2602.12035

Similar Items