Enregistré dans:
Détails bibliographiques
Auteurs principaux: Demir, Alper, Aydın, Hüseyin, Tessera, Kale-ab Abebe, Abel, David, Albrecht, Stefano V.
Format: Preprint
Publié: 2026
Sujets:
Accès en ligne:https://arxiv.org/abs/2602.15407
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866912909605470208
author Demir, Alper
Aydın, Hüseyin
Tessera, Kale-ab Abebe
Abel, David
Albrecht, Stefano V.
author_facet Demir, Alper
Aydın, Hüseyin
Tessera, Kale-ab Abebe
Abel, David
Albrecht, Stefano V.
contents Sequential Social Dilemmas (SSDs) provide a key framework for studying how cooperation emerges when individual incentives conflict with collective welfare. In Multi-Agent Reinforcement Learning, these problems are often addressed by incorporating intrinsic drives that encourage prosocial or fair behavior. However, most existing methods assume that agents face identical incentives in the dilemma and require continuous access to global information about other agents to assess fairness. In this work, we introduce asymmetric variants of well-known SSD environments and examine how natural differences between agents influence cooperation dynamics. Our findings reveal that existing fairness-based methods struggle to adapt under asymmetric conditions by enforcing raw equality that wrongfully incentivize defection. To address this, we propose three modifications: (i) redefining fairness by accounting for agents' reward ranges, (ii) introducing an agent-based weighting mechanism to better handle inherent asymmetries, and (iii) localizing social feedback to make the methods effective under partial observability without requiring global information sharing. Experimental results show that in asymmetric scenarios, our method fosters faster emergence of cooperative policies compared to existing approaches, without sacrificing scalability or practicality.
format Preprint
id arxiv_https___arxiv_org_abs_2602_15407
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas
Demir, Alper
Aydın, Hüseyin
Tessera, Kale-ab Abebe
Abel, David
Albrecht, Stefano V.
Machine Learning
Sequential Social Dilemmas (SSDs) provide a key framework for studying how cooperation emerges when individual incentives conflict with collective welfare. In Multi-Agent Reinforcement Learning, these problems are often addressed by incorporating intrinsic drives that encourage prosocial or fair behavior. However, most existing methods assume that agents face identical incentives in the dilemma and require continuous access to global information about other agents to assess fairness. In this work, we introduce asymmetric variants of well-known SSD environments and examine how natural differences between agents influence cooperation dynamics. Our findings reveal that existing fairness-based methods struggle to adapt under asymmetric conditions by enforcing raw equality that wrongfully incentivize defection. To address this, we propose three modifications: (i) redefining fairness by accounting for agents' reward ranges, (ii) introducing an agent-based weighting mechanism to better handle inherent asymmetries, and (iii) localizing social feedback to make the methods effective under partial observability without requiring global information sharing. Experimental results show that in asymmetric scenarios, our method fosters faster emergence of cooperative policies compared to existing approaches, without sacrificing scalability or practicality.
title Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas
topic Machine Learning
url https://arxiv.org/abs/2602.15407