Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yang, Zhaoqilin, Li, Chanchan, Liu, Tianqi, Zhao, Hongxin, Tian, Youliang
Format:	Preprint
Published:	2025
Subjects:	Multiagent Systems Computer Science and Game Theory
Online Access:	https://arxiv.org/abs/2510.08607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909833719971840
author	Yang, Zhaoqilin Li, Chanchan Liu, Tianqi Zhao, Hongxin Tian, Youliang
author_facet	Yang, Zhaoqilin Li, Chanchan Liu, Tianqi Zhao, Hongxin Tian, Youliang
contents	Inspired by the principle of self-regulating cooperation in collective institutions, we propose the Group Relative Policy Optimization with Global Cooperation Constraint (GRPO-GCC) framework. This work is the first to introduce GRPO into spatial public goods games, establishing a new deep reinforcement learning baseline for structured populations. GRPO-GCC integrates group relative policy optimization with a global cooperation constraint that strengthens incentives at intermediate cooperation levels while weakening them at extremes. This mechanism aligns local decision making with sustainable collective outcomes and prevents collapse into either universal defection or unconditional cooperation. The framework advances beyond existing approaches by combining group-normalized advantage estimation, a reference-anchored KL penalty, and a global incentive term that dynamically adjusts cooperative payoffs. As a result, it achieves accelerated cooperation onset, stabilized policy adaptation, and long-term sustainability. GRPO-GCC demonstrates how a simple yet global signal can reshape incentives toward resilient cooperation, and provides a new paradigm for multi-agent reinforcement learning in socio-technical systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_08607
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	GRPO-GCC: Enhancing Cooperation in Spatial Public Goods Games via Group Relative Policy Optimization with Global Cooperation Constraint Yang, Zhaoqilin Li, Chanchan Liu, Tianqi Zhao, Hongxin Tian, Youliang Multiagent Systems Computer Science and Game Theory Inspired by the principle of self-regulating cooperation in collective institutions, we propose the Group Relative Policy Optimization with Global Cooperation Constraint (GRPO-GCC) framework. This work is the first to introduce GRPO into spatial public goods games, establishing a new deep reinforcement learning baseline for structured populations. GRPO-GCC integrates group relative policy optimization with a global cooperation constraint that strengthens incentives at intermediate cooperation levels while weakening them at extremes. This mechanism aligns local decision making with sustainable collective outcomes and prevents collapse into either universal defection or unconditional cooperation. The framework advances beyond existing approaches by combining group-normalized advantage estimation, a reference-anchored KL penalty, and a global incentive term that dynamically adjusts cooperative payoffs. As a result, it achieves accelerated cooperation onset, stabilized policy adaptation, and long-term sustainability. GRPO-GCC demonstrates how a simple yet global signal can reshape incentives toward resilient cooperation, and provides a new paradigm for multi-agent reinforcement learning in socio-technical systems.
title	GRPO-GCC: Enhancing Cooperation in Spatial Public Goods Games via Group Relative Policy Optimization with Global Cooperation Constraint
topic	Multiagent Systems Computer Science and Game Theory
url	https://arxiv.org/abs/2510.08607

Similar Items