Saved in:
Bibliographic Details
Main Authors: Hamed, Aya, Marden, Jason R., Shamma, Jeff S.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.17848
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910238587748352
author Hamed, Aya
Marden, Jason R.
Shamma, Jeff S.
author_facet Hamed, Aya
Marden, Jason R.
Shamma, Jeff S.
contents Strategic multi-agent systems are fundamentally characterized by decentralization, uncertainty, and ambiguity. Agents operating under limited observations will often need to make decisions based on simplified internal models of the environment, reflecting bounded rationality in both computational capacity and environmental knowledge. The Empirical Evidence Equilibrium (EEE) framework explicitly accounts for these limitations by modeling each agent as forming a potentially misspecified belief derived from signals obtained through partial observations of the environment. The resulting equilibrium concept captures the system's steady state under bounded rationality and decentralization. In this work, we study games in which the environment dynamics are driven jointly by exogenous factors and agents' actions. We analyze agent behavior under Q-value iteration where each agent independently forms a belief model, computes Q-values, and derives a greedy strategy, yet the collective actions of all agents jointly shape the environment each agent faces at the next stage. We prove that despite this decentralization, an EEE emerges from the joint dynamics when the coupling between agents' actions and the environment is sufficiently weak. We further extend this result to softmax policies, establishing a contraction result under a sufficient coupling condition.
format Preprint
id arxiv_https___arxiv_org_abs_2605_17848
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Learning Empirical Evidence Equilibria under Weak Environmental Coupling
Hamed, Aya
Marden, Jason R.
Shamma, Jeff S.
Computer Science and Game Theory
Strategic multi-agent systems are fundamentally characterized by decentralization, uncertainty, and ambiguity. Agents operating under limited observations will often need to make decisions based on simplified internal models of the environment, reflecting bounded rationality in both computational capacity and environmental knowledge. The Empirical Evidence Equilibrium (EEE) framework explicitly accounts for these limitations by modeling each agent as forming a potentially misspecified belief derived from signals obtained through partial observations of the environment. The resulting equilibrium concept captures the system's steady state under bounded rationality and decentralization. In this work, we study games in which the environment dynamics are driven jointly by exogenous factors and agents' actions. We analyze agent behavior under Q-value iteration where each agent independently forms a belief model, computes Q-values, and derives a greedy strategy, yet the collective actions of all agents jointly shape the environment each agent faces at the next stage. We prove that despite this decentralization, an EEE emerges from the joint dynamics when the coupling between agents' actions and the environment is sufficiently weak. We further extend this result to softmax policies, establishing a contraction result under a sufficient coupling condition.
title Learning Empirical Evidence Equilibria under Weak Environmental Coupling
topic Computer Science and Game Theory
url https://arxiv.org/abs/2605.17848