Bibliographic Details
Main Authors: Yu, Ryan; Nowak, Mateusz; Xie, Qintong; Feng, Michelle Yilin; Chin, Peter
Format: Preprint
Published: 2024
Online Access: https://arxiv.org/abs/2412.02016
Abstract:
  • Current algorithms for approximating Coarse Correlated Equilibria (CCE) are theoretically guaranteed to converge to a strong solution concept, but they struggle to approximate equilibria for games in large stochastic environments. In contrast, modern Reinforcement Learning (RL) algorithms train faster yet yield weaker solutions. We introduce Exp3-IXrl, a blend of RL and game-theoretic approaches that separates the RL agent's action selection from the equilibrium computation while preserving the integrity of the learning process. We demonstrate that our algorithm extends equilibrium approximation algorithms to new environments. Specifically, we show improved performance in a complex, adversarial cybersecurity network environment, the Cyber Operations Research Gym, and in classical multi-armed bandit settings.
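
For orientation, the name Exp3-IXrl suggests a connection to Exp3-IX, the standard adversarial bandit algorithm with implicit exploration. The record does not describe the authors' method in detail, so the following is only a minimal sketch of plain Exp3-IX in a multi-armed bandit setting, not of Exp3-IXrl itself. The function name `exp3_ix`, the `loss_fn` callback, the example losses, and the step-size choice are all assumptions made for illustration.

```python
import math
import random


def exp3_ix(loss_fn, n_arms, horizon, eta, gamma, rng=None):
    """Run standard Exp3-IX; losses returned by loss_fn must lie in [0, 1].

    Hypothetical helper for illustration only (not the paper's Exp3-IXrl).
    Returns the pull count per arm and the final sampling distribution.
    """
    rng = rng or random.Random(0)
    cum_est = [0.0] * n_arms  # cumulative estimated loss per arm
    pulls = [0] * n_arms
    probs = [1.0 / n_arms] * n_arms
    for t in range(horizon):
        # Exponential weights; subtracting the minimum avoids underflow.
        base = min(cum_est)
        weights = [math.exp(-eta * (c - base)) for c in cum_est]
        total = sum(weights)
        probs = [w / total for w in weights]
        arm = rng.choices(range(n_arms), weights=probs)[0]
        loss = loss_fn(t, arm)
        # Implicit exploration: gamma in the denominator biases the
        # importance-weighted estimate downward and lowers its variance.
        cum_est[arm] += loss / (probs[arm] + gamma)
        pulls[arm] += 1
    return pulls, probs


# Example with assumed fixed per-arm losses; arm 1 has the lowest loss.
K, T = 3, 5000
eta = math.sqrt(2 * math.log(K) / (K * T))  # one commonly cited tuning
arm_losses = [0.8, 0.2, 0.5]
pulls, probs = exp3_ix(lambda t, a: arm_losses[a], K, T, eta, eta / 2,
                       random.Random(1))
print(pulls, [round(p, 3) for p in probs])
```

In this sketch the learner only sees the loss of the arm it pulled, and the sampling distribution concentrates on the lowest-loss arm over time. How Exp3-IXrl combines such an update with an RL agent's action selection is described in the paper itself.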