APA citation: Ito, S., Luo, H., Maiti, A., Tsuchiya, T., & Wu, Y. (2026). Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret.
Chicago-style citation: Ito, Shinji, Haipeng Luo, Arnab Maiti, Taira Tsuchiya, and Yue Wu. Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret. 2026.
MLA citation: Ito, Shinji, et al. Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret. 2026.