Kikkawa, N., & Ohno, H. (2024). Unified theory of upper confidence bound policies for bandit problems targeting total reward, maximal reward, and more.
Chicago Style citation (17th edition): Kikkawa, Nobuaki, and Hiroshi Ohno. Unified Theory of Upper Confidence Bound Policies for Bandit Problems Targeting Total Reward, Maximal Reward, and More. 2024.
MLA citation (9th ed.): Kikkawa, Nobuaki, and Hiroshi Ohno. Unified Theory of Upper Confidence Bound Policies for Bandit Problems Targeting Total Reward, Maximal Reward, and More. 2024.
Warning: These citations may not be 100% accurate.