Tang, W., & Zhou, X. Y. (2024). Regret of exploratory policy improvement and $q$-learning.
Chicago Style (17th ed.) CitationTang, Wenpin, and Xun Yu Zhou. Regret of Exploratory Policy Improvement and $q$-learning. 2024.
MLA (9th ed.) CitationTang, Wenpin, and Xun Yu Zhou. Regret of Exploratory Policy Improvement and $q$-learning. 2024.
Warning: These citations may not always be 100% accurate.