APA (7th ed.) Citation

Tang, W., & Zhou, X. Y. (2024). Regret of exploratory policy improvement and $q$-learning.

Chicago Style (17th ed.) Citation

Tang, Wenpin, and Xun Yu Zhou. Regret of Exploratory Policy Improvement and $q$-learning. 2024.

MLA (9th ed.) Citation

Tang, Wenpin, and Xun Yu Zhou. Regret of Exploratory Policy Improvement and $q$-learning. 2024.

Warning: These citations may not always be 100% accurate.