Chen, Z., Chen, Z., Si, N., & Wang, S. (2026). Achieving $\varepsilon^{-2}$ Dependence for Average-Reward Q-Learning with a New Contraction Principle.
Chicago Style (17th ed.) CitationChen, Zijun, Zaiwei Chen, Nian Si, and Shengbo Wang. Achieving $\varepsilon^{-2}$ Dependence for Average-Reward Q-Learning with a New Contraction Principle. 2026.
MLA (9th ed.) CitationChen, Zijun, et al. Achieving $\varepsilon^{-2}$ Dependence for Average-Reward Q-Learning with a New Contraction Principle. 2026.
Warning: These citations may not always be 100% accurate.