APA (7th ed.) Citation

Yu, K., Baek, B., & Lee, D. (2026). Learning Weakly Communicating Average-Reward CMDPs: Strong Duality and Improved Regret.

Chicago Style (17th ed.) Citation

Yu, Kihyun, Beomhan Baek, and Dabeen Lee. Learning Weakly Communicating Average-Reward CMDPs: Strong Duality and Improved Regret. 2026.

MLA (9th ed.) Citation

Yu, Kihyun, et al. Learning Weakly Communicating Average-Reward CMDPs: Strong Duality and Improved Regret. 2026.

Warning: These citations may not always be 100% accurate.