Zheng, Z., Zhang, H., & Xue, L. (2024). Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost.
Chicago Style (17th ed.) CitationZheng, Zhong, Haochen Zhang, and Lingzhou Xue. Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost. 2024.
MLA (9th ed.) CitationZheng, Zhong, et al. Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost. 2024.
Warning: These citations may not always be 100% accurate.