Liu, K., Liu, J. K., Chen, M., & Liu, Y. (2025). Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization.
Chicago Style (17th ed.) Citation: Liu, Kezhao, Jason Klein Liu, Mingtao Chen, and Yiming Liu. Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization. 2025.
MLA (9th ed.) Citation: Liu, Kezhao, et al. Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization. 2025.