Xie, Z., Chen, J., Chen, L., Mao, W., Xu, J., & Kong, L. (2025). Teaching Language Models to Critique via Reinforcement Learning.
Chicago Style (17th ed.) CitationXie, Zhihui, Jie Chen, Liyu Chen, Weichao Mao, Jingjing Xu, and Lingpeng Kong. Teaching Language Models to Critique via Reinforcement Learning. 2025.
MLA (9th ed.) CitationXie, Zhihui, et al. Teaching Language Models to Critique via Reinforcement Learning. 2025.
Warning: These citations may not always be 100% accurate.