Han, P., Krishnan, A., Friedland, G., You, J., & Kong, C. (2025). Self-Aligned Reward: Towards Effective and Efficient Reasoners.
Chicago Style (17th ed.) Citation: Han, Peixuan, Adit Krishnan, Gerald Friedland, Jiaxuan You, and Chris Kong. Self-Aligned Reward: Towards Effective and Efficient Reasoners. 2025.
MLA (9th ed.) Citation: Han, Peixuan, et al. Self-Aligned Reward: Towards Effective and Efficient Reasoners. 2025.