APA (7th ed.) Citation

Han, P., Krishnan, A., Friedland, G., You, J., & Kong, C. (2025). Self-aligned reward: Towards effective and efficient reasoners.

Chicago Style (17th ed.) Citation

Han, Peixuan, Adit Krishnan, Gerald Friedland, Jiaxuan You, and Chris Kong. Self-Aligned Reward: Towards Effective and Efficient Reasoners. 2025.

MLA (9th ed.) Citation

Han, Peixuan, et al. Self-Aligned Reward: Towards Effective and Efficient Reasoners. 2025.
