APA (7th ed.) Citation

Sullivan, M., & Koller, A. (2025). GRPO is secretly a process reward model.

Chicago Style (17th ed.) Citation

Sullivan, Michael, and Alexander Koller. GRPO Is Secretly a Process Reward Model. 2025.

MLA (9th ed.) Citation

Sullivan, Michael, and Alexander Koller. GRPO Is Secretly a Process Reward Model. 2025.
