Kim, Y. (2026). MC-GRPO: Median-Centered Group Relative Policy Optimization for Small-Rollout Reinforcement Learning.
Chicago Style (17th ed.) CitationKim, Youngeun. MC-GRPO: Median-Centered Group Relative Policy Optimization for Small-Rollout Reinforcement Learning. 2026.
MLA (9th ed.) CitationKim, Youngeun. MC-GRPO: Median-Centered Group Relative Policy Optimization for Small-Rollout Reinforcement Learning. 2026.
Warning: These citations may not always be 100% accurate.