APA (7th ed.) Citation

Liang, Z., Zhou, Y., Lu, S., Zhang, X., Mi, H., & Yu, D. (2026). Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data.

Chicago Style (17th ed.) Citation

Liang, Zhenwen, Yujun Zhou, Sidi Lu, Xiangliang Zhang, Haitao Mi, and Dong Yu. Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data. 2026.

MLA (9th ed.) Citation

Liang, Zhenwen, et al. Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data. 2026.

Warning: These citations may not always be 100% accurate.