Liu, Y., Li, S., Cao, L., Xie, Y., Zhou, M., Dong, H., . . . Zhang, D. (2025). SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning.
Chicago Style (17th ed.) CitationLiu, Yihao, Shuocheng Li, Lang Cao, Yuhang Xie, Mengyu Zhou, Haoyu Dong, Xiaojun Ma, Shi Han, and Dongmei Zhang. SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning. 2025.
MLA (9th ed.) CitationLiu, Yihao, et al. SuperRL: Reinforcement Learning with Supervision to Boost Language Model Reasoning. 2025.
Warning: These citations may not always be 100% accurate.