Xin, R., Liu, H., Wang, Z., Zhang, Y., Sui, D., Hu, X., & Wang, B. (2025). Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers.
Chicago Style (17th ed.) CitationXin, Rihui, Han Liu, Zecheng Wang, Yupeng Zhang, Dianbo Sui, Xiaolin Hu, and Bingning Wang. Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems Without Ground Truth Answers. 2025.
MLA (9th ed.) CitationXin, Rihui, et al. Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems Without Ground Truth Answers. 2025.
Warning: These citations may not always be 100% accurate.