He, H., Rong, Z., Ji, K., Li, C., Huang, Q., Xia, C., . . . Zhang, H. (2025). Rethinking Reasoning Quality in Large Language Models through Enhanced Chain-of-Thought via RL.
Chicago Style (17th ed.) CitationHe, Haoyang, Zihua Rong, Kun Ji, Chenyang Li, Qing Huang, Chong Xia, Lan Yang, and Honggang Zhang. Rethinking Reasoning Quality in Large Language Models Through Enhanced Chain-of-Thought via RL. 2025.
MLA (9th ed.) CitationHe, Haoyang, et al. Rethinking Reasoning Quality in Large Language Models Through Enhanced Chain-of-Thought via RL. 2025.
Warning: These citations may not always be 100% accurate.