Wang, H., Yu, X., Wu, J., Liu, J., Sun, X., Bansal, M., & Liu, Z. (2026). Stabilizing Efficient Reasoning with Step-Level Advantage Selection.
Chicago Style (17th ed.) CitationWang, Han, Xiaodong Yu, Jialian Wu, Jiang Liu, Ximeng Sun, Mohit Bansal, and Zicheng Liu. Stabilizing Efficient Reasoning with Step-Level Advantage Selection. 2026.
MLA (9th ed.) CitationWang, Han, et al. Stabilizing Efficient Reasoning with Step-Level Advantage Selection. 2026.
Warning: These citations may not always be 100% accurate.