APA (7th ed.) Citation

Wang, H., Yu, X., Wu, J., Liu, J., Sun, X., Bansal, M., & Liu, Z. (2026). Stabilizing Efficient Reasoning with Step-Level Advantage Selection.

Chicago Style (17th ed.) Citation

Wang, Han, Xiaodong Yu, Jialian Wu, Jiang Liu, Ximeng Sun, Mohit Bansal, and Zicheng Liu. Stabilizing Efficient Reasoning with Step-Level Advantage Selection. 2026.

MLA (9th ed.) Citation

Wang, Han, et al. Stabilizing Efficient Reasoning with Step-Level Advantage Selection. 2026.

Warning: These citations may not always be 100% accurate.