APA Citation: Liang, Q., Zhu, Y., Ge, C., Yang, L., Shen, Y., Zheng, B., & Guo, S. (2026). Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning.
Chicago Style (17th ed.) Citation: Liang, Qiao, Yuke Zhu, Chao Ge, Lei Yang, Ying Shen, Bo Zheng, and Sheng Guo. Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning. 2026.
MLA (9th ed.) Citation: Liang, Qiao, et al. Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning. 2026.