Hu, T., Fu, Q., Chen, Y., Liu, Z., & Ding, B. (2026). SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees.
Chicago Style (17th ed.) Citation: Hu, Tianyi, Qingxu Fu, Yanxi Chen, Zhaoyang Liu, and Bolin Ding. SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees. 2026.
MLA (9th ed.) Citation: Hu, Tianyi, et al. SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees. 2026.