Wang, H., Cui, H., Zhang, C., Liu, X., Jin, S., Geng, S., . . . Sun, Y. (2026). T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning.
Chicago Style (17th ed.) CitationWang, Haixin, et al. T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning. 2026.
MLA (9th ed.) CitationWang, Haixin, et al. T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning. 2026.
Warning: These citations may not always be 100% accurate.