Zhang, Z., Song, K., Wang, X., Hu, Y., Yan, W., Zhao, C., . . . Wang, S. (2026). CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use.
Chicago Style (17th ed.) Citation: Zhang, Zhen, et al. CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use. 2026.
MLA (9th ed.) Citation: Zhang, Zhen, et al. CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use. 2026.