Wan, Q., Xu, Z., Wei, L., Shen, X., & Sun, J. (2026). Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning.
Chicago Style (17th ed.) CitationWan, Qian, Ziao Xu, Luona Wei, Xiaoxuan Shen, and Jianwen Sun. Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning. 2026.
MLA (9th ed.) CitationWan, Qian, et al. Mitigating Overthinking in Large Reasoning Models via Difficulty-aware Reinforcement Learning. 2026.
Warning: These citations may not always be 100% accurate.