Xu, T., Chen, Y., & Li, M. (2026). CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning.
Chicago Style (17th ed.) Citation: Xu, Tianshi, Yuteng Chen, and Meng Li. CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning. 2026.
MLA (9th ed.) Citation: Xu, Tianshi, et al. CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning. 2026.