Lauffer, N., Deng, X., Kundurthy, S., Kenstler, B., & Da, J. (2025). Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections.
Chicago Style (17th ed.) CitationLauffer, Niklas, Xiang Deng, Srivatsa Kundurthy, Brad Kenstler, and Jeff Da. Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections. 2025.
MLA (9th ed.) CitationLauffer, Niklas, et al. Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections. 2025.
Warning: These citations may not always be 100% accurate.