Li, Y., Nijkamp, E., Yavuz, S., & Joty, S. (2026). Learning from Language Feedback via Variational Policy Distillation.
Chicago Style (17th ed.) CitationLi, Yang, Erik Nijkamp, Semih Yavuz, and Shafiq Joty. Learning from Language Feedback via Variational Policy Distillation. 2026.
MLA (9th ed.) CitationLi, Yang, et al. Learning from Language Feedback via Variational Policy Distillation. 2026.
Warning: These citations may not always be 100% accurate.