Shen, Y., Tu, L., & Wang, W. (2026). Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge.
Chicago Style (17th ed.) CitationShen, Yiyang, Lifu Tu, and Weiran Wang. Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge. 2026.
MLA (9th ed.) CitationShen, Yiyang, et al. Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge. 2026.
Warning: These citations may not always be 100% accurate.