APA (7th ed.) Citation

Tan, Z., & Hong, Y. (2026). Self-Supervised On-Policy Distillation for Reasoning Language Models.

Chicago Style (17th ed.) Citation

Tan, Zhiquan, and Yinrong Hong. Self-Supervised On-Policy Distillation for Reasoning Language Models. 2026.

MLA (9th ed.) Citation

Tan, Zhiquan, and Yinrong Hong. Self-Supervised On-Policy Distillation for Reasoning Language Models. 2026.

Warning: These citations may not always be 100% accurate.