APA (7th ed.) Citation

Zhao, Z., Ma, X., Yang, L., Feng, Y., Shi, D., He, J., . . . Wu, X. (2026). ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning across Domains.

Chicago Style (17th ed.) Citation

Zhao, Ziqi, Xinyu Ma, Liu Yang, Yujie Feng, Daiting Shi, Jingzhou He, Xin Xin, Zhaochun Ren, and Xiao-Ming Wu. ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning Across Domains. 2026.

MLA (9th ed.) Citation

Zhao, Ziqi, et al. ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning Across Domains. 2026.

Warning: These citations may not always be 100% accurate.