Zhao, S., Xu, Y., Zhu, L., & Yang, Y. (2025). Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data.
Chicago Style (17th ed.) CitationZhao, Shuai, Yunqiu Xu, Linchao Zhu, and Yi Yang. Learning from Reference Answers: Versatile Language Model Alignment Without Binary Human Preference Data. 2025.
MLA (9th ed.) CitationZhao, Shuai, et al. Learning from Reference Answers: Versatile Language Model Alignment Without Binary Human Preference Data. 2025.
Warning: These citations may not always be 100% accurate.