Liu, Y. (2025). PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation.
Chicago Style (17th ed.) CitationLiu, Yuxuan. PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation. 2025.
MLA (9th ed.) CitationLiu, Yuxuan. PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation. 2025.
Warning: These citations may not always be 100% accurate.