Jia, C. (2025). Bootstrapping LLMs via Preference-Based Policy Optimization.
Chicago Style (17th ed.) CitationJia, Chen. Bootstrapping LLMs via Preference-Based Policy Optimization. 2025.
MLA (9th ed.) CitationJia, Chen. Bootstrapping LLMs via Preference-Based Policy Optimization. 2025.
Warning: These citations may not always be 100% accurate.