Chen, R., Liang, J., Gao, S., Wan, F., & Quan, X. (2024). Self-Evolution Fine-Tuning for Policy Optimization.
Chicago Style (17th ed.) Citation: Chen, Ruijun, Jiehao Liang, Shiping Gao, Fanqi Wan, and Xiaojun Quan. Self-Evolution Fine-Tuning for Policy Optimization. 2024.
MLA (9th ed.) Citation: Chen, Ruijun, et al. Self-Evolution Fine-Tuning for Policy Optimization. 2024.