Cheng, P., Yang, Y., Li, J., Dai, Y., Hu, T., Cao, P., . . . Li, X. (2023). Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game.
Chicago Style (17th ed.) CitationCheng, Pengyu, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du, and Xiaolong Li. Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game. 2023.
MLA (9th ed.) CitationCheng, Pengyu, et al. Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game. 2023.
Warning: These citations may not always be 100% accurate.