Xie, X., Li, T., & Zhu, Q. (2024). Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Data.
Chicago Style (17th ed.) Citation: Xie, Xinhong, Tao Li, and Quanyan Zhu. Learning from Response Not Preference: A Stackelberg Approach for LLM Detoxification Using Non-parallel Data. 2024.
MLA (9th ed.) Citation: Xie, Xinhong, et al. Learning from Response Not Preference: A Stackelberg Approach for LLM Detoxification Using Non-parallel Data. 2024.