Ding, P., Sun, W., Li, D., Zou, W., Wang, J., Chen, J., & Huang, S. (2025). SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models.
Chicago Style (17th ed.) CitationDing, Peng, Wen Sun, Dailin Li, Wei Zou, Jiaming Wang, Jiajun Chen, and Shujian Huang. SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models. 2025.
MLA (9th ed.) CitationDing, Peng, et al. SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models. 2025.
Warning: These citations may not always be 100% accurate.