Lu, H., Zhu, M., & Yu, H. (2025). Hard Negative Sample-Augmented DPO Post-Training for Small Language Models.
Chicago Style (17th ed.) CitationLu, Haocheng, Minjun Zhu, and Henry Yu. Hard Negative Sample-Augmented DPO Post-Training for Small Language Models. 2025.
MLA (9th ed.) CitationLu, Haocheng, et al. Hard Negative Sample-Augmented DPO Post-Training for Small Language Models. 2025.
Warning: These citations may not always be 100% accurate.