Liang, R., Hsu, C., Yu, C., Agrawal, S., Huang, S., Lin, C., . . . Sun, S. (2025). Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors.
Chicago Style (17th ed.) Citation: Liang, Ren-Wei, Chin-Ting Hsu, Chan-Hung Yu, Saransh Agrawal, Shih-Cheng Huang, Chieh-Yen Lin, Shang-Tse Chen, Kuan-Hao Huang, and Shao-Hua Sun. Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors. 2025.
MLA (9th ed.) Citation: Liang, Ren-Wei, et al. Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors. 2025.