Kaufmann, T., Metz, Y., Keim, D., & Hüllermeier, E. (2025). ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning.
Chicago Style (17th ed.) CitationKaufmann, Timo, Yannick Metz, Daniel Keim, and Eyke Hüllermeier. ResponseRank: Data-Efficient Reward Modeling Through Preference Strength Learning. 2025.
MLA (9th ed.) CitationKaufmann, Timo, et al. ResponseRank: Data-Efficient Reward Modeling Through Preference Strength Learning. 2025.
Warning: These citations may not always be 100% accurate.