Tao, L., Du, X., & Li, S. (2025). Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis.
Chicago Style (17th ed.) Citation: Tao, Leitian, Xuefeng Du, and Sharon Li. Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis. 2025.
MLA (9th ed.) Citation: Tao, Leitian, et al. Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis. 2025.