Wong, M. F., & Tan, C. W. (2025). Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models.
Chicago Style (17th ed.) CitationWong, Man Fai, and Chee Wei Tan. Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models. 2025.
MLA (9th ed.) CitationWong, Man Fai, and Chee Wei Tan. Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models. 2025.
Warning: These citations may not always be 100% accurate.