Wang, B., Zheng, R., Chen, L., Liu, Y., Dou, S., Huang, C., . . . Jiang, Y. (2024). Secrets of RLHF in Large Language Models Part II: Reward Modeling.
Chicago Style (17th ed.) CitationWang, Binghai, et al. Secrets of RLHF in Large Language Models Part II: Reward Modeling. 2024.
MLA (9th ed.) CitationWang, Binghai, et al. Secrets of RLHF in Large Language Models Part II: Reward Modeling. 2024.
Warning: These citations may not always be 100% accurate.