Wang, Z., Ramnath, K., Bi, B., Pentyala, S. K., Chaudhuri, S., Mehrotra, S., . . . Cheng. (2024). Reinforcement Learning for LLM Post-Training: A Survey.
Chicago Style (17th ed.) Citation: Wang, Zhichao, et al. Reinforcement Learning for LLM Post-Training: A Survey. 2024.
MLA (9th ed.) Citation: Wang, Zhichao, et al. Reinforcement Learning for LLM Post-Training: A Survey. 2024.