Surana, R., Mundada, G., Jiang, X., Wang, C., Tang, Z., Jiao, D., . . . McAuley, J. (2026). Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning.
Chicago Style (17th ed.) CitationSurana, Rohan, et al. Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning. 2026.
MLA (9th ed.) CitationSurana, Rohan, et al. Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning. 2026.
Warning: These citations may not always be 100% accurate.