Shi, D., Han, Z., Ostermann, S., Jin, R., van Genabith, J., & Xiong, D. (2026). Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models.
Chicago Style (17th ed.) CitationShi, Dan, Zhuowen Han, Simon Ostermann, Renren Jin, Josef van Genabith, and Deyi Xiong. Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models. 2026.
MLA (9th ed.) CitationShi, Dan, et al. Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models. 2026.
Warning: These citations may not always be 100% accurate.