APA (7th ed.) Citation

Shi, D., Han, Z., Ostermann, S., Jin, R., van Genabith, J., & Xiong, D. (2026). Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models.

Chicago Style (17th ed.) Citation

Shi, Dan, Zhuowen Han, Simon Ostermann, Renren Jin, Josef van Genabith, and Deyi Xiong. Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models. 2026.

MLA (9th ed.) Citation

Shi, Dan, et al. Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models. 2026.

Warning: These citations may not always be 100% accurate.