Tomihari, A. (2026). Learning Dynamics in RL Post-Training for Language Models.
Chicago Style citation (17th edition): Tomihari, Akiyoshi. Learning Dynamics in RL Post-Training for Language Models. 2026.
MLA citation (9th ed.): Tomihari, Akiyoshi. Learning Dynamics in RL Post-Training for Language Models. 2026.
Note: These citations may not be 100% accurate.