Williams, J., & Tureci, E. (2026). Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models.
Chicago Style (17th ed.) CitationWilliams, Jonathan, and Esin Tureci. Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models. 2026.
MLA (9th ed.) CitationWilliams, Jonathan, and Esin Tureci. Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models. 2026.
Warning: These citations may not always be 100% accurate.