Mahrooghi, I., Lotfi, A., & Abbe, E. (2026). Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning.
Chicago Style (17th ed.) CitationMahrooghi, Ilia, Aryo Lotfi, and Emmanuel Abbe. Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning. 2026.
MLA (9th ed.) CitationMahrooghi, Ilia, et al. Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning. 2026.
Warning: These citations may not always be 100% accurate.