Yu, Q., Tartaglini, A., Hase, P., Guestrin, C., & Potts, C. (2026). Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning.
Chicago Style (17th ed.) Citation: Yu, Qinan, Alexa Tartaglini, Peter Hase, Carlos Guestrin, and Christopher Potts. Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning. 2026.
MLA (9th ed.) Citation: Yu, Qinan, et al. Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning. 2026.