Baker, B., Huizinga, J., Gao, L., Dou, Z., Guan, M. Y., Madry, A., . . . Farhi, D. (2025). Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation.
Chicago Style (17th ed.) CitationBaker, Bowen, Joost Huizinga, Leo Gao, Zehao Dou, Melody Y. Guan, Aleksander Madry, Wojciech Zaremba, Jakub Pachocki, and David Farhi. Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation. 2025.
MLA (9th ed.) CitationBaker, Bowen, et al. Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation. 2025.
Warning: These citations may not always be 100% accurate.