Wang, J., & Huang, J. (2026). Reward Hacking as Equilibrium under Finite Evaluation.
Chicago Style (17th ed.) Citation: Wang, Jiacheng, and Jinbin Huang. Reward Hacking as Equilibrium Under Finite Evaluation. 2026.
MLA (9th ed.) Citation: Wang, Jiacheng, and Jinbin Huang. Reward Hacking as Equilibrium Under Finite Evaluation. 2026.