Pan, J., He, H., Bowman, S. R., & Feng, S. (2024). Spontaneous Reward Hacking in Iterative Self-Refinement.
Chicago Style (17th ed.) CitationPan, Jane, He He, Samuel R. Bowman, and Shi Feng. Spontaneous Reward Hacking in Iterative Self-Refinement. 2024.
MLA (9th ed.) CitationPan, Jane, et al. Spontaneous Reward Hacking in Iterative Self-Refinement. 2024.
Warning: These citations may not always be 100% accurate.