Song, Z., Qiang, W., Zhao, S., Zheng, C., & Hua, G. (2025). Reward Model Generalization for Compute-Aware Test-Time Reasoning.
Chicago Style (17th ed.) Citation: Song, Zeen, Wenwen Qiang, Siyu Zhao, Changwen Zheng, and Gang Hua. Reward Model Generalization for Compute-Aware Test-Time Reasoning. 2025.
MLA (9th ed.) Citation: Song, Zeen, et al. Reward Model Generalization for Compute-Aware Test-Time Reasoning. 2025.