Zhu, Y., & Lu, Y. (2026). On the Power of (Approximate) Reward Models for Inference-Time Scaling.
Chicago Style (17th ed.) CitationZhu, Youheng, and Yiping Lu. On the Power of (Approximate) Reward Models for Inference-Time Scaling. 2026.
MLA (9th ed.) CitationZhu, Youheng, and Yiping Lu. On the Power of (Approximate) Reward Models for Inference-Time Scaling. 2026.
Warning: These citations may not always be 100% accurate.