APA (7th ed.) Citation

Bagirov, F., Arkhipov, M., Sycheva, K., Glukhov, E., & Bogomolov, E. (2025). The best of N worlds: Aligning reinforcement learning with best-of-N sampling via max@k optimisation.

Chicago Style (17th ed.) Citation

Bagirov, Farid, Mikhail Arkhipov, Ksenia Sycheva, Evgeniy Glukhov, and Egor Bogomolov. The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation. 2025.

MLA (9th ed.) Citation

Bagirov, Farid, et al. The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation. 2025.
