Matrenok, S., Moalla, S., & Gulcehre, C. (2025). Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions.
Chicago Style (17th ed.) Citation: Matrenok, Simon, Skander Moalla, and Caglar Gulcehre. Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions. 2025.
MLA (9th ed.) Citation: Matrenok, Simon, et al. Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions. 2025.