Le, L. T., Shu, H., Nguyen, T., Hong, C. S., & Tran, N. H. (2024). $i$REPO: $i$mplicit Reward Pairwise Difference based Empirical Preference Optimization.
Chicago-style citation (17th edition): Le, Long Tan, Han Shu, Tung-Anh Nguyen, Choong Seon Hong, and Nguyen H. Tran. $i$REPO: $i$mplicit Reward Pairwise Difference Based Empirical Preference Optimization. 2024.
MLA citation (9th ed.): Le, Long Tan, et al. $i$REPO: $i$mplicit Reward Pairwise Difference Based Empirical Preference Optimization. 2024.
Warning: These citations may not be 100% accurate.