Rajaram, S., Cotton, R. J., & Sinz, F. H. (2025). Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning.
Citazione stile Chigago Style (17a edizione)Rajaram, Sara, R. James Cotton, e Fabian H. Sinz. Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning. 2025.
Citatione MLA (9a ed.)Rajaram, Sara, et al. Similarity as Reward Alignment: Robust and Versatile Preference-based Reinforcement Learning. 2025.
Attenzione: Queste citazioni potrebbero non essere precise al 100%.