Lindström, A. D., Methnani, L., Krause, L., Ericson, P., de Troya, Í. M. d. R., Mollo, D. C., & Dobbe, R. (2024). AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations.
Chicago Style (17th ed.) Citation: Lindström, Adam Dahlgren, Leila Methnani, Lea Krause, Petter Ericson, Íñigo Martínez de Rituerto de Troya, Dimitri Coelho Mollo, and Roel Dobbe. AI Alignment Through Reinforcement Learning from Human Feedback? Contradictions and Limitations. 2024.
MLA (9th ed.) Citation: Lindström, Adam Dahlgren, et al. AI Alignment Through Reinforcement Learning from Human Feedback? Contradictions and Limitations. 2024.