APA (7th ed.) Citation: Liu, S., Xu, S., Qiu, W., Zhang, H., & Zhu, M. (2025). Explainable reinforcement learning from human feedback to improve alignment.
Chicago Style (17th ed.) Citation: Liu, Shicheng, Siyuan Xu, Wenjie Qiu, Hangfan Zhang, and Minghui Zhu. Explainable Reinforcement Learning from Human Feedback to Improve Alignment. 2025.
MLA (9th ed.) Citation: Liu, Shicheng, et al. Explainable Reinforcement Learning from Human Feedback to Improve Alignment. 2025.