Wang, H., Wang, J., Wu, S., & Xiao, X. (2026). GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL.
Chicago Style (17th ed.) Citation: Wang, Haoyu, Jingcheng Wang, Shunyu Wu, and Xinwei Xiao. GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL. 2026.
MLA (9th ed.) Citation: Wang, Haoyu, et al. GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL. 2026.