Li, B., Ma, N., & Wang, Z. (2024). Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space.
Chicago Style (17th ed.) CitationLi, Bangzheng, Ningshan Ma, and Zifan Wang. Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space. 2024.
MLA (9th ed.) CitationLi, Bangzheng, et al. Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space. 2024.
Warning: These citations may not always be 100% accurate.