APA (7th ed.) Citation

Li, B., Ma, N., & Wang, Z. (2024). Rewarded region replay (R3) for policy learning with discrete action space.

Chicago Style (17th ed.) Citation

Li, Bangzheng, Ningshan Ma, and Zifan Wang. Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space. 2024.

MLA (9th ed.) Citation

Li, Bangzheng, et al. Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space. 2024.
