Lyu, X., Li, S., Siriya, S., Pu, Y., & Chen, M. (2020). Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods.
Citación estilo ChicagoLyu, Xubo, Site Li, Seth Siriya, Ye Pu, and Mo Chen. Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods. 2020.
Cita MLALyu, Xubo, et al. Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods. 2020.
Warning: These citations may not always be 100% accurate.