Zhang, H., & Yang, Z. (2024). Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning.
Chicago Style (17th ed.) citation: Zhang, Haobin, and Zhuang Yang. Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning. 2024.
MLA (9th ed.) citation: Zhang, Haobin, and Zhuang Yang. Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning. 2024.
Caution: These citations may not be 100% accurate.