You, H., Liu, Y., Zong, D., Wang, Q., Vitchutripop, T., Wang, Q., . . . Abraham, I. (2026). Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient.
Cita Chicago Style (17a ed.)You, Haoxiang, Yilang Liu, Davis Zong, Qian Wang, Teeratham Vitchutripop, Qi Wang, Daniel Rakita, y Ian Abraham. Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient. 2026.
Cita MLA (9a ed.)You, Haoxiang, et al. Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient. 2026.
Precaución: Estas citas no son 100% exactas.