Felizardo, L. K., Fadda, E., Brandimarte, P., Del-Moral-Hernandez, E., & Nascimento, M. C. V. (2025). A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks.
Chicago Style (17th ed.) CitationFelizardo, Leonardo Kanashiro, Edoardo Fadda, Paolo Brandimarte, Emilio Del-Moral-Hernandez, and Mariá Cristina Vasconcelos Nascimento. A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks. 2025.
MLA (9th ed.) CitationFelizardo, Leonardo Kanashiro, et al. A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks. 2025.