Ishida, S., & Henriques, J. F. (2024). SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments.
Chicago Style (17th ed.) Citation: Ishida, Shu, and João F. Henriques. SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments. 2024.
MLA (9th ed.) Citation: Ishida, Shu, and João F. Henriques. SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments. 2024.