Hu, H., Qian, J., & Simchi-Levi, D. (2026). Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation.
Chicago Style (17th ed.) CitationHu, Haichen, Jian Qian, and David Simchi-Levi. Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation. 2026.
MLA (9th ed.) CitationHu, Haichen, et al. Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation. 2026.
Warning: These citations may not always be 100% accurate.