APA (7th ed.) Citation

Hu, H., Qian, J., & Simchi-Levi, D. (2026). Model-based reinforcement learning with double oracle efficiency in policy optimization and offline estimation.

Chicago Style (17th ed.) Citation

Hu, Haichen, Jian Qian, and David Simchi-Levi. Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation. 2026.

MLA (9th ed.) Citation

Hu, Haichen, et al. Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation. 2026.

Warning: These citations may not always be 100% accurate.