Martin, C., & Sandholm, T. (2024). AlphaZeroES: Direct score maximization outperforms planning loss minimization.
Chicago Style (17th ed.) CitationMartin, Carlos, and Tuomas Sandholm. AlphaZeroES: Direct Score Maximization Outperforms Planning Loss Minimization. 2024.
MLA (9th ed.) CitationMartin, Carlos, and Tuomas Sandholm. AlphaZeroES: Direct Score Maximization Outperforms Planning Loss Minimization. 2024.
Warning: These citations may not always be 100% accurate.