Landers, M., Killian, T. W., Hartvigsen, T., & Doryab, A. (2026). Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization.
Chicago Style (17th ed.) Citation: Landers, Matthew, Taylor W. Killian, Thomas Hartvigsen, and Afsaneh Doryab. Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization. 2026.
MLA (9th ed.) Citation: Landers, Matthew, et al. Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization. 2026.