Tao, L., Kulikov, I., Saha, S., Wang, T., Xu, J., Li, S., . . . Yu, P. (2025). Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense.
Chicago Style (17th ed.) Citation: Tao, Leitian, Ilia Kulikov, Swarnadeep Saha, Tianlu Wang, Jing Xu, Sharon Li, Jason E. Weston, and Ping Yu. Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense. 2025.
MLA (9th ed.) Citation: Tao, Leitian, et al. Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense. 2025.