Li, D., Zhou, J., Brunswic, L. M., Ghaddar, A., Sun, Q., Ma, L., . . . Zhang, Y. (2025). Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling.
Chicago Style (17th ed.) Citation: Li, Derek, et al. Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling. 2025.
MLA (9th ed.) Citation: Li, Derek, et al. Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling. 2025.