Li, Y., Huang, Z., Wu, Y., Wang, W., Li, X., Luo, Y., . . . Liu, P. (2026). One Sample to Rule Them All: Extreme Data Efficiency in Multidiscipline Reasoning with Reinforcement Learning.
Chicago Style (17th ed.) Citation: Li, Yiyuan, Zhen Huang, Yanan Wu, Weixun Wang, Xuefeng Li, Yijia Luo, Wenbo Su, Bo Zheng, and Pengfei Liu. One Sample to Rule Them All: Extreme Data Efficiency in Multidiscipline Reasoning with Reinforcement Learning. 2026.
MLA (9th ed.) Citation: Li, Yiyuan, et al. One Sample to Rule Them All: Extreme Data Efficiency in Multidiscipline Reasoning with Reinforcement Learning. 2026.
Warning: These citations may not always be 100% accurate.