Lin, M., Gong, Z., Tang, M., Li, Q., Wang, C., Ma, J., . . . Lu, H. (2026). Expo: Exploration-prioritized policy optimization via adaptive KL regulation and Gaussian curriculum sampling.
Chicago Style (17th ed.) Citation: Lin, Mingxiong, Zhangquan Gong, Maowen Tang, Qian Li, Chuangchuang Wang, Jian Ma, Sutian Huang, Kai Tang, and Haonan Lu. Expo: Exploration-prioritized Policy Optimization via Adaptive KL Regulation and Gaussian Curriculum Sampling. 2026.
MLA (9th ed.) Citation: Lin, Mingxiong, et al. Expo: Exploration-prioritized Policy Optimization via Adaptive KL Regulation and Gaussian Curriculum Sampling. 2026.