Zhang, W., Wu, L., Zhao, C., Chang, E., Zhuge, M., Liu, Z., . . . Wen, W. (2026). dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models.
Cita Chicago Style (17a ed.)Zhang, Wenxuan, et al. DTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models. 2026.
Cita MLA (9a ed.)Zhang, Wenxuan, et al. DTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models. 2026.
Precaución: Estas citas no son 100% exactas.