Chen, H. M., Mo, Z., Lee, R., Wang, Q., Li, D., Hu, S. X., . . . Fan, H. (2026). Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs.
Chicago Style (17th ed.) CitationChen, Hao Mark, Zhiwen Mo, Royson Lee, Qianzhou Wang, Da Li, Shell Xu Hu, Wayne Luk, Timothy Hospedales, and Hongxiang Fan. Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs. 2026.
MLA (9th ed.) CitationChen, Hao Mark, et al. Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs. 2026.
Warning: These citations may not always be 100% accurate.