Zhuang, Y., Chen, J., Pang, B., Gu, Y., Zhu, Y., Jiang, Y., . . . Zhang, H. (2025). Efficient Long-context Language Model Training by Core Attention Disaggregation.
Chicago Style (17th ed.) CitationZhuang, Yonghao, Junda Chen, Bo Pang, Yi Gu, Yibo Zhu, Yimin Jiang, Ion Stoica, Eric Xing, and Hao Zhang. Efficient Long-context Language Model Training by Core Attention Disaggregation. 2025.
MLA (9th ed.) CitationZhuang, Yonghao, et al. Efficient Long-context Language Model Training by Core Attention Disaggregation. 2025.
Warning: These citations may not always be 100% accurate.