Hu, Y., Zhao, K., Huang, W., Chen, J., & Zhu, J. (2024). Accelerating Transformer Pre-training with 2:4 Sparsity.
Chicago Style (17th ed.) Citation: Hu, Yuezhou, Kang Zhao, Weiyu Huang, Jianfei Chen, and Jun Zhu. Accelerating Transformer Pre-training with 2:4 Sparsity. 2024.
MLA (9th ed.) Citation: Hu, Yuezhou, et al. Accelerating Transformer Pre-training with 2:4 Sparsity. 2024.