Zhao, M., Hu, W., Wang, J., Lai, X., Huang, T., Min, Y., . . . Zhu, X. (2025). Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off.
Chicago Style (17th ed.) Citation: Zhao, Mingkuan, Wentao Hu, Jiayin Wang, Xin Lai, Tianchen Huang, Yuheng Min, Rui Yan, and Xiaoyan Zhu. Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off. 2025.
MLA (9th ed.) Citation: Zhao, Mingkuan, et al. Making Every Head Count: Sparse Attention Without the Speed-Performance Trade-off. 2025.