Sun, Y., Li, Y., Zou, Z., Du, B., Zhang, Z., Dong, H., . . . Wang, H. (2026). VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation.
Chicago Style (17th ed.) CitationSun, Yupeng, Yanzhao Li, Zhiqiang Zou, Bai Du, Zhiyuan Zhang, Hui Dong, Gaoyige Fan, and Hui Wang. VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation. 2026.
MLA (9th ed.) CitationSun, Yupeng, et al. VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation. 2026.
Warning: These citations may not always be 100% accurate.