APA (7th ed.) Citation

Li, Q., Zhang, B., Ye, L., Zhang, Y., Wu, W., Sun, Y., . . . Xie, Y. (2024). Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference.

Chicago Style (17th ed.) Citation

Li, Qingyuan, Bo Zhang, Liang Ye, Yifan Zhang, Wei Wu, Yerui Sun, Lin Ma, and Yuchen Xie. Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference. 2024.

MLA (9th ed.) Citation

Li, Qingyuan, et al. Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference. 2024.

Warning: These citations may not always be 100% accurate.