APA (7th ed.) Citation

Dong, H., Johnson, T., Cho, M., & Soroush, E. (2024). Towards low-bit communication for tensor parallel LLM inference.

Chicago Style (17th ed.) Citation

Dong, Harry, Tyler Johnson, Minsik Cho, and Emad Soroush. Towards Low-bit Communication for Tensor Parallel LLM Inference. 2024.

MLA (9th ed.) Citation

Dong, Harry, et al. Towards Low-bit Communication for Tensor Parallel LLM Inference. 2024.
