Dong, H., Johnson, T., Cho, M., & Soroush, E. (2024). Towards Low-bit Communication for Tensor Parallel LLM Inference.
Chicago Style (17th ed.) Citation: Dong, Harry, Tyler Johnson, Minsik Cho, and Emad Soroush. Towards Low-bit Communication for Tensor Parallel LLM Inference. 2024.
MLA (9th ed.) Citation: Dong, Harry, et al. Towards Low-bit Communication for Tensor Parallel LLM Inference. 2024.