Liu, X., Zhang, L., Ganesan, D., & Guan, H. (2025). ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference.
Chicago Style (17th ed.) CitationLiu, Xiao, Lijun Zhang, Deepak Ganesan, and Hui Guan. ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference. 2025.
MLA (9th ed.) CitationLiu, Xiao, et al. ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference. 2025.
Warning: These citations may not always be 100% accurate.