Phung, T. S., & Thain, D. (2025). Efficiently Executing High-throughput Lightweight LLM Inference Applications on Heterogeneous Opportunistic GPU Clusters with Pervasive Context Management.
Chicago Style (17th ed.) Citation: Phung, Thanh Son, and Douglas Thain. Efficiently Executing High-throughput Lightweight LLM Inference Applications on Heterogeneous Opportunistic GPU Clusters with Pervasive Context Management. 2025.
MLA (9th ed.) Citation: Phung, Thanh Son, and Douglas Thain. Efficiently Executing High-throughput Lightweight LLM Inference Applications on Heterogeneous Opportunistic GPU Clusters with Pervasive Context Management. 2025.