Yu, S., Xing, J., Qiao, Y., Ma, M., Li, Y., Wang, Y., . . . Sheng, Y. (2025). Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving.
Chicago Style (17th ed.) Citation: Yu, Shan, et al. Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving. 2025.
MLA (9th ed.) Citation: Yu, Shan, et al. Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving. 2025.