Zhu, W., Shen, Z., Shao, Z., Dai, H., & Chen, F. (2025). Tangram: Accelerating Serverless LLM Loading through GPU Memory Reuse and Affinity.
Chicago Style (17th ed.) CitationZhu, Wenbin, Zhaoyan Shen, Zili Shao, Hongjun Dai, and Feng Chen. Tangram: Accelerating Serverless LLM Loading Through GPU Memory Reuse and Affinity. 2025.
MLA (9th ed.) CitationZhu, Wenbin, et al. Tangram: Accelerating Serverless LLM Loading Through GPU Memory Reuse and Affinity. 2025.
Warning: These citations may not always be 100% accurate.