Liu, G., Li, C., Zhao, J., Zhang, C., & Guo, M. (2024). ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression.
Chicago Style (17th ed.) Citation: Liu, Guangda, Chengwei Li, Jieru Zhao, Chenqi Zhang, and Minyi Guo. ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression. 2024.
MLA (9th ed.) Citation: Liu, Guangda, et al. ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression. 2024.