Kumar, A. (2024). Residual vector quantization for KV cache compression in large language model.
Chicago Style (17th ed.) CitationKumar, Ankur. Residual Vector Quantization for KV Cache Compression in Large Language Model. 2024.
MLA (9th ed.) CitationKumar, Ankur. Residual Vector Quantization for KV Cache Compression in Large Language Model. 2024.
Warning: These citations may not always be 100% accurate.