Liu, P., Gao, Z., Zhao, W. X., Ma, Y., Wang, T., & Wen, J. (2024). Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression.
Chicago Style (17th ed.) CitationLiu, Peiyu, Ze-Feng Gao, Wayne Xin Zhao, Yipeng Ma, Tao Wang, and Ji-Rong Wen. Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression. 2024.
MLA (9th ed.) CitationLiu, Peiyu, et al. Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression. 2024.
Warning: These citations may not always be 100% accurate.