Li, Z., Xiao, C., Wang, Y., Liu, X., Tang, Z., Lu, B., . . . Chu, X. (2025). AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models.
Chicago Style (17th ed.) CitationLi, Zeyu, Chuanfu Xiao, Yang Wang, Xiang Liu, Zhenheng Tang, Baotong Lu, Mao Yang, Xinyu Chen, and Xiaowen Chu. AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models. 2025.
MLA (9th ed.) CitationLi, Zeyu, et al. AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models. 2025.
Warning: These citations may not always be 100% accurate.