Dong, S., Cheng, W., Qin, J., & Wang, W. (2024). QAQ: Quality Adaptive Quantization for LLM KV Cache.
Chicago Style (17th ed.) CitationDong, Shichen, Wen Cheng, Jiayu Qin, and Wei Wang. QAQ: Quality Adaptive Quantization for LLM KV Cache. 2024.
MLA (9th ed.) CitationDong, Shichen, et al. QAQ: Quality Adaptive Quantization for LLM KV Cache. 2024.
Warning: These citations may not always be 100% accurate.