Zhao, J., Fang, Z., Li, S., Yang, S., & He, S. (2024). BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference.
Chicago Style (17th ed.) Citation: Zhao, Junqi, Zhijin Fang, Shu Li, Shaohui Yang, and Shichao He. BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference. 2024.
MLA (9th ed.) Citation: Zhao, Junqi, et al. BUZZ: Beehive-structured Sparse KV Cache with Segmented Heavy Hitters for Efficient LLM Inference. 2024.