Mu, J., Huang, H., Zhang, J., Yu, M., Wang, T., & Li, Y. (2025). SALS: Sparse Attention in Latent Space for KV cache Compression.
Chicago Style (17th ed.) CitationMu, Junlin, Hantao Huang, Jihang Zhang, Minghui Yu, Tao Wang, and Yidong Li. SALS: Sparse Attention in Latent Space for KV Cache Compression. 2025.
MLA (9th ed.) CitationMu, Junlin, et al. SALS: Sparse Attention in Latent Space for KV Cache Compression. 2025.
Warning: These citations may not always be 100% accurate.