APA (7th ed.) Citation

Łańcucki, A., Staniszewski, K., Nawrot, P., & Ponti, E. M. (2025). Inference-Time Hyper-Scaling with KV Cache Compression.

Chicago Style (17th ed.) Citation

Łańcucki, Adrian, Konrad Staniszewski, Piotr Nawrot, and Edoardo M. Ponti. Inference-Time Hyper-Scaling with KV Cache Compression. 2025.

MLA (9th ed.) Citation

Łańcucki, Adrian, et al. Inference-Time Hyper-Scaling with KV Cache Compression. 2025.

Warning: These citations may not always be 100% accurate.