APA (7th ed.) Citation

Luo, C., Cai, Z., Sun, H., Xiao, J., Yuan, B., Xiao, W., . . . Anandkumar, A. (2025). HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading.

Chicago Style (17th ed.) Citation

Luo, Cheng, et al. HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading. 2025.

MLA (9th ed.) Citation

Luo, Cheng, et al. HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading. 2025.

Warning: These citations may not always be 100% accurate.