Deng, L., Xu, S., Chen, J., Yan, C., Wang, J., Jiang, Z., & Shan, W. (2025). KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing.
Chicago Style (17th ed.) CitationDeng, Lishuo, Shaojie Xu, Jinwu Chen, Changwei Yan, Jiajie Wang, Zhe Jiang, and Weiwei Shan. KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing. 2025.
MLA (9th ed.) CitationDeng, Lishuo, et al. KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing. 2025.
Warning: These citations may not always be 100% accurate.