Pan, X., Li, E., Li, Q., Liang, S., Shan, Y., Zhou, K., . . . Zhang, J. (2024). InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference.
Citazione stile Chigago Style (17a edizione)Pan, Xiurui, Endian Li, Qiao Li, Shengwen Liang, Yizhou Shan, Ke Zhou, Yingwei Luo, Xiaolin Wang, e Jie Zhang. InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference. 2024.
Citatione MLA (9a ed.)Pan, Xiurui, et al. InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference. 2024.
Attenzione: Queste citazioni potrebbero non essere precise al 100%.