Wang, Q., Vahidian, S., Ye, H., Gu, J., Zhang, J., & Chen, Y. (2024). CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation.
Cita Chicago Style (17a ed.)Wang, Qinsi, Saeed Vahidian, Hancheng Ye, Jianyang Gu, Jianyi Zhang, y Yiran Chen. CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation. 2024.
Cita MLA (9a ed.)Wang, Qinsi, et al. CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation. 2024.
Precaución: Estas citas no son 100% exactas.