Zeng, Z., Yu, J., Pang, Q., Wang, Z., Zhuang, H., Shao, H., & Zou, X. (2024). Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing all Tokens.
Chicago Style (17th ed.) CitationZeng, Ziqian, Jiahong Yu, Qianshi Pang, Zihao Wang, Huiping Zhuang, Hongen Shao, and Xiaofeng Zou. Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing All Tokens. 2024.
MLA (9th ed.) CitationZeng, Ziqian, et al. Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing All Tokens. 2024.
Warning: These citations may not always be 100% accurate.