Chen, Z., Yang, X., Lin, J., Sun, C., Chang, K. C., & Huang, J. (2023). Cascade Speculative Drafting for Even Faster LLM Inference.
Chicago Style (17th ed.) CitationChen, Ziyi, Xiaocong Yang, Jiacheng Lin, Chenkai Sun, Kevin Chen-Chuan Chang, and Jie Huang. Cascade Speculative Drafting for Even Faster LLM Inference. 2023.
MLA (9th ed.) CitationChen, Ziyi, et al. Cascade Speculative Drafting for Even Faster LLM Inference. 2023.
Warning: These citations may not always be 100% accurate.