Zhou, E., Sheng, K., Chen, H., & He, X. (2025). CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference.
Chicago Style (17th ed.) CitationZhou, Enyu, Kai Sheng, Hao Chen, and Xin He. CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference. 2025.
MLA (9th ed.) CitationZhou, Enyu, et al. CARD: A Cache-Assisted Parallel Speculative Decoding Framework via Query-and-Correct Paradigm for Accelerating LLM Inference. 2025.
Warning: These citations may not always be 100% accurate.