Dong, X., Wang, S., Lin, D., Chen, B., & Hassan, A. E. (2026). Beyond Tokens: Semantic-Aware Speculative Decoding for Efficient Inference by Probing Internal States.
Chicago Style (17th ed.) CitationDong, Ximing, Shaowei Wang, Dayi Lin, Boyuan Chen, and Ahmed E. Hassan. Beyond Tokens: Semantic-Aware Speculative Decoding for Efficient Inference by Probing Internal States. 2026.
MLA (9th ed.) CitationDong, Ximing, et al. Beyond Tokens: Semantic-Aware Speculative Decoding for Efficient Inference by Probing Internal States. 2026.
Warning: These citations may not always be 100% accurate.