Jin, W., Song, M., Pala, T. D., Chia, Y. K., Zadeh, A., Li, C., & Poria, S. (2025). PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference.
Chicago Style (17th ed.) Citation: Jin, Weisheng, Maojia Song, Tej Deep Pala, Yew Ken Chia, Amir Zadeh, Chuan Li, and Soujanya Poria. PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference. 2025.
MLA (9th ed.) Citation: Jin, Weisheng, et al. PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference. 2025.