APA (7th ed.) Citation

Jin, W., Song, M., Pala, T. D., Chia, Y. K., Zadeh, A., Li, C., & Poria, S. (2025). PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference.

Chicago Style (17th ed.) Citation

Jin, Weisheng, Maojia Song, Tej Deep Pala, Yew Ken Chia, Amir Zadeh, Chuan Li, and Soujanya Poria. PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference. 2025.

MLA (9th ed.) Citation

Jin, Weisheng, et al. PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference. 2025.

Warning: These citations may not always be 100% accurate.