Kim, W., Wang, J., Yan, J. N., Abdelfattah, M., & Rush, A. M. (2025). OverFill: Two-Stage Models for Efficient Language Model Decoding.
Chicago Style (17th ed.) CitationKim, Woojeong, Junxiong Wang, Jing Nathan Yan, Mohamed Abdelfattah, and Alexander M. Rush. OverFill: Two-Stage Models for Efficient Language Model Decoding. 2025.
MLA (9th ed.) CitationKim, Woojeong, et al. OverFill: Two-Stage Models for Efficient Language Model Decoding. 2025.
Warning: These citations may not always be 100% accurate.