Lee, J., Lee, W., & Sim, J. (2024). Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization.
Chicago Style (17th ed.) CitationLee, Jungi, Wonbeom Lee, and Jaewoong Sim. Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization. 2024.
MLA (9th ed.) CitationLee, Jungi, et al. Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization. 2024.
Warning: These citations may not always be 100% accurate.