Wang, Y., He, W., & Yang, T. (2024). Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information.
Chicago Style (17th ed.) CitationWang, Yanshu, Wenyang He, and Tong Yang. Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information. 2024.
MLA (9th ed.) CitationWang, Yanshu, et al. Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information. 2024.
Warning: These citations may not always be 100% accurate.