Xie, X., Ding, K., Yan, S., Toh, K., & Wei, T. (2024). Optimization Hyper-parameter Laws for Large Language Models.
Chicago Style (17th ed.) CitationXie, Xingyu, Kuangyu Ding, Shuicheng Yan, Kim-Chuan Toh, and Tianwen Wei. Optimization Hyper-parameter Laws for Large Language Models. 2024.
MLA (9th ed.) CitationXie, Xingyu, et al. Optimization Hyper-parameter Laws for Large Language Models. 2024.
Warning: These citations may not always be 100% accurate.