Wang, S., Chen, Z., Li, B., He, K., Zhang, M., & Wang, J. (2024). Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models.
Chicago Style (17th ed.) Citation: Wang, Siqi, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, and Jingang Wang. Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models. 2024.
MLA (9th ed.) Citation: Wang, Siqi, et al. Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models. 2024.