Liu, J., Li, Q., & Du, W. (2024). Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models.
Chicago Style (17th ed.) CitationLiu, Jin, Qingquan Li, and Wenlong Du. Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models. 2024.
MLA (9th ed.) CitationLiu, Jin, et al. Beyond Benchmarking: A New Paradigm for Evaluation and Assessment of Large Language Models. 2024.
Warning: These citations may not always be 100% accurate.