Xu, M., Tan, X., Wu, J., & Zhou, D. (2026). A Judge-Aware Ranking Framework for Evaluating Large Language Models without Ground Truth.
Chicago Style (17th ed.) Citation: Xu, Mingyuan, Xinzi Tan, Jiawei Wu, and Doudou Zhou. A Judge-Aware Ranking Framework for Evaluating Large Language Models Without Ground Truth. 2026.
MLA (9th ed.) Citation: Xu, Mingyuan, et al. A Judge-Aware Ranking Framework for Evaluating Large Language Models Without Ground Truth. 2026.