Choi, J., Park, S., Cho, C., Park, H., & Kim, B. (2026). Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory.
Chicago Style (17th ed.) CitationChoi, Junhyuk, Sohhyung Park, Chanhee Cho, Hyeonchu Park, and Bugeun Kim. Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory. 2026.
MLA (9th ed.) CitationChoi, Junhyuk, et al. Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory. 2026.
Warning: These citations may not always be 100% accurate.