Xia, Y., Ulmer, D., Blevins, T., Liu, Y., Schütze, H., & Roth, B. (2026). Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations.
Chicago Style (17th ed.) Citation: Xia, Yuxi, Dennis Ulmer, Terra Blevins, Yihong Liu, Hinrich Schütze, and Benjamin Roth. Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations. 2026.
MLA (9th ed.) Citation: Xia, Yuxi, et al. Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations. 2026.