Hua, A., Tang, K., Gu, C., Gu, J., Wong, E., & Qin, Y. (2025). Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs.
Chicago Style (17th ed.) Citation: Hua, Andong, Kenan Tang, Chenhe Gu, Jindong Gu, Eric Wong, and Yao Qin. Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs. 2025.
MLA (9th ed.) Citation: Hua, Andong, et al. Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs. 2025.