Blackwell, R. E., Barry, J., & Cohn, A. G. (2024). Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores.
Chicago Style (17th ed.) CitationBlackwell, Robert E., Jon Barry, and Anthony G. Cohn. Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores. 2024.
MLA (9th ed.) CitationBlackwell, Robert E., et al. Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores. 2024.
Warning: These citations may not always be 100% accurate.