APA (7th ed.) Citation

Dinh, T. A., Mullov, C., Bärmann, L., Li, Z., Liu, D., Reiß, S., . . . Niehues, J. (2024). SciEx: Benchmarking large language models on scientific exams with human expert grading and automatic grading.

Chicago Style (17th ed.) Citation

Dinh, Tu Anh, et al. SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading. 2024.

MLA (9th ed.) Citation

Dinh, Tu Anh, et al. SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading. 2024.

Warning: These citations may not always be 100% accurate.