Zhang, J., Petrui, C., Nikolić, K., & Tramèr, F. (2025). RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics.
Chicago Style (17th ed.) CitationZhang, Jie, Cezara Petrui, Kristina Nikolić, and Florian Tramèr. RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics. 2025.
MLA (9th ed.) CitationZhang, Jie, et al. RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics. 2025.
Warning: These citations may not always be 100% accurate.