Nagarkar, C., Bogachev, L., & Sharoff, S. (2026). Can LLM Reasoning Be Trusted? A Comparative Study: Using Human Benchmarking on Statistical Tasks.
Chicago Style (17th ed.) CitationNagarkar, Crish, Leonid Bogachev, and Serge Sharoff. Can LLM Reasoning Be Trusted? A Comparative Study: Using Human Benchmarking on Statistical Tasks. 2026.
MLA (9th ed.) CitationNagarkar, Crish, et al. Can LLM Reasoning Be Trusted? A Comparative Study: Using Human Benchmarking on Statistical Tasks. 2026.
Warning: These citations may not always be 100% accurate.