Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Dan, Nifu, Cai, Yujun, Wang, Yiwei
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2507.01334
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911035851538432
author	Dan, Nifu Cai, Yujun Wang, Yiwei
author_facet	Dan, Nifu Cai, Yujun Wang, Yiwei
contents	Navigating the complexities of physics reasoning has long been a difficult task for Large Language Models (LLMs), requiring a synthesis of profound conceptual understanding and adept problem-solving techniques. In this study, we investigate the application of advanced instruction-tuned reasoning models, such as Deepseek-R1, to address a diverse spectrum of physics problems curated from the challenging SciBench benchmark. Our comprehensive experimental evaluation reveals the remarkable capabilities of reasoning models. Not only do they achieve state-of-the-art accuracy in answering intricate physics questions, but they also generate distinctive reasoning patterns that emphasize on symbolic derivation. Furthermore, our findings indicate that even for these highly sophisticated reasoning models, the strategic incorporation of few-shot prompting can still yield measurable improvements in overall accuracy, highlighting the potential for continued performance gains.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_01334
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs Dan, Nifu Cai, Yujun Wang, Yiwei Computation and Language Navigating the complexities of physics reasoning has long been a difficult task for Large Language Models (LLMs), requiring a synthesis of profound conceptual understanding and adept problem-solving techniques. In this study, we investigate the application of advanced instruction-tuned reasoning models, such as Deepseek-R1, to address a diverse spectrum of physics problems curated from the challenging SciBench benchmark. Our comprehensive experimental evaluation reveals the remarkable capabilities of reasoning models. Not only do they achieve state-of-the-art accuracy in answering intricate physics questions, but they also generate distinctive reasoning patterns that emphasize on symbolic derivation. Furthermore, our findings indicate that even for these highly sophisticated reasoning models, the strategic incorporation of few-shot prompting can still yield measurable improvements in overall accuracy, highlighting the potential for continued performance gains.
title	Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs
topic	Computation and Language
url	https://arxiv.org/abs/2507.01334

Similar Items