Saved in:
Bibliographic Details
Main Authors: Dan, Nifu, Cai, Yujun, Wang, Yiwei
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.01334
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911035851538432
author Dan, Nifu
Cai, Yujun
Wang, Yiwei
author_facet Dan, Nifu
Cai, Yujun
Wang, Yiwei
contents Navigating the complexities of physics reasoning has long been a difficult task for Large Language Models (LLMs), requiring a synthesis of profound conceptual understanding and adept problem-solving techniques. In this study, we investigate the application of advanced instruction-tuned reasoning models, such as Deepseek-R1, to address a diverse spectrum of physics problems curated from the challenging SciBench benchmark. Our comprehensive experimental evaluation reveals the remarkable capabilities of reasoning models. Not only do they achieve state-of-the-art accuracy in answering intricate physics questions, but they also generate distinctive reasoning patterns that emphasize on symbolic derivation. Furthermore, our findings indicate that even for these highly sophisticated reasoning models, the strategic incorporation of few-shot prompting can still yield measurable improvements in overall accuracy, highlighting the potential for continued performance gains.
format Preprint
id arxiv_https___arxiv_org_abs_2507_01334
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs
Dan, Nifu
Cai, Yujun
Wang, Yiwei
Computation and Language
Navigating the complexities of physics reasoning has long been a difficult task for Large Language Models (LLMs), requiring a synthesis of profound conceptual understanding and adept problem-solving techniques. In this study, we investigate the application of advanced instruction-tuned reasoning models, such as Deepseek-R1, to address a diverse spectrum of physics problems curated from the challenging SciBench benchmark. Our comprehensive experimental evaluation reveals the remarkable capabilities of reasoning models. Not only do they achieve state-of-the-art accuracy in answering intricate physics questions, but they also generate distinctive reasoning patterns that emphasize on symbolic derivation. Furthermore, our findings indicate that even for these highly sophisticated reasoning models, the strategic incorporation of few-shot prompting can still yield measurable improvements in overall accuracy, highlighting the potential for continued performance gains.
title Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs
topic Computation and Language
url https://arxiv.org/abs/2507.01334