Saved in:
Bibliographic Details
Main Authors: Yang, Kaiqi, Li, Hang, Chu, Yucheng, Liu, Zitao, Tian, Mi, Liu, Hui
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.08615
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915556517478400
author Yang, Kaiqi
Li, Hang
Chu, Yucheng
Liu, Zitao
Tian, Mi
Liu, Hui
author_facet Yang, Kaiqi
Li, Hang
Chu, Yucheng
Liu, Zitao
Tian, Mi
Liu, Hui
contents Mathematical reasoning serves as a crucial testbed for the intelligence of large language models (LLMs), and math word problems (MWPs) are a popular type of math problems. Most MWP datasets consist of problems containing only the necessary information, while problems with distracting and excessive conditions are often overlooked. Prior works have tested popular LLMs and found a dramatic performance drop in the presence of distracting conditions. However, datasets of MWPs with distracting conditions are limited, and most suffer from lower levels of difficulty and out-of-context expressions. This makes distracting conditions easy to identify and exclude, thus reducing the credibility of benchmarking on them. Moreover, when adding distracting conditions, the reasoning and answers may also change, requiring intensive labor to check and write the solutions. To address these issues, we design an iterative framework to generate distracting conditions using LLMs. We develop a set of prompts to revise MWPs from different perspectives and cognitive levels, encouraging the generation of distracting conditions as well as suggestions for further revision. Another advantage is the shared solutions between original and revised problems: we explicitly guide the LLMs to generate distracting conditions that do not alter the original solutions, thus avoiding the need to generate new solutions. This framework is efficient and easy to deploy, reducing the overhead of generating MWPs with distracting conditions while maintaining data quality.
format Preprint
id arxiv_https___arxiv_org_abs_2510_08615
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems
Yang, Kaiqi
Li, Hang
Chu, Yucheng
Liu, Zitao
Tian, Mi
Liu, Hui
Computation and Language
Mathematical reasoning serves as a crucial testbed for the intelligence of large language models (LLMs), and math word problems (MWPs) are a popular type of math problems. Most MWP datasets consist of problems containing only the necessary information, while problems with distracting and excessive conditions are often overlooked. Prior works have tested popular LLMs and found a dramatic performance drop in the presence of distracting conditions. However, datasets of MWPs with distracting conditions are limited, and most suffer from lower levels of difficulty and out-of-context expressions. This makes distracting conditions easy to identify and exclude, thus reducing the credibility of benchmarking on them. Moreover, when adding distracting conditions, the reasoning and answers may also change, requiring intensive labor to check and write the solutions. To address these issues, we design an iterative framework to generate distracting conditions using LLMs. We develop a set of prompts to revise MWPs from different perspectives and cognitive levels, encouraging the generation of distracting conditions as well as suggestions for further revision. Another advantage is the shared solutions between original and revised problems: we explicitly guide the LLMs to generate distracting conditions that do not alter the original solutions, thus avoiding the need to generate new solutions. This framework is efficient and easy to deploy, reducing the overhead of generating MWPs with distracting conditions while maintaining data quality.
title Iterative LLM-Based Generation and Refinement of Distracting Conditions in Math Word Problems
topic Computation and Language
url https://arxiv.org/abs/2510.08615