Saved in:
Bibliographic Details
Main Authors: Sun, Yifei, Li, Yongan, Qin, A. K., Hou, Sicheng, Pflanzner, Tamas
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.11792
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915736824315904
author Sun, Yifei
Li, Yongan
Qin, A. K.
Hou, Sicheng
Pflanzner, Tamas
author_facet Sun, Yifei
Li, Yongan
Qin, A. K.
Hou, Sicheng
Pflanzner, Tamas
contents Mathematical problem generation (MPG) is a significant research direction in the field of intelligent education. In recent years, the rapid development of large language models (LLMs) has enabled new technological approaches to problem-generation tasks. Although existing LLMs can achieve high correctness rates, they generally lack innovation and exhibit poor discrimination. In this paper, we propose the task of innovative math problem generation (IMPG). To solve the IMPG task, this paper proposes a self-evolving, multi-role collaborative framework with fine-grained difficulty guidance. First, a multi-role collaborative mechanism comprising a sampler, generator, evaluator, state machine, and memory is constructed, ensuring the correctness of generated problems through iterative optimization informed by self-assessment and external feedback. Second, we introduce an improved difficulty model to quantify difficulty and provide fine-grained guidance. We adopt the data-driven association-guided path sampling (DAPS) algorithm to enhance the semantic rationality of sampled encodings. Third, we construct the HSM3K-CN dataset, which comprises high-quality high school math problems. A multi-stage training pipeline is adopted, incorporating continual pre-training (CPT), supervised fine-tuning (SFT), and group relative policy optimization (GRPO), to enhance the generation and evaluation capabilities of the base model. Finally, system self-evolution is achieved by transferring evaluation capabilities from the expert model to the apprentice model via distillation. Experiments show that, compared to baseline models, our proposed method significantly improves the innovation of the generated problems while maintaining a high correctness rate.
format Preprint
id arxiv_https___arxiv_org_abs_2601_11792
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle A self-evolving multi-role collaborative framework with fine-grained difficulty guidance for innovative mathematical problem generation
Sun, Yifei
Li, Yongan
Qin, A. K.
Hou, Sicheng
Pflanzner, Tamas
Artificial Intelligence
Computation and Language
Mathematical problem generation (MPG) is a significant research direction in the field of intelligent education. In recent years, the rapid development of large language models (LLMs) has enabled new technological approaches to problem-generation tasks. Although existing LLMs can achieve high correctness rates, they generally lack innovation and exhibit poor discrimination. In this paper, we propose the task of innovative math problem generation (IMPG). To solve the IMPG task, this paper proposes a self-evolving, multi-role collaborative framework with fine-grained difficulty guidance. First, a multi-role collaborative mechanism comprising a sampler, generator, evaluator, state machine, and memory is constructed, ensuring the correctness of generated problems through iterative optimization informed by self-assessment and external feedback. Second, we introduce an improved difficulty model to quantify difficulty and provide fine-grained guidance. We adopt the data-driven association-guided path sampling (DAPS) algorithm to enhance the semantic rationality of sampled encodings. Third, we construct the HSM3K-CN dataset, which comprises high-quality high school math problems. A multi-stage training pipeline is adopted, incorporating continual pre-training (CPT), supervised fine-tuning (SFT), and group relative policy optimization (GRPO), to enhance the generation and evaluation capabilities of the base model. Finally, system self-evolution is achieved by transferring evaluation capabilities from the expert model to the apprentice model via distillation. Experiments show that, compared to baseline models, our proposed method significantly improves the innovation of the generated problems while maintaining a high correctness rate.
title A self-evolving multi-role collaborative framework with fine-grained difficulty guidance for innovative mathematical problem generation
topic Artificial Intelligence
Computation and Language
url https://arxiv.org/abs/2601.11792