Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhuang, Ziqing, Zhang, Linhai, Si, Jiasheng, Zhou, Deyu, He, Yulan
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2604.17399
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866914488744148992
author Zhuang, Ziqing
Zhang, Linhai
Si, Jiasheng
Zhou, Deyu
He, Yulan
author_facet Zhuang, Ziqing
Zhang, Linhai
Si, Jiasheng
Zhou, Deyu
He, Yulan
contents Large language models (LLMs) have demonstrated strong reasoning capabilities, and as existing approaches for enhancing LLM reasoning continue to mature, increasing attention has shifted toward meta-reasoning as a promising direction for further improvement. However, most existing meta-reasoning methods remain episodic: they focus on executing complex meta-reasoning routines within individual instances, but ignore the accumulation of reusable meta-reasoning skills across instances, leading to recurring failure modes and repeatedly high metacognitive effort. In this paper, we introduce Metacognitive Consolidation, a novel framework in which a model consolidates metacognitive experience from past reasoning episodes into reusable knowledge that improves future meta-reasoning. We instantiate this framework by structuring instance-level problem solving into distinct roles for reasoning, monitoring, and control to generate rich, attributable meta-level traces. These traces are then consolidated through a hierarchical, multi-timescale update mechanism that gradually forms evolving meta-knowledge. Experimental results demonstrate consistent performance gains across benchmarks and backbone models, and show that performance improves as metacognitive experience accumulates over time.
format Preprint
id arxiv_https___arxiv_org_abs_2604_17399
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning
Zhuang, Ziqing
Zhang, Linhai
Si, Jiasheng
Zhou, Deyu
He, Yulan
Artificial Intelligence
Large language models (LLMs) have demonstrated strong reasoning capabilities, and as existing approaches for enhancing LLM reasoning continue to mature, increasing attention has shifted toward meta-reasoning as a promising direction for further improvement. However, most existing meta-reasoning methods remain episodic: they focus on executing complex meta-reasoning routines within individual instances, but ignore the accumulation of reusable meta-reasoning skills across instances, leading to recurring failure modes and repeatedly high metacognitive effort. In this paper, we introduce Metacognitive Consolidation, a novel framework in which a model consolidates metacognitive experience from past reasoning episodes into reusable knowledge that improves future meta-reasoning. We instantiate this framework by structuring instance-level problem solving into distinct roles for reasoning, monitoring, and control to generate rich, attributable meta-level traces. These traces are then consolidated through a hierarchical, multi-timescale update mechanism that gradually forms evolving meta-knowledge. Experimental results demonstrate consistent performance gains across benchmarks and backbone models, and show that performance improves as metacognitive experience accumulates over time.
title Beyond Meta-Reasoning: Metacognitive Consolidation for Self-Improving LLM Reasoning
topic Artificial Intelligence
url https://arxiv.org/abs/2604.17399