Saved in:
Bibliographic Details
Main Authors: Zhang, Shuibai, Peng, Fred Zhangzhi, Zhang, Yiheng, Pan, Jin, Chrysos, Grigorios G.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.15596
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915761040130048
author Zhang, Shuibai
Peng, Fred Zhangzhi
Zhang, Yiheng
Pan, Jin
Chrysos, Grigorios G.
author_facet Zhang, Shuibai
Peng, Fred Zhangzhi
Zhang, Yiheng
Pan, Jin
Chrysos, Grigorios G.
contents While Diffusion Language Models (DLMs) are theoretically well-suited for iterative refinement due to their non-causal structure, they often fail to reliably revise incorrect tokens in practice. The key challenge lies in the model's inability to distinguish between correct and erroneous tokens in a visible sequence. Standard masked diffusion language model (MDLM) training is restricted to the objective of unmasking, undermining the effectiveness of refinement guided by confidence. Based on this observation, we study corrective behavior in DLMs, defined as the ability to assign lower confidence to incorrect tokens and iteratively refine them while preserving correct content. We show that this capability is not induced by conventional masked diffusion objectives and propose a post-training principle oriented by correction that explicitly supervises visible incorrect tokens, enabling discriminative confidence and targeted refinement. To evaluate corrective behavior, we introduce the Code Revision Benchmark, a controllable and executable benchmark for assessing error localization and in-place correction. Experiments on code revision tasks and parallel decoding scenarios demonstrate that models trained with our approach substantially outperform standard MDLMs, with gains that are most pronounced when parallel decoding introduces substantial uncertainty and iterative refinement becomes essential. Our code is publicly available at https://github.com/zhangshuibai/CDLM.
format Preprint
id arxiv_https___arxiv_org_abs_2512_15596
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Corrective Diffusion Language Models
Zhang, Shuibai
Peng, Fred Zhangzhi
Zhang, Yiheng
Pan, Jin
Chrysos, Grigorios G.
Machine Learning
While Diffusion Language Models (DLMs) are theoretically well-suited for iterative refinement due to their non-causal structure, they often fail to reliably revise incorrect tokens in practice. The key challenge lies in the model's inability to distinguish between correct and erroneous tokens in a visible sequence. Standard masked diffusion language model (MDLM) training is restricted to the objective of unmasking, undermining the effectiveness of refinement guided by confidence. Based on this observation, we study corrective behavior in DLMs, defined as the ability to assign lower confidence to incorrect tokens and iteratively refine them while preserving correct content. We show that this capability is not induced by conventional masked diffusion objectives and propose a post-training principle oriented by correction that explicitly supervises visible incorrect tokens, enabling discriminative confidence and targeted refinement. To evaluate corrective behavior, we introduce the Code Revision Benchmark, a controllable and executable benchmark for assessing error localization and in-place correction. Experiments on code revision tasks and parallel decoding scenarios demonstrate that models trained with our approach substantially outperform standard MDLMs, with gains that are most pronounced when parallel decoding introduces substantial uncertainty and iterative refinement becomes essential. Our code is publicly available at https://github.com/zhangshuibai/CDLM.
title Corrective Diffusion Language Models
topic Machine Learning
url https://arxiv.org/abs/2512.15596