Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Shuibai, Peng, Fred Zhangzhi, Zhang, Yiheng, Pan, Jin, Chrysos, Grigorios G.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2512.15596
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915761040130048
author	Zhang, Shuibai Peng, Fred Zhangzhi Zhang, Yiheng Pan, Jin Chrysos, Grigorios G.
author_facet	Zhang, Shuibai Peng, Fred Zhangzhi Zhang, Yiheng Pan, Jin Chrysos, Grigorios G.
contents	While Diffusion Language Models (DLMs) are theoretically well-suited for iterative refinement due to their non-causal structure, they often fail to reliably revise incorrect tokens in practice. The key challenge lies in the model's inability to distinguish between correct and erroneous tokens in a visible sequence. Standard masked diffusion language model (MDLM) training is restricted to the objective of unmasking, undermining the effectiveness of refinement guided by confidence. Based on this observation, we study corrective behavior in DLMs, defined as the ability to assign lower confidence to incorrect tokens and iteratively refine them while preserving correct content. We show that this capability is not induced by conventional masked diffusion objectives and propose a post-training principle oriented by correction that explicitly supervises visible incorrect tokens, enabling discriminative confidence and targeted refinement. To evaluate corrective behavior, we introduce the Code Revision Benchmark, a controllable and executable benchmark for assessing error localization and in-place correction. Experiments on code revision tasks and parallel decoding scenarios demonstrate that models trained with our approach substantially outperform standard MDLMs, with gains that are most pronounced when parallel decoding introduces substantial uncertainty and iterative refinement becomes essential. Our code is publicly available at https://github.com/zhangshuibai/CDLM.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_15596
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Corrective Diffusion Language Models Zhang, Shuibai Peng, Fred Zhangzhi Zhang, Yiheng Pan, Jin Chrysos, Grigorios G. Machine Learning While Diffusion Language Models (DLMs) are theoretically well-suited for iterative refinement due to their non-causal structure, they often fail to reliably revise incorrect tokens in practice. The key challenge lies in the model's inability to distinguish between correct and erroneous tokens in a visible sequence. Standard masked diffusion language model (MDLM) training is restricted to the objective of unmasking, undermining the effectiveness of refinement guided by confidence. Based on this observation, we study corrective behavior in DLMs, defined as the ability to assign lower confidence to incorrect tokens and iteratively refine them while preserving correct content. We show that this capability is not induced by conventional masked diffusion objectives and propose a post-training principle oriented by correction that explicitly supervises visible incorrect tokens, enabling discriminative confidence and targeted refinement. To evaluate corrective behavior, we introduce the Code Revision Benchmark, a controllable and executable benchmark for assessing error localization and in-place correction. Experiments on code revision tasks and parallel decoding scenarios demonstrate that models trained with our approach substantially outperform standard MDLMs, with gains that are most pronounced when parallel decoding introduces substantial uncertainty and iterative refinement becomes essential. Our code is publicly available at https://github.com/zhangshuibai/CDLM.
title	Corrective Diffusion Language Models
topic	Machine Learning
url	https://arxiv.org/abs/2512.15596

Similar Items