Saved in:
Bibliographic Details
Main Authors: Xu, Tianshuo, Wang, Kai, Chen, Zhifei, Wu, Leyi, Wen, Tianshui, Chao, Fei, Chen, Ying-Cong
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.13745
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911415733846016
author Xu, Tianshuo
Wang, Kai
Chen, Zhifei
Wu, Leyi
Wen, Tianshui
Chao, Fei
Chen, Ying-Cong
author_facet Xu, Tianshuo
Wang, Kai
Chen, Zhifei
Wu, Leyi
Wen, Tianshui
Chao, Fei
Chen, Ying-Cong
contents Computational replication of Chinese calligraphy remains challenging. Existing methods falter, either creating high-quality isolated characters while ignoring page-level aesthetics like ligatures and spacing, or attempting page synthesis at the expense of calligraphic correctness. We introduce \textbf{UniCalli}, a unified diffusion framework for column-level recognition and generation. Training both tasks jointly is deliberate: recognition constrains the generator to preserve character structure, while generation provides style and layout priors. This synergy fosters concept-level abstractions that improve both tasks, especially in limited-data regimes. We curated a dataset of over 8,000 digitized pieces, with ~4,000 densely annotated. UniCalli employs asymmetric noising and a rasterized box map for spatial priors, trained on a mix of synthetic, labeled, and unlabeled data. The model achieves state-of-the-art generative quality with superior ligature continuity and layout fidelity, alongside stronger recognition. The framework successfully extends to other ancient scripts, including Oracle bone inscriptions and Egyptian hieroglyphs. Code and data can be viewed in \href{https://github.com/EnVision-Research/UniCalli}{this URL}.
format Preprint
id arxiv_https___arxiv_org_abs_2510_13745
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy
Xu, Tianshuo
Wang, Kai
Chen, Zhifei
Wu, Leyi
Wen, Tianshui
Chao, Fei
Chen, Ying-Cong
Computer Vision and Pattern Recognition
Computational replication of Chinese calligraphy remains challenging. Existing methods falter, either creating high-quality isolated characters while ignoring page-level aesthetics like ligatures and spacing, or attempting page synthesis at the expense of calligraphic correctness. We introduce \textbf{UniCalli}, a unified diffusion framework for column-level recognition and generation. Training both tasks jointly is deliberate: recognition constrains the generator to preserve character structure, while generation provides style and layout priors. This synergy fosters concept-level abstractions that improve both tasks, especially in limited-data regimes. We curated a dataset of over 8,000 digitized pieces, with ~4,000 densely annotated. UniCalli employs asymmetric noising and a rasterized box map for spatial priors, trained on a mix of synthetic, labeled, and unlabeled data. The model achieves state-of-the-art generative quality with superior ligature continuity and layout fidelity, alongside stronger recognition. The framework successfully extends to other ancient scripts, including Oracle bone inscriptions and Egyptian hieroglyphs. Code and data can be viewed in \href{https://github.com/EnVision-Research/UniCalli}{this URL}.
title UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2510.13745