Saved in:
Bibliographic Details
Main Authors: Qin, Dantong, Bozzon, Alessandro, Yang, Xian, Zhang, Xun, Guo, Yike, Wang, Pan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.01103
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914361503645696
author Qin, Dantong
Bozzon, Alessandro
Yang, Xian
Zhang, Xun
Guo, Yike
Wang, Pan
author_facet Qin, Dantong
Bozzon, Alessandro
Yang, Xian
Zhang, Xun
Guo, Yike
Wang, Pan
contents Many creative multimedia systems are built upon visual primitives such as strokes or textures, which are difficult to collect at scale and fundamentally different from natural image data. This data scarcity makes it challenging for modern generative models to learn expressive and controllable primitives, limiting their use in process-aware content creation. In this work, we study the problem of learning human-like brushstroke generation from a small set of hand-drawn samples (n=470) and propose StrokeDiff, a diffusion-based framework with Smooth Regularization (SmR). SmR injects stochastic visual priors during training, providing a simple mechanism to stabilize diffusion models under sparse supervision without altering the inference process. We further show how the learned primitives can be made controllable through a Bézier-based conditioning module and integrated into a complete stroke-based painting pipeline, including prediction, generation, ordering, and compositing. This demonstrates how data-efficient primitive modeling can support expressive and structured multimedia content creation. Experiments indicate that the proposed approach produces diverse and structurally coherent brushstrokes and enables paintings with richer texture and layering, validated by both automatic metrics and human evaluation.
format Preprint
id arxiv_https___arxiv_org_abs_2603_01103
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Data-Efficient Brushstroke Generation with Diffusion Models for Oil Painting
Qin, Dantong
Bozzon, Alessandro
Yang, Xian
Zhang, Xun
Guo, Yike
Wang, Pan
Computer Vision and Pattern Recognition
Many creative multimedia systems are built upon visual primitives such as strokes or textures, which are difficult to collect at scale and fundamentally different from natural image data. This data scarcity makes it challenging for modern generative models to learn expressive and controllable primitives, limiting their use in process-aware content creation. In this work, we study the problem of learning human-like brushstroke generation from a small set of hand-drawn samples (n=470) and propose StrokeDiff, a diffusion-based framework with Smooth Regularization (SmR). SmR injects stochastic visual priors during training, providing a simple mechanism to stabilize diffusion models under sparse supervision without altering the inference process. We further show how the learned primitives can be made controllable through a Bézier-based conditioning module and integrated into a complete stroke-based painting pipeline, including prediction, generation, ordering, and compositing. This demonstrates how data-efficient primitive modeling can support expressive and structured multimedia content creation. Experiments indicate that the proposed approach produces diverse and structurally coherent brushstrokes and enables paintings with richer texture and layering, validated by both automatic metrics and human evaluation.
title Data-Efficient Brushstroke Generation with Diffusion Models for Oil Painting
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2603.01103