Saved in:
Bibliographic Details
Main Authors: Riso, Marzia, Vecchio, Giuseppe, Pellacini, Fabio
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.08930
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915756754599936
author Riso, Marzia
Vecchio, Giuseppe
Pellacini, Fabio
author_facet Riso, Marzia
Vecchio, Giuseppe
Pellacini, Fabio
contents Recent advances in diffusion models have significantly improved the synthesis of materials, textures, and 3D shapes. By conditioning these models via text or images, users can guide the generation, reducing the time required to create digital assets. In this paper, we address the synthesis of structured, stationary patterns, where diffusion models are generally less reliable and, more importantly, less controllable. Our approach leverages the generative capabilities of diffusion models specifically adapted for the pattern domain. It enables users to exercise direct control over the synthesis by expanding a partially hand-drawn pattern into a larger design while preserving the structure and details of the input. To enhance pattern quality, we fine-tune an image-pretrained diffusion model on structured patterns using Low-Rank Adaptation (LoRA), apply a noise rolling technique to ensure tileability, and utilize a patch-based approach to facilitate the generation of large-scale assets. We demonstrate the effectiveness of our method through a comprehensive set of experiments, showing that it outperforms existing models in generating diverse, consistent patterns that respond directly to user input.
format Preprint
id arxiv_https___arxiv_org_abs_2411_08930
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Structured Pattern Expansion with Diffusion Models
Riso, Marzia
Vecchio, Giuseppe
Pellacini, Fabio
Computer Vision and Pattern Recognition
Graphics
I.3
Recent advances in diffusion models have significantly improved the synthesis of materials, textures, and 3D shapes. By conditioning these models via text or images, users can guide the generation, reducing the time required to create digital assets. In this paper, we address the synthesis of structured, stationary patterns, where diffusion models are generally less reliable and, more importantly, less controllable. Our approach leverages the generative capabilities of diffusion models specifically adapted for the pattern domain. It enables users to exercise direct control over the synthesis by expanding a partially hand-drawn pattern into a larger design while preserving the structure and details of the input. To enhance pattern quality, we fine-tune an image-pretrained diffusion model on structured patterns using Low-Rank Adaptation (LoRA), apply a noise rolling technique to ensure tileability, and utilize a patch-based approach to facilitate the generation of large-scale assets. We demonstrate the effectiveness of our method through a comprehensive set of experiments, showing that it outperforms existing models in generating diverse, consistent patterns that respond directly to user input.
title Structured Pattern Expansion with Diffusion Models
topic Computer Vision and Pattern Recognition
Graphics
I.3
url https://arxiv.org/abs/2411.08930