Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Riso, Marzia, Vecchio, Giuseppe, Pellacini, Fabio
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Graphics I.3
Online Access:	https://arxiv.org/abs/2411.08930
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915756754599936
author	Riso, Marzia Vecchio, Giuseppe Pellacini, Fabio
author_facet	Riso, Marzia Vecchio, Giuseppe Pellacini, Fabio
contents	Recent advances in diffusion models have significantly improved the synthesis of materials, textures, and 3D shapes. By conditioning these models via text or images, users can guide the generation, reducing the time required to create digital assets. In this paper, we address the synthesis of structured, stationary patterns, where diffusion models are generally less reliable and, more importantly, less controllable. Our approach leverages the generative capabilities of diffusion models specifically adapted for the pattern domain. It enables users to exercise direct control over the synthesis by expanding a partially hand-drawn pattern into a larger design while preserving the structure and details of the input. To enhance pattern quality, we fine-tune an image-pretrained diffusion model on structured patterns using Low-Rank Adaptation (LoRA), apply a noise rolling technique to ensure tileability, and utilize a patch-based approach to facilitate the generation of large-scale assets. We demonstrate the effectiveness of our method through a comprehensive set of experiments, showing that it outperforms existing models in generating diverse, consistent patterns that respond directly to user input.
format	Preprint
id	arxiv_https___arxiv_org_abs_2411_08930
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Structured Pattern Expansion with Diffusion Models Riso, Marzia Vecchio, Giuseppe Pellacini, Fabio Computer Vision and Pattern Recognition Graphics I.3 Recent advances in diffusion models have significantly improved the synthesis of materials, textures, and 3D shapes. By conditioning these models via text or images, users can guide the generation, reducing the time required to create digital assets. In this paper, we address the synthesis of structured, stationary patterns, where diffusion models are generally less reliable and, more importantly, less controllable. Our approach leverages the generative capabilities of diffusion models specifically adapted for the pattern domain. It enables users to exercise direct control over the synthesis by expanding a partially hand-drawn pattern into a larger design while preserving the structure and details of the input. To enhance pattern quality, we fine-tune an image-pretrained diffusion model on structured patterns using Low-Rank Adaptation (LoRA), apply a noise rolling technique to ensure tileability, and utilize a patch-based approach to facilitate the generation of large-scale assets. We demonstrate the effectiveness of our method through a comprehensive set of experiments, showing that it outperforms existing models in generating diverse, consistent patterns that respond directly to user input.
title	Structured Pattern Expansion with Diffusion Models
topic	Computer Vision and Pattern Recognition Graphics I.3
url	https://arxiv.org/abs/2411.08930

Similar Items