Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Khowaja, Sunder Ali, Nkenyereye, Lewis, Mujtaba, Ghulam, Lee, Ik Hyun, Fortino, Giancarlo, Dev, Kapal
Format:	Preprint
Publié:	2023
Sujets:	Computer Vision and Pattern Recognition
Accès en ligne:	https://arxiv.org/abs/2307.09020
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866911822219575296
author	Khowaja, Sunder Ali Nkenyereye, Lewis Mujtaba, Ghulam Lee, Ik Hyun Fortino, Giancarlo Dev, Kapal
author_facet	Khowaja, Sunder Ali Nkenyereye, Lewis Mujtaba, Ghulam Lee, Ik Hyun Fortino, Giancarlo Dev, Kapal
contents	With the surge in emerging technologies such as Metaverse, spatial computing, and generative AI, the application of facial style transfer has gained a lot of interest from researchers as well as startups enthusiasts alike. StyleGAN methods have paved the way for transfer-learning strategies that could reduce the dependency on the huge volume of data that is available for the training process. However, StyleGAN methods have the tendency of overfitting that results in the introduction of artifacts in the facial images. Studies, such as DualStyleGAN, proposed the use of multipath networks but they require the networks to be trained for a specific style rather than generating a fusion of facial styles at once. In this paper, we propose a FusIon of STyles (FIST) network for facial images that leverages pre-trained multipath style transfer networks to eliminate the problem associated with lack of huge data volume in the training phase along with the fusion of multiple styles at the output. We leverage pre-trained styleGAN networks with an external style pass that use residual modulation block instead of a transform coding block. The method also preserves facial structure, identity, and details via the gated mapping unit introduced in this study. The aforementioned components enable us to train the network with very limited amount of data while generating high-quality stylized images. Our training process adapts curriculum learning strategy to perform efficient, flexible style and model fusion in the generative space. We perform extensive experiments to show the superiority of FISTNet in comparison to existing state-of-the-art methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2307_09020
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	FISTNet: FusIon of STyle-path generative Networks for Facial Style Transfer Khowaja, Sunder Ali Nkenyereye, Lewis Mujtaba, Ghulam Lee, Ik Hyun Fortino, Giancarlo Dev, Kapal Computer Vision and Pattern Recognition With the surge in emerging technologies such as Metaverse, spatial computing, and generative AI, the application of facial style transfer has gained a lot of interest from researchers as well as startups enthusiasts alike. StyleGAN methods have paved the way for transfer-learning strategies that could reduce the dependency on the huge volume of data that is available for the training process. However, StyleGAN methods have the tendency of overfitting that results in the introduction of artifacts in the facial images. Studies, such as DualStyleGAN, proposed the use of multipath networks but they require the networks to be trained for a specific style rather than generating a fusion of facial styles at once. In this paper, we propose a FusIon of STyles (FIST) network for facial images that leverages pre-trained multipath style transfer networks to eliminate the problem associated with lack of huge data volume in the training phase along with the fusion of multiple styles at the output. We leverage pre-trained styleGAN networks with an external style pass that use residual modulation block instead of a transform coding block. The method also preserves facial structure, identity, and details via the gated mapping unit introduced in this study. The aforementioned components enable us to train the network with very limited amount of data while generating high-quality stylized images. Our training process adapts curriculum learning strategy to perform efficient, flexible style and model fusion in the generative space. We perform extensive experiments to show the superiority of FISTNet in comparison to existing state-of-the-art methods.
title	FISTNet: FusIon of STyle-path generative Networks for Facial Style Transfer
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2307.09020

Documents similaires