Bibliographic Details
Main Authors: Khowaja, Sunder Ali, Nkenyereye, Lewis, Mujtaba, Ghulam, Lee, Ik Hyun, Fortino, Giancarlo, Dev, Kapal
Format: Preprint
Published: 2023
Subjects:
Online Access: https://arxiv.org/abs/2307.09020
_version_ 1866911822219575296
author Khowaja, Sunder Ali
Nkenyereye, Lewis
Mujtaba, Ghulam
Lee, Ik Hyun
Fortino, Giancarlo
Dev, Kapal
contents With the surge in emerging technologies such as the Metaverse, spatial computing, and generative AI, facial style transfer has attracted considerable interest from researchers and startup enthusiasts alike. StyleGAN methods have paved the way for transfer-learning strategies that reduce the dependency on large volumes of training data. However, StyleGAN methods tend to overfit, which introduces artifacts into the facial images. Studies such as DualStyleGAN proposed the use of multipath networks, but they require the networks to be trained for a specific style rather than generating a fusion of facial styles at once. In this paper, we propose a FusIon of STyles (FIST) network for facial images that leverages pre-trained multipath style transfer networks to remove the need for a large volume of training data while fusing multiple styles at the output. We leverage pre-trained StyleGAN networks with an external style pass that uses a residual modulation block instead of a transform coding block. The method also preserves facial structure, identity, and details via the gated mapping unit introduced in this study. These components enable us to train the network with a very limited amount of data while generating high-quality stylized images. Our training process adapts a curriculum learning strategy to perform efficient, flexible style and model fusion in the generative space. We perform extensive experiments to show the superiority of FISTNet over existing state-of-the-art methods.
format Preprint
id arxiv_https___arxiv_org_abs_2307_09020
institution arXiv
publishDate 2023
record_format arxiv
title FISTNet: FusIon of STyle-path generative Networks for Facial Style Transfer
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2307.09020