Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nijhawan, Siddharth, Yashima, Takuya, Kojima, Tamaki
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.14667
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910418600984576
author	Nijhawan, Siddharth Yashima, Takuya Kojima, Tamaki
author_facet	Nijhawan, Siddharth Yashima, Takuya Kojima, Tamaki
contents	Performing facial expression transfer under one-shot setting has been increasing in popularity among research community with a focus on precise control of expressions. Existing techniques showcase compelling results in perceiving expressions, but they lack robustness with extreme head poses. They also struggle to accurately reconstruct background details, thus hindering the realism. In this paper, we propose a novel warping technology which integrates the advantages of both 2D and 3D methods to achieve robust face re-enactment. We generate dense 3D facial flow fields in feature space to warp an input image based on target expressions without depth information. This enables explicit 3D geometric control for re-enacting misaligned source and target faces. We regularize the motion estimation capability of the 3D flow prediction network through proposed "Cyclic warp loss" by converting warped 3D features back into 2D RGB space. To ensure the generation of finer facial region with natural-background, our framework only renders the facial foreground region first and learns to inpaint the blank area which needs to be filled due to source face translation, thus reconstructing the detailed background without any unwanted pixel motion. Extensive evaluation reveals that our method outperforms state-of-the-art techniques in rendering artifact-free facial images.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_14667
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation Nijhawan, Siddharth Yashima, Takuya Kojima, Tamaki Computer Vision and Pattern Recognition Performing facial expression transfer under one-shot setting has been increasing in popularity among research community with a focus on precise control of expressions. Existing techniques showcase compelling results in perceiving expressions, but they lack robustness with extreme head poses. They also struggle to accurately reconstruct background details, thus hindering the realism. In this paper, we propose a novel warping technology which integrates the advantages of both 2D and 3D methods to achieve robust face re-enactment. We generate dense 3D facial flow fields in feature space to warp an input image based on target expressions without depth information. This enables explicit 3D geometric control for re-enacting misaligned source and target faces. We regularize the motion estimation capability of the 3D flow prediction network through proposed "Cyclic warp loss" by converting warped 3D features back into 2D RGB space. To ensure the generation of finer facial region with natural-background, our framework only renders the facial foreground region first and learns to inpaint the blank area which needs to be filled due to source face translation, thus reconstructing the detailed background without any unwanted pixel motion. Extensive evaluation reveals that our method outperforms state-of-the-art techniques in rendering artifact-free facial images.
title	3DFlowRenderer: One-shot Face Re-enactment via Dense 3D Facial Flow Estimation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2404.14667

Similar Items