Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Chen, Bowei, Zhi, Tiancheng, Zhu, Peihao, Sang, Shen, Liu, Jing, Luo, Linjie
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2407.20455
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866910546079514624
author	Chen, Bowei Zhi, Tiancheng Zhu, Peihao Sang, Shen Liu, Jing Luo, Linjie
author_facet	Chen, Bowei Zhi, Tiancheng Zhu, Peihao Sang, Shen Liu, Jing Luo, Linjie
contents	Portrait editing is challenging for existing techniques due to difficulties in preserving subject features like identity. In this paper, we propose a training-based method leveraging auto-generated paired data to learn desired editing while ensuring the preservation of unchanged subject features. Specifically, we design a data generation process to create reasonably good training pairs for desired editing at low cost. Based on these pairs, we introduce a Multi-Conditioned Diffusion Model to effectively learn the editing direction and preserve subject features. During inference, our model produces accurate editing mask that can guide the inference process to further preserve detailed subject features. Experiments on costume editing and cartoon expression editing show that our method achieves state-of-the-art quality, quantitatively and qualitatively.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_20455
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Learning Feature-Preserving Portrait Editing from Generated Pairs Chen, Bowei Zhi, Tiancheng Zhu, Peihao Sang, Shen Liu, Jing Luo, Linjie Computer Vision and Pattern Recognition Portrait editing is challenging for existing techniques due to difficulties in preserving subject features like identity. In this paper, we propose a training-based method leveraging auto-generated paired data to learn desired editing while ensuring the preservation of unchanged subject features. Specifically, we design a data generation process to create reasonably good training pairs for desired editing at low cost. Based on these pairs, we introduce a Multi-Conditioned Diffusion Model to effectively learn the editing direction and preserve subject features. During inference, our model produces accurate editing mask that can guide the inference process to further preserve detailed subject features. Experiments on costume editing and cartoon expression editing show that our method achieves state-of-the-art quality, quantitatively and qualitatively.
title	Learning Feature-Preserving Portrait Editing from Generated Pairs
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2407.20455

Ähnliche Einträge