Saved in:
Bibliographic Details
Main Authors: Lew, Hah Min, Yoo, Sahng-Min, Kang, Hyunwoo, Park, Gyeong-Moon
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2411.00652
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929572028612608
author Lew, Hah Min
Yoo, Sahng-Min
Kang, Hyunwoo
Park, Gyeong-Moon
author_facet Lew, Hah Min
Yoo, Sahng-Min
Kang, Hyunwoo
Park, Gyeong-Moon
contents We introduce an industrial Head Blending pipeline for the task of seamlessly integrating an actor's head onto a target body in digital content creation. The key challenge stems from discrepancies in head shape and hair structure, which lead to unnatural boundaries and blending artifacts. Existing methods treat foreground and background as a single task, resulting in suboptimal blending quality. To address this problem, we propose CHANGER, a novel pipeline that decouples background integration from foreground blending. By utilizing chroma keying for artifact-free background generation and introducing Head shape and long Hair augmentation ($H^2$ augmentation) to simulate a wide range of head shapes and hair styles, CHANGER improves generalization on innumerable various real-world cases. Furthermore, our Foreground Predictive Attention Transformer (FPAT) module enhances foreground blending by predicting and focusing on key head and body regions. Quantitative and qualitative evaluations on benchmark datasets demonstrate that our CHANGER outperforms state-of-the-art methods, delivering high-fidelity, industrial-grade results.
format Preprint
id arxiv_https___arxiv_org_abs_2411_00652
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Towards High-fidelity Head Blending with Chroma Keying for Industrial Applications
Lew, Hah Min
Yoo, Sahng-Min
Kang, Hyunwoo
Park, Gyeong-Moon
Computer Vision and Pattern Recognition
Graphics
Machine Learning
We introduce an industrial Head Blending pipeline for the task of seamlessly integrating an actor's head onto a target body in digital content creation. The key challenge stems from discrepancies in head shape and hair structure, which lead to unnatural boundaries and blending artifacts. Existing methods treat foreground and background as a single task, resulting in suboptimal blending quality. To address this problem, we propose CHANGER, a novel pipeline that decouples background integration from foreground blending. By utilizing chroma keying for artifact-free background generation and introducing Head shape and long Hair augmentation ($H^2$ augmentation) to simulate a wide range of head shapes and hair styles, CHANGER improves generalization on innumerable various real-world cases. Furthermore, our Foreground Predictive Attention Transformer (FPAT) module enhances foreground blending by predicting and focusing on key head and body regions. Quantitative and qualitative evaluations on benchmark datasets demonstrate that our CHANGER outperforms state-of-the-art methods, delivering high-fidelity, industrial-grade results.
title Towards High-fidelity Head Blending with Chroma Keying for Industrial Applications
topic Computer Vision and Pattern Recognition
Graphics
Machine Learning
url https://arxiv.org/abs/2411.00652