Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Han, Wenkang, Lin, Wang, Zhou, Yiyun, Liu, Qi, Wang, Shulei, Yao, Chang, Chen, Jingyuan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.10293
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918091137482752
author	Han, Wenkang Lin, Wang Zhou, Yiyun Liu, Qi Wang, Shulei Yao, Chang Chen, Jingyuan
author_facet	Han, Wenkang Lin, Wang Zhou, Yiyun Liu, Qi Wang, Shulei Yao, Chang Chen, Jingyuan
contents	Face Video Restoration (FVR) aims to recover high-quality face videos from degraded versions. Traditional methods struggle to preserve fine-grained, identity-specific features when degradation is severe, often producing average-looking faces that lack individual characteristics. To address these challenges, we introduce IP-FVR, a novel method that leverages a high-quality reference face image as a visual prompt to provide identity conditioning during the denoising process. IP-FVR incorporates semantically rich identity information from the reference image using decoupled cross-attention mechanisms, ensuring detailed and identity consistent results. For intra-clip identity drift (within 24 frames), we introduce an identity-preserving feedback learning method that combines cosine similarity-based reward signals with suffix-weighted temporal aggregation. This approach effectively minimizes drift within sequences of frames. For inter-clip identity drift, we develop an exponential blending strategy that aligns identities across clips by iteratively blending frames from previous clips during the denoising process. This method ensures consistent identity representation across different clips. Additionally, we enhance the restoration process with a multi-stream negative prompt, guiding the model's attention to relevant facial attributes and minimizing the generation of low-quality or incorrect features. Extensive experiments on both synthetic and real-world datasets demonstrate that IP-FVR outperforms existing methods in both quality and identity preservation, showcasing its substantial potential for practical applications in face video restoration.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_10293
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Show and Polish: Reference-Guided Identity Preservation in Face Video Restoration Han, Wenkang Lin, Wang Zhou, Yiyun Liu, Qi Wang, Shulei Yao, Chang Chen, Jingyuan Computer Vision and Pattern Recognition Face Video Restoration (FVR) aims to recover high-quality face videos from degraded versions. Traditional methods struggle to preserve fine-grained, identity-specific features when degradation is severe, often producing average-looking faces that lack individual characteristics. To address these challenges, we introduce IP-FVR, a novel method that leverages a high-quality reference face image as a visual prompt to provide identity conditioning during the denoising process. IP-FVR incorporates semantically rich identity information from the reference image using decoupled cross-attention mechanisms, ensuring detailed and identity consistent results. For intra-clip identity drift (within 24 frames), we introduce an identity-preserving feedback learning method that combines cosine similarity-based reward signals with suffix-weighted temporal aggregation. This approach effectively minimizes drift within sequences of frames. For inter-clip identity drift, we develop an exponential blending strategy that aligns identities across clips by iteratively blending frames from previous clips during the denoising process. This method ensures consistent identity representation across different clips. Additionally, we enhance the restoration process with a multi-stream negative prompt, guiding the model's attention to relevant facial attributes and minimizing the generation of low-quality or incorrect features. Extensive experiments on both synthetic and real-world datasets demonstrate that IP-FVR outperforms existing methods in both quality and identity preservation, showcasing its substantial potential for practical applications in face video restoration.
title	Show and Polish: Reference-Guided Identity Preservation in Face Video Restoration
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2507.10293

Similar Items