Salvato in:
Dettagli Bibliografici
Autori principali: Chu, Shuyang, Xia, Menghan, Yuan, Mengyao, Liu, Xin, Seppanen, Tapio, Zhao, Guoying, Shi, Jingang
Natura: Preprint
Pubblicazione: 2025
Soggetti:
Accesso online:https://arxiv.org/abs/2502.07526
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866909488108273664
author Chu, Shuyang
Xia, Menghan
Yuan, Mengyao
Liu, Xin
Seppanen, Tapio
Zhao, Guoying
Shi, Jingang
author_facet Chu, Shuyang
Xia, Menghan
Yuan, Mengyao
Liu, Xin
Seppanen, Tapio
Zhao, Guoying
Shi, Jingang
contents Remote photoplethysmography (rPPG) aims to measure non-contact physiological signals from facial videos, which has shown great potential in many applications. Most existing methods directly extract video-based rPPG features by designing neural networks for heart rate estimation. Although they can achieve acceptable results, the recovery of rPPG signal faces intractable challenges when interference from real-world scenarios takes place on facial video. Specifically, facial videos are inevitably affected by non-physiological factors (e.g., camera device noise, defocus, and motion blur), leading to the distortion of extracted rPPG signals. Recent rPPG extraction methods are easily affected by interference and degradation, resulting in noisy rPPG signals. In this paper, we propose a novel method named CodePhys, which innovatively treats rPPG measurement as a code query task in a noise-free proxy space (i.e., codebook) constructed by ground-truth PPG signals. We consider noisy rPPG features as queries and generate high-fidelity rPPG features by matching them with noise-free PPG features from the codebook. Our approach also incorporates a spatial-aware encoder network with a spatial attention mechanism to highlight physiologically active areas and uses a distillation loss to reduce the influence of non-periodic visual interference. Experimental results on four benchmark datasets demonstrate that CodePhys outperforms state-of-the-art methods in both intra-dataset and cross-dataset settings.
format Preprint
id arxiv_https___arxiv_org_abs_2502_07526
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle CodePhys: Robust Video-based Remote Physiological Measurement through Latent Codebook Querying
Chu, Shuyang
Xia, Menghan
Yuan, Mengyao
Liu, Xin
Seppanen, Tapio
Zhao, Guoying
Shi, Jingang
Computer Vision and Pattern Recognition
Remote photoplethysmography (rPPG) aims to measure non-contact physiological signals from facial videos, which has shown great potential in many applications. Most existing methods directly extract video-based rPPG features by designing neural networks for heart rate estimation. Although they can achieve acceptable results, the recovery of rPPG signal faces intractable challenges when interference from real-world scenarios takes place on facial video. Specifically, facial videos are inevitably affected by non-physiological factors (e.g., camera device noise, defocus, and motion blur), leading to the distortion of extracted rPPG signals. Recent rPPG extraction methods are easily affected by interference and degradation, resulting in noisy rPPG signals. In this paper, we propose a novel method named CodePhys, which innovatively treats rPPG measurement as a code query task in a noise-free proxy space (i.e., codebook) constructed by ground-truth PPG signals. We consider noisy rPPG features as queries and generate high-fidelity rPPG features by matching them with noise-free PPG features from the codebook. Our approach also incorporates a spatial-aware encoder network with a spatial attention mechanism to highlight physiologically active areas and uses a distillation loss to reduce the influence of non-periodic visual interference. Experimental results on four benchmark datasets demonstrate that CodePhys outperforms state-of-the-art methods in both intra-dataset and cross-dataset settings.
title CodePhys: Robust Video-based Remote Physiological Measurement through Latent Codebook Querying
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.07526