Saved in:
Bibliographic Details
Main Authors: Celestino, José, Marques, Manuel, Nascimento, Jacinto C.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.20251
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929683646382080
author Celestino, José
Marques, Manuel
Nascimento, Jacinto C.
author_facet Celestino, José
Marques, Manuel
Nascimento, Jacinto C.
contents Head pose estimation has become a crucial area of research in computer vision given its usefulness in a wide range of applications, including robotics, surveillance, or driver attention monitoring. One of the most difficult challenges in this field is managing head occlusions that frequently take place in real-world scenarios. In this paper, we propose a novel and efficient framework that is robust in real world head occlusion scenarios. In particular, we propose an unsupervised latent embedding clustering with regression and classification components for each pose angle. The model optimizes latent feature representations for occluded and non-occluded images through a clustering term while improving fine-grained angle predictions. Experimental evaluation on in-the-wild head pose benchmark datasets reveal competitive performance in comparison to state-of-the-art methodologies with the advantage of having a significant data reduction. We observe a substantial improvement in occluded head pose estimation. Also, an ablation study is conducted to ascertain the impact of the clustering term within our proposed framework.
format Preprint
id arxiv_https___arxiv_org_abs_2403_20251
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Latent Embedding Clustering for Occlusion Robust Head Pose Estimation
Celestino, José
Marques, Manuel
Nascimento, Jacinto C.
Computer Vision and Pattern Recognition
Head pose estimation has become a crucial area of research in computer vision given its usefulness in a wide range of applications, including robotics, surveillance, or driver attention monitoring. One of the most difficult challenges in this field is managing head occlusions that frequently take place in real-world scenarios. In this paper, we propose a novel and efficient framework that is robust in real world head occlusion scenarios. In particular, we propose an unsupervised latent embedding clustering with regression and classification components for each pose angle. The model optimizes latent feature representations for occluded and non-occluded images through a clustering term while improving fine-grained angle predictions. Experimental evaluation on in-the-wild head pose benchmark datasets reveal competitive performance in comparison to state-of-the-art methodologies with the advantage of having a significant data reduction. We observe a substantial improvement in occluded head pose estimation. Also, an ablation study is conducted to ascertain the impact of the clustering term within our proposed framework.
title Latent Embedding Clustering for Occlusion Robust Head Pose Estimation
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.20251