Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ortmann, Thorben, Wang, Qi, Putzar, Larissa
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.11306
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916525106003968
author	Ortmann, Thorben Wang, Qi Putzar, Larissa
author_facet	Ortmann, Thorben Wang, Qi Putzar, Larissa
contents	In this study, we explored the potential of utilizing Facial Expression Activations (FEAs) captured via the Meta Quest Pro Virtual Reality (VR) headset for Facial Expression Recognition (FER) in VR settings. Leveraging the EmojiHeroVR Database (EmoHeVRDB), we compared several unimodal approaches and achieved up to 73.02% accuracy for the static FER task with seven emotion categories. Furthermore, we integrated FEA and image data in multimodal approaches, observing significant improvements in recognition accuracy. An intermediate fusion approach achieved the highest accuracy of 80.42%, significantly surpassing the baseline evaluation result of 69.84% reported for EmoHeVRDB's image data. Our study is the first to utilize EmoHeVRDB's unique FEA data for unimodal and multimodal static FER, establishing new benchmarks for FER in VR settings. Our findings highlight the potential of fusing complementary modalities to enhance FER accuracy in VR settings, where conventional image-based methods are severely limited by the occlusion caused by Head-Mounted Displays (HMDs).
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_11306
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Unimodal and Multimodal Static Facial Expression Recognition for Virtual Reality Users with EmoHeVRDB Ortmann, Thorben Wang, Qi Putzar, Larissa Computer Vision and Pattern Recognition In this study, we explored the potential of utilizing Facial Expression Activations (FEAs) captured via the Meta Quest Pro Virtual Reality (VR) headset for Facial Expression Recognition (FER) in VR settings. Leveraging the EmojiHeroVR Database (EmoHeVRDB), we compared several unimodal approaches and achieved up to 73.02% accuracy for the static FER task with seven emotion categories. Furthermore, we integrated FEA and image data in multimodal approaches, observing significant improvements in recognition accuracy. An intermediate fusion approach achieved the highest accuracy of 80.42%, significantly surpassing the baseline evaluation result of 69.84% reported for EmoHeVRDB's image data. Our study is the first to utilize EmoHeVRDB's unique FEA data for unimodal and multimodal static FER, establishing new benchmarks for FER in VR settings. Our findings highlight the potential of fusing complementary modalities to enhance FER accuracy in VR settings, where conventional image-based methods are severely limited by the occlusion caused by Head-Mounted Displays (HMDs).
title	Unimodal and Multimodal Static Facial Expression Recognition for Virtual Reality Users with EmoHeVRDB
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2412.11306

Similar Items