Saved in:
Bibliographic Details
Main Authors: Cheng, Zhang, Wang, Yanxia, Xia, Guoyu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.08074
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910739485163520
author Cheng, Zhang
Wang, Yanxia
Xia, Guoyu
author_facet Cheng, Zhang
Wang, Yanxia
Xia, Guoyu
contents In recent years, the accuracy of gaze estimation techniques has gradually improved, but existing methods often rely on large datasets or large models to improve performance, which leads to high demands on computational resources. In terms of this issue, this paper proposes a lightweight gaze estimation model EM-Net based on deep learning and traditional machine learning algorithms Expectation Maximization algorithm. First, the proposed Global Attention Mechanism(GAM) is added to extract features related to gaze estimation to improve the model's ability to capture global dependencies and thus improve its performance. Second, by learning hierarchical feature representations through the EM module, the model has strong generalization ability, which reduces the need for sample size. Experiments have confirmed that, on the premise of using only 50% of the training data, EM-Net improves the performance of Gaze360, MPIIFaceGaze, and RT-Gene datasets by 2.2%, 2.02%, and 2.03%, respectively, compared with GazeNAS-ETH. It also shows good robustness in the face of Gaussian noise interference.
format Preprint
id arxiv_https___arxiv_org_abs_2412_08074
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle EM-Net: Gaze Estimation with Expectation Maximization Algorithm
Cheng, Zhang
Wang, Yanxia
Xia, Guoyu
Computer Vision and Pattern Recognition
Machine Learning
In recent years, the accuracy of gaze estimation techniques has gradually improved, but existing methods often rely on large datasets or large models to improve performance, which leads to high demands on computational resources. In terms of this issue, this paper proposes a lightweight gaze estimation model EM-Net based on deep learning and traditional machine learning algorithms Expectation Maximization algorithm. First, the proposed Global Attention Mechanism(GAM) is added to extract features related to gaze estimation to improve the model's ability to capture global dependencies and thus improve its performance. Second, by learning hierarchical feature representations through the EM module, the model has strong generalization ability, which reduces the need for sample size. Experiments have confirmed that, on the premise of using only 50% of the training data, EM-Net improves the performance of Gaze360, MPIIFaceGaze, and RT-Gene datasets by 2.2%, 2.02%, and 2.03%, respectively, compared with GazeNAS-ETH. It also shows good robustness in the face of Gaussian noise interference.
title EM-Net: Gaze Estimation with Expectation Maximization Algorithm
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2412.08074