Saved in:
Bibliographic Details
Main Authors: Huang, Haoyu, Sato, Yoichi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.11669
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915794327175168
author Huang, Haoyu
Sato, Yoichi
author_facet Huang, Haoyu
Sato, Yoichi
contents This paper introduces neck-mounted view gaze estimation, a new task that estimates user gaze from the neck-mounted camera perspective. Prior work on egocentric gaze estimation, which predicts device wearer's gaze location within the camera's field of view, mainly focuses on head-mounted cameras while alternative viewpoints remain underexplored. To bridge this gap, we collect the first dataset for this task, consisting of approximately 4 hours of video collected from 8 participants during everyday activities. We evaluate a transformer-based gaze estimation model, GLC, on the new dataset and propose two extensions: an auxiliary gaze out-of-bound classification task and a multi-view co-learning approach that jointly trains head-view and neck-view models using a geometry-aware auxiliary loss. Experimental results show that incorporating gaze out-of-bound classification improves performance over standard fine-tuning, while the co-learning approach does not yield gains. We further analyze these results and discuss implications for neck-mounted gaze estimation.
format Preprint
id arxiv_https___arxiv_org_abs_2602_11669
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Egocentric Gaze Estimation via Neck-Mounted Camera
Huang, Haoyu
Sato, Yoichi
Computer Vision and Pattern Recognition
This paper introduces neck-mounted view gaze estimation, a new task that estimates user gaze from the neck-mounted camera perspective. Prior work on egocentric gaze estimation, which predicts device wearer's gaze location within the camera's field of view, mainly focuses on head-mounted cameras while alternative viewpoints remain underexplored. To bridge this gap, we collect the first dataset for this task, consisting of approximately 4 hours of video collected from 8 participants during everyday activities. We evaluate a transformer-based gaze estimation model, GLC, on the new dataset and propose two extensions: an auxiliary gaze out-of-bound classification task and a multi-view co-learning approach that jointly trains head-view and neck-view models using a geometry-aware auxiliary loss. Experimental results show that incorporating gaze out-of-bound classification improves performance over standard fine-tuning, while the co-learning approach does not yield gains. We further analyze these results and discuss implications for neck-mounted gaze estimation.
title Egocentric Gaze Estimation via Neck-Mounted Camera
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.11669