Saved in:
Bibliographic Details
Main Authors: Song, Peipei, Zhang, Jing, Koniusz, Piotr, Barnes, Nick
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.14821
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909146170785792
author Song, Peipei
Zhang, Jing
Koniusz, Piotr
Barnes, Nick
author_facet Song, Peipei
Zhang, Jing
Koniusz, Piotr
Barnes, Nick
contents Existing eye fixation prediction methods perform the mapping from input images to the corresponding dense fixation maps generated from raw fixation points. However, due to the stochastic nature of human fixation, the generated dense fixation maps may be a less-than-ideal representation of human fixation. To provide a robust fixation model, we introduce Gaussian Representation for eye fixation modeling. Specifically, we propose to model the eye fixation map as a mixture of probability distributions, namely a Gaussian Mixture Model. In this new representation, we use several Gaussian distribution components as an alternative to the provided fixation map, which makes the model more robust to the randomness of fixation. Meanwhile, we design our framework upon some lightweight backbones to achieve real-time fixation prediction. Experimental results on three public fixation prediction datasets (SALICON, MIT1003, TORONTO) demonstrate that our method is fast and effective.
format Preprint
id arxiv_https___arxiv_org_abs_2403_14821
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Learning Gaussian Representation for Eye Fixation Prediction
Song, Peipei
Zhang, Jing
Koniusz, Piotr
Barnes, Nick
Computer Vision and Pattern Recognition
Existing eye fixation prediction methods perform the mapping from input images to the corresponding dense fixation maps generated from raw fixation points. However, due to the stochastic nature of human fixation, the generated dense fixation maps may be a less-than-ideal representation of human fixation. To provide a robust fixation model, we introduce Gaussian Representation for eye fixation modeling. Specifically, we propose to model the eye fixation map as a mixture of probability distributions, namely a Gaussian Mixture Model. In this new representation, we use several Gaussian distribution components as an alternative to the provided fixation map, which makes the model more robust to the randomness of fixation. Meanwhile, we design our framework upon some lightweight backbones to achieve real-time fixation prediction. Experimental results on three public fixation prediction datasets (SALICON, MIT1003, TORONTO) demonstrate that our method is fast and effective.
title Learning Gaussian Representation for Eye Fixation Prediction
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.14821