Saved in:
Bibliographic Details
Main Authors: Le, Hoang H., Nguyen, Duy M. H., Bhatti, Omair Shahzad, Kopacsi, Laszlo, Ngo, Thinh P., Nguyen, Binh T., Barz, Michael, Sonntag, Daniel
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.06239
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914860461195264
author Le, Hoang H.
Nguyen, Duy M. H.
Bhatti, Omair Shahzad
Kopacsi, Laszlo
Ngo, Thinh P.
Nguyen, Binh T.
Barz, Michael
Sonntag, Daniel
author_facet Le, Hoang H.
Nguyen, Duy M. H.
Bhatti, Omair Shahzad
Kopacsi, Laszlo
Ngo, Thinh P.
Nguyen, Binh T.
Barz, Michael
Sonntag, Daniel
contents Comprehending how humans process visual information in dynamic settings is crucial for psychology and designing user-centered interactions. While mobile eye-tracking systems combining egocentric video and gaze signals can offer valuable insights, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object recognition within mobile eye-tracking settings. Our approach seamlessly integrates an object detector with a spatial relation-aware inductive message-passing network (I-MPN), harnessing node profile information and capturing object correlations. Such mechanisms enable us to learn embedding functions capable of generalizing to new object angle views, facilitating rapid adaptation and efficient reasoning in dynamic contexts as users navigate their environment. Through experiments conducted on three distinct video sequences, our interactive-based method showcases significant performance improvements over fixed training/testing algorithms, even when trained on considerably smaller annotated samples collected through user feedback. Furthermore, we demonstrate exceptional efficiency in data annotation processes and surpass prior interactive methods that use complete object detectors, combine detectors with convolutional networks, or employ interactive video segmentation.
format Preprint
id arxiv_https___arxiv_org_abs_2406_06239
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data
Le, Hoang H.
Nguyen, Duy M. H.
Bhatti, Omair Shahzad
Kopacsi, Laszlo
Ngo, Thinh P.
Nguyen, Binh T.
Barz, Michael
Sonntag, Daniel
Computer Vision and Pattern Recognition
Comprehending how humans process visual information in dynamic settings is crucial for psychology and designing user-centered interactions. While mobile eye-tracking systems combining egocentric video and gaze signals can offer valuable insights, manual analysis of these recordings is time-intensive. In this work, we present a novel human-centered learning algorithm designed for automated object recognition within mobile eye-tracking settings. Our approach seamlessly integrates an object detector with a spatial relation-aware inductive message-passing network (I-MPN), harnessing node profile information and capturing object correlations. Such mechanisms enable us to learn embedding functions capable of generalizing to new object angle views, facilitating rapid adaptation and efficient reasoning in dynamic contexts as users navigate their environment. Through experiments conducted on three distinct video sequences, our interactive-based method showcases significant performance improvements over fixed training/testing algorithms, even when trained on considerably smaller annotated samples collected through user feedback. Furthermore, we demonstrate exceptional efficiency in data annotation processes and surpass prior interactive methods that use complete object detectors, combine detectors with convolutional networks, or employ interactive video segmentation.
title I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2406.06239