Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zakour, Marsil, Nath, Partha Pratim, Lohmer, Ludwig, Gökçe, Emre Faik, Piccolrovazzi, Martin, Patsch, Constantin, Wu, Yuankai, Chaudhari, Rahul, Steinbach, Eckehard
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2402.17758
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866916142456504320
author Zakour, Marsil
Nath, Partha Pratim
Lohmer, Ludwig
Gökçe, Emre Faik
Piccolrovazzi, Martin
Patsch, Constantin
Wu, Yuankai
Chaudhari, Rahul
Steinbach, Eckehard
author_facet Zakour, Marsil
Nath, Partha Pratim
Lohmer, Ludwig
Gökçe, Emre Faik
Piccolrovazzi, Martin
Patsch, Constantin
Wu, Yuankai
Chaudhari, Rahul
Steinbach, Eckehard
contents Hand-Object Interactions (HOIs) are conditioned on spatial and temporal contexts like surrounding objects, previous actions, and future intents (for example, grasping and handover actions vary greatly based on objects proximity and trajectory obstruction). However, existing datasets for 4D HOI (3D HOI over time) are limited to one subject interacting with one object only. This restricts the generalization of learning-based HOI methods trained on those datasets. We introduce ADL4D, a dataset of up to two subjects interacting with different sets of objects performing Activities of Daily Living (ADL) like breakfast or lunch preparation activities. The transition between multiple objects to complete a certain task over time introduces a unique context lacking in existing datasets. Our dataset consists of 75 sequences with a total of 1.1M RGB-D frames, hand and object poses, and per-hand fine-grained action annotations. We develop an automatic system for multi-view multi-hand 3D pose annotation capable of tracking hand poses over time. We integrate and test it against publicly available datasets. Finally, we evaluate our dataset on the tasks of Hand Mesh Recovery (HMR) and Hand Action Segmentation (HAS).
format Preprint
id arxiv_https___arxiv_org_abs_2402_17758
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living
Zakour, Marsil
Nath, Partha Pratim
Lohmer, Ludwig
Gökçe, Emre Faik
Piccolrovazzi, Martin
Patsch, Constantin
Wu, Yuankai
Chaudhari, Rahul
Steinbach, Eckehard
Computer Vision and Pattern Recognition
Hand-Object Interactions (HOIs) are conditioned on spatial and temporal contexts like surrounding objects, previous actions, and future intents (for example, grasping and handover actions vary greatly based on objects proximity and trajectory obstruction). However, existing datasets for 4D HOI (3D HOI over time) are limited to one subject interacting with one object only. This restricts the generalization of learning-based HOI methods trained on those datasets. We introduce ADL4D, a dataset of up to two subjects interacting with different sets of objects performing Activities of Daily Living (ADL) like breakfast or lunch preparation activities. The transition between multiple objects to complete a certain task over time introduces a unique context lacking in existing datasets. Our dataset consists of 75 sequences with a total of 1.1M RGB-D frames, hand and object poses, and per-hand fine-grained action annotations. We develop an automatic system for multi-view multi-hand 3D pose annotation capable of tracking hand poses over time. We integrate and test it against publicly available datasets. Finally, we evaluate our dataset on the tasks of Hand Mesh Recovery (HMR) and Hand Action Segmentation (HAS).
title ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2402.17758