Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zakour, Marsil, Nath, Partha Pratim, Lohmer, Ludwig, Gökçe, Emre Faik, Piccolrovazzi, Martin, Patsch, Constantin, Wu, Yuankai, Chaudhari, Rahul, Steinbach, Eckehard
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2402.17758
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866916142456504320
author	Zakour, Marsil Nath, Partha Pratim Lohmer, Ludwig Gökçe, Emre Faik Piccolrovazzi, Martin Patsch, Constantin Wu, Yuankai Chaudhari, Rahul Steinbach, Eckehard
author_facet	Zakour, Marsil Nath, Partha Pratim Lohmer, Ludwig Gökçe, Emre Faik Piccolrovazzi, Martin Patsch, Constantin Wu, Yuankai Chaudhari, Rahul Steinbach, Eckehard
contents	Hand-Object Interactions (HOIs) are conditioned on spatial and temporal contexts like surrounding objects, previous actions, and future intents (for example, grasping and handover actions vary greatly based on objects proximity and trajectory obstruction). However, existing datasets for 4D HOI (3D HOI over time) are limited to one subject interacting with one object only. This restricts the generalization of learning-based HOI methods trained on those datasets. We introduce ADL4D, a dataset of up to two subjects interacting with different sets of objects performing Activities of Daily Living (ADL) like breakfast or lunch preparation activities. The transition between multiple objects to complete a certain task over time introduces a unique context lacking in existing datasets. Our dataset consists of 75 sequences with a total of 1.1M RGB-D frames, hand and object poses, and per-hand fine-grained action annotations. We develop an automatic system for multi-view multi-hand 3D pose annotation capable of tracking hand poses over time. We integrate and test it against publicly available datasets. Finally, we evaluate our dataset on the tasks of Hand Mesh Recovery (HMR) and Hand Action Segmentation (HAS).
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_17758
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living Zakour, Marsil Nath, Partha Pratim Lohmer, Ludwig Gökçe, Emre Faik Piccolrovazzi, Martin Patsch, Constantin Wu, Yuankai Chaudhari, Rahul Steinbach, Eckehard Computer Vision and Pattern Recognition Hand-Object Interactions (HOIs) are conditioned on spatial and temporal contexts like surrounding objects, previous actions, and future intents (for example, grasping and handover actions vary greatly based on objects proximity and trajectory obstruction). However, existing datasets for 4D HOI (3D HOI over time) are limited to one subject interacting with one object only. This restricts the generalization of learning-based HOI methods trained on those datasets. We introduce ADL4D, a dataset of up to two subjects interacting with different sets of objects performing Activities of Daily Living (ADL) like breakfast or lunch preparation activities. The transition between multiple objects to complete a certain task over time introduces a unique context lacking in existing datasets. Our dataset consists of 75 sequences with a total of 1.1M RGB-D frames, hand and object poses, and per-hand fine-grained action annotations. We develop an automatic system for multi-view multi-hand 3D pose annotation capable of tracking hand poses over time. We integrate and test it against publicly available datasets. Finally, we evaluate our dataset on the tasks of Hand Mesh Recovery (HMR) and Hand Action Segmentation (HAS).
title	ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2402.17758

Ähnliche Einträge