Saved in:
Bibliographic Details
Main Authors: Hägerlind, Johannes, Hentati-Sundberg, Jonas, Wandt, Bastian
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.13629
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913479051444224
author Hägerlind, Johannes
Hentati-Sundberg, Jonas
Wandt, Bastian
author_facet Hägerlind, Johannes
Hentati-Sundberg, Jonas
Wandt, Bastian
contents This paper deals with 3D reconstruction of seabirds which recently came into focus of environmental scientists as valuable bio-indicators for environmental change. Such 3D information is beneficial for analyzing the bird's behavior and physiological shape, for example by tracking motion, shape, and appearance changes. From a computer vision perspective birds are especially challenging due to their rapid and oftentimes non-rigid motions. We propose an approach to reconstruct the 3D pose and shape from monocular videos of a specific breed of seabird - the common murre. Our approach comprises a full pipeline of detection, tracking, segmentation, and temporally consistent 3D reconstruction. Additionally, we propose a temporal loss that extends current single-image 3D bird pose estimators to the temporal domain. Moreover, we provide a real-world dataset of 10000 frames of video observations on average capture nine birds simultaneously, comprising a large variety of motions and interactions, including a smaller test set with bird-specific keypoint labels. Using our temporal optimization, we achieve state-of-the-art performance for the challenging sequences in our dataset.
format Preprint
id arxiv_https___arxiv_org_abs_2408_13629
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Temporally-consistent 3D Reconstruction of Birds
Hägerlind, Johannes
Hentati-Sundberg, Jonas
Wandt, Bastian
Computer Vision and Pattern Recognition
This paper deals with 3D reconstruction of seabirds which recently came into focus of environmental scientists as valuable bio-indicators for environmental change. Such 3D information is beneficial for analyzing the bird's behavior and physiological shape, for example by tracking motion, shape, and appearance changes. From a computer vision perspective birds are especially challenging due to their rapid and oftentimes non-rigid motions. We propose an approach to reconstruct the 3D pose and shape from monocular videos of a specific breed of seabird - the common murre. Our approach comprises a full pipeline of detection, tracking, segmentation, and temporally consistent 3D reconstruction. Additionally, we propose a temporal loss that extends current single-image 3D bird pose estimators to the temporal domain. Moreover, we provide a real-world dataset of 10000 frames of video observations on average capture nine birds simultaneously, comprising a large variety of motions and interactions, including a smaller test set with bird-specific keypoint labels. Using our temporal optimization, we achieve state-of-the-art performance for the challenging sequences in our dataset.
title Temporally-consistent 3D Reconstruction of Birds
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2408.13629