Saved in:
Bibliographic Details
Main Authors: Huang, Sining, Song, Yukun, Kang, Yixiao, Yu, Chang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.14577
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • In the field of spatial computing, one of the most essential tasks is the pose estimation of 3D objects. While rigid transformations of arbitrary 3D objects are relatively hard to detect due to varying environment introducing factors like insufficient lighting or even occlusion, objects with pre-defined shapes are often easy to track, leveraging geometric constraints. Curved images, with flexible dimensions but a confined shape, are essential shapes often targeted in 3D tracking. Traditionally, proprietary algorithms often require specific curvature measures as the input along with the original flattened images to enable pose estimation for a single image target. In this paper, we propose a pipeline that can detect several logo images simultaneously and only requires the original images as the input, unlocking more effects in downstream fields such as Augmented Reality (AR).