Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Boyne, Oliver, Cipolla, Roberto
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.06367
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915159003365376
author	Boyne, Oliver Cipolla, Roberto
author_facet	Boyne, Oliver Cipolla, Roberto
contents	Surface reconstruction from multiple, calibrated images is a challenging task - often requiring a large number of collected images with significant overlap. We look at the specific case of human foot reconstruction. As with previous successful foot reconstruction work, we seek to extract rich per-pixel geometry cues from multi-view RGB images, and fuse these into a final 3D object. Our method, FOCUS, tackles this problem with 3 main contributions: (i) SynFoot2, an extension of an existing synthetic foot dataset to include a new data type: dense correspondence with the parameterized foot model FIND; (ii) an uncertainty-aware dense correspondence predictor trained on our synthetic dataset; (iii) two methods for reconstructing a 3D surface from dense correspondence predictions: one inspired by Structure-from-Motion, and one optimization-based using the FIND model. We show that our reconstruction achieves state-of-the-art reconstruction quality in a few-view setting, performing comparably to state-of-the-art when many views are available, and runs substantially faster. We release our synthetic dataset to the research community. Code is available at: https://github.com/OllieBoyne/FOCUS
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_06367
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences Boyne, Oliver Cipolla, Roberto Computer Vision and Pattern Recognition Surface reconstruction from multiple, calibrated images is a challenging task - often requiring a large number of collected images with significant overlap. We look at the specific case of human foot reconstruction. As with previous successful foot reconstruction work, we seek to extract rich per-pixel geometry cues from multi-view RGB images, and fuse these into a final 3D object. Our method, FOCUS, tackles this problem with 3 main contributions: (i) SynFoot2, an extension of an existing synthetic foot dataset to include a new data type: dense correspondence with the parameterized foot model FIND; (ii) an uncertainty-aware dense correspondence predictor trained on our synthetic dataset; (iii) two methods for reconstructing a 3D surface from dense correspondence predictions: one inspired by Structure-from-Motion, and one optimization-based using the FIND model. We show that our reconstruction achieves state-of-the-art reconstruction quality in a few-view setting, performing comparably to state-of-the-art when many views are available, and runs substantially faster. We release our synthetic dataset to the research community. Code is available at: https://github.com/OllieBoyne/FOCUS
title	FOCUS -- Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2502.06367

Similar Items