Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lee, Jae Yong, Scharstein, Daniel, Bapat, Akash, Hu, Hao, Fu, Andrew, Zhao, Haoru, Sammut, Paul, Li, Xiang, Jeapes, Stephen, Gupta, Anik, David, Lior, Madhuvarasu, Saketh, Joshi, Jay Girish, Wither, Jason
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.13741
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912966030393344
author	Lee, Jae Yong Scharstein, Daniel Bapat, Akash Hu, Hao Fu, Andrew Zhao, Haoru Sammut, Paul Li, Xiang Jeapes, Stephen Gupta, Anik David, Lior Madhuvarasu, Saketh Joshi, Jay Girish Wither, Jason
author_facet	Lee, Jae Yong Scharstein, Daniel Bapat, Akash Hu, Hao Fu, Andrew Zhao, Haoru Sammut, Paul Li, Xiang Jeapes, Stephen Gupta, Anik David, Lior Madhuvarasu, Saketh Joshi, Jay Girish Wither, Jason
contents	We present Ego-1K, a large-scale collection of time-synchronized egocentric multiview videos designed to advance neural 3D video synthesis and dynamic scene understanding. The dataset contains nearly 1,000 short egocentric videos captured with a custom rig with 12 synchronized cameras surrounding a 4-camera VR headset worn by the user. Scene content focuses on hand motions and hand-object interactions in different settings. We describe rig design, data processing, and calibration. Our dataset enables new ways to benchmark egocentric scene reconstruction methods, an important research area as smart glasses with multiple cameras become omnipresent. Our experiments demonstrate that our dataset presents unique challenges for existing 3D and 4D novel view synthesis methods due to large disparities and image motion caused by close dynamic objects and rig egomotion. Our dataset supports future research in this challenging domain. It is available at https://huggingface.co/datasets/facebook/ego-1k.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_13741
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Ego-1K -- A Large-Scale Multiview Video Dataset for Egocentric Vision Lee, Jae Yong Scharstein, Daniel Bapat, Akash Hu, Hao Fu, Andrew Zhao, Haoru Sammut, Paul Li, Xiang Jeapes, Stephen Gupta, Anik David, Lior Madhuvarasu, Saketh Joshi, Jay Girish Wither, Jason Computer Vision and Pattern Recognition We present Ego-1K, a large-scale collection of time-synchronized egocentric multiview videos designed to advance neural 3D video synthesis and dynamic scene understanding. The dataset contains nearly 1,000 short egocentric videos captured with a custom rig with 12 synchronized cameras surrounding a 4-camera VR headset worn by the user. Scene content focuses on hand motions and hand-object interactions in different settings. We describe rig design, data processing, and calibration. Our dataset enables new ways to benchmark egocentric scene reconstruction methods, an important research area as smart glasses with multiple cameras become omnipresent. Our experiments demonstrate that our dataset presents unique challenges for existing 3D and 4D novel view synthesis methods due to large disparities and image motion caused by close dynamic objects and rig egomotion. Our dataset supports future research in this challenging domain. It is available at https://huggingface.co/datasets/facebook/ego-1k.
title	Ego-1K -- A Large-Scale Multiview Video Dataset for Egocentric Vision
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2603.13741

Similar Items