:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Straub, Julian, DeTone, Daniel, Shen, Tianwei, Yang, Nan, Sweeney, Chris, Newcombe, Richard
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.10224
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D
by: DeTone, Daniel, et al.
Published: (2026)

Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025)

ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026)

NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026)

EgoLifter: Open-world 3D Segmentation for Egocentric Perception
by: Gu, Qiao, et al.
Published: (2024)

LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
by: Yang, Nan, et al.
Published: (2026)

Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
by: Banerjee, Prithviraj, et al.
Published: (2024)

E$^3$C: Video Generation with 3D Environmental Memory and Ego-Exo Human Pose Control
by: Gu, Qiao, et al.
Published: (2026)

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
by: Banerjee, Prithviraj, et al.
Published: (2024)

FedEFM: Federated Endovascular Foundation Model with Unseen Data
by: Do, Tuong, et al.
Published: (2025)

Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
by: Xie, Christopher, et al.
Published: (2025)

Benchmarking Egocentric Visual-Inertial SLAM at City Scale
by: Krishnan, Anusha, et al.
Published: (2025)

EgoLM: Multi-Modal Language Model of Egocentric Motions
by: Hong, Fangzhou, et al.
Published: (2024)

ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
by: Ding, Fangqiang, et al.
Published: (2024)

Towards Egocentric 3D Hand Pose Estimation in Unseen Domains
by: Mucha, Wiktor, et al.
Published: (2026)

E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
by: Cong, Wenyan, et al.
Published: (2025)

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
by: Lee, Junha, et al.
Published: (2025)

Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset
by: Wang, Zirui, et al.
Published: (2025)

SuperMemory-VQA: An Egocentric Visual Question-Answering Benchmark for Long-Horizon Memory
by: Alam, Samiul, et al.
Published: (2026)

Benchmarking 2D Egocentric Hand Pose Datasets
by: Taran, Olga, et al.
Published: (2024)

EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
by: Xu, Boshen, et al.
Published: (2025)

Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)

Photoreal Scene Reconstruction from an Egocentric Device
by: Lv, Zhaoyang, et al.
Published: (2025)

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
by: Jiang, Haoyi, et al.
Published: (2024)

DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
by: Lin, Chieh Hubert, et al.
Published: (2025)

A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction
by: Shen, Jianghao, et al.
Published: (2024)

BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
by: Lin, Xuewu, et al.
Published: (2024)

Towards Foundation Models for 3D Vision: How Close Are We?
by: Zuo, Yiming, et al.
Published: (2024)

ProGS: Towards Progressive Coding for 3D Gaussian Splatting
by: Tang, Zhiye, et al.
Published: (2026)

Pixels to Play: A Foundation Model for 3D Gameplay
by: Yue, Yuguang, et al.
Published: (2025)

Egocentric Bias in Vision-Language Models
by: Wang, Maijunxian, et al.
Published: (2026)

Instance Tracking in 3D Scenes from Egocentric Videos
by: Zhao, Yunhan, et al.
Published: (2023)

A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)

HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
by: Guzov, Vladimir, et al.
Published: (2024)

EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
by: Millerdurai, Christen, et al.
Published: (2024)

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
by: He, Yuze, et al.
Published: (2023)

Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
by: Schmidt, Tanner, et al.
Published: (2025)

BrainSegFounder: Towards 3D Foundation Models for Neuroimage Segmentation
by: Cox, Joseph, et al.
Published: (2024)

Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging
by: Wang, Shansong, et al.
Published: (2025)

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
by: Yang, Yuhang, et al.
Published: (2024)