Saved in:
| Main Authors: | Straub, Julian, DeTone, Daniel, Shen, Tianwei, Yang, Nan, Sweeney, Chris, Newcombe, Richard |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.10224 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D
by: DeTone, Daniel, et al.
Published: (2026)
by: DeTone, Daniel, et al.
Published: (2026)
Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025)
by: Wu, Xiaoyang, et al.
Published: (2025)
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026)
by: Siddiqui, Yawar, et al.
Published: (2026)
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026)
by: DeTone, Daniel, et al.
Published: (2026)
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
by: Gu, Qiao, et al.
Published: (2024)
by: Gu, Qiao, et al.
Published: (2024)
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
by: Yang, Nan, et al.
Published: (2026)
by: Yang, Nan, et al.
Published: (2026)
Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
by: Banerjee, Prithviraj, et al.
Published: (2024)
by: Banerjee, Prithviraj, et al.
Published: (2024)
E$^3$C: Video Generation with 3D Environmental Memory and Ego-Exo Human Pose Control
by: Gu, Qiao, et al.
Published: (2026)
by: Gu, Qiao, et al.
Published: (2026)
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
by: Banerjee, Prithviraj, et al.
Published: (2024)
by: Banerjee, Prithviraj, et al.
Published: (2024)
FedEFM: Federated Endovascular Foundation Model with Unseen Data
by: Do, Tuong, et al.
Published: (2025)
by: Do, Tuong, et al.
Published: (2025)
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
by: Xie, Christopher, et al.
Published: (2025)
by: Xie, Christopher, et al.
Published: (2025)
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
by: Krishnan, Anusha, et al.
Published: (2025)
by: Krishnan, Anusha, et al.
Published: (2025)
EgoLM: Multi-Modal Language Model of Egocentric Motions
by: Hong, Fangzhou, et al.
Published: (2024)
by: Hong, Fangzhou, et al.
Published: (2024)
ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
by: Ding, Fangqiang, et al.
Published: (2024)
by: Ding, Fangqiang, et al.
Published: (2024)
Towards Egocentric 3D Hand Pose Estimation in Unseen Domains
by: Mucha, Wiktor, et al.
Published: (2026)
by: Mucha, Wiktor, et al.
Published: (2026)
E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
by: Cong, Wenyan, et al.
Published: (2025)
by: Cong, Wenyan, et al.
Published: (2025)
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
by: Lee, Junha, et al.
Published: (2025)
by: Lee, Junha, et al.
Published: (2025)
Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset
by: Wang, Zirui, et al.
Published: (2025)
by: Wang, Zirui, et al.
Published: (2025)
SuperMemory-VQA: An Egocentric Visual Question-Answering Benchmark for Long-Horizon Memory
by: Alam, Samiul, et al.
Published: (2026)
by: Alam, Samiul, et al.
Published: (2026)
Benchmarking 2D Egocentric Hand Pose Datasets
by: Taran, Olga, et al.
Published: (2024)
by: Taran, Olga, et al.
Published: (2024)
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
by: Xu, Boshen, et al.
Published: (2025)
by: Xu, Boshen, et al.
Published: (2025)
Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)
by: Liu, Cuiyu, et al.
Published: (2024)
Photoreal Scene Reconstruction from an Egocentric Device
by: Lv, Zhaoyang, et al.
Published: (2025)
by: Lv, Zhaoyang, et al.
Published: (2025)
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
by: Jiang, Haoyi, et al.
Published: (2024)
by: Jiang, Haoyi, et al.
Published: (2024)
DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
by: Lin, Chieh Hubert, et al.
Published: (2025)
by: Lin, Chieh Hubert, et al.
Published: (2025)
A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction
by: Shen, Jianghao, et al.
Published: (2024)
by: Shen, Jianghao, et al.
Published: (2024)
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
by: Lin, Xuewu, et al.
Published: (2024)
by: Lin, Xuewu, et al.
Published: (2024)
Towards Foundation Models for 3D Vision: How Close Are We?
by: Zuo, Yiming, et al.
Published: (2024)
by: Zuo, Yiming, et al.
Published: (2024)
ProGS: Towards Progressive Coding for 3D Gaussian Splatting
by: Tang, Zhiye, et al.
Published: (2026)
by: Tang, Zhiye, et al.
Published: (2026)
Pixels to Play: A Foundation Model for 3D Gameplay
by: Yue, Yuguang, et al.
Published: (2025)
by: Yue, Yuguang, et al.
Published: (2025)
Egocentric Bias in Vision-Language Models
by: Wang, Maijunxian, et al.
Published: (2026)
by: Wang, Maijunxian, et al.
Published: (2026)
Instance Tracking in 3D Scenes from Egocentric Videos
by: Zhao, Yunhan, et al.
Published: (2023)
by: Zhao, Yunhan, et al.
Published: (2023)
A Survey on 3D Egocentric Human Pose Estimation
by: Azam, Md Mushfiqur, et al.
Published: (2024)
by: Azam, Md Mushfiqur, et al.
Published: (2024)
HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
by: Guzov, Vladimir, et al.
Published: (2024)
by: Guzov, Vladimir, et al.
Published: (2024)
EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams
by: Millerdurai, Christen, et al.
Published: (2024)
by: Millerdurai, Christen, et al.
Published: (2024)
T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
by: He, Yuze, et al.
Published: (2023)
by: He, Yuze, et al.
Published: (2023)
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
by: Schmidt, Tanner, et al.
Published: (2025)
by: Schmidt, Tanner, et al.
Published: (2025)
BrainSegFounder: Towards 3D Foundation Models for Neuroimage Segmentation
by: Cox, Joseph, et al.
Published: (2024)
by: Cox, Joseph, et al.
Published: (2024)
Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging
by: Wang, Shansong, et al.
Published: (2025)
by: Wang, Shansong, et al.
Published: (2025)
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
by: Yang, Yuhang, et al.
Published: (2024)
by: Yang, Yuhang, et al.
Published: (2024)
Similar Items
-
Boxer: Robust Lifting of Open-World 2D Bounding Boxes to 3D
by: DeTone, Daniel, et al.
Published: (2026) -
Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025) -
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026) -
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026) -
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
by: Gu, Qiao, et al.
Published: (2024)