:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	DeTone, Daniel, Shen, Tianwei, Zhang, Fan, Ma, Lingni, Straub, Julian, Newcombe, Richard, Engel, Jakob
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.05212
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
by: Straub, Julian, et al.
Published: (2024)

LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
by: Yang, Nan, et al.
Published: (2026)

NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026)

Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025)

ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026)

E$^3$C: Video Generation with 3D Environmental Memory and Ego-Exo Human Pose Control
by: Gu, Qiao, et al.
Published: (2026)

Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
by: Xie, Christopher, et al.
Published: (2025)

Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
by: Banerjee, Prithviraj, et al.
Published: (2024)

OpenBox: Annotate Any Bounding Boxes in 3D
by: Lee, In-Jae, et al.
Published: (2025)

EgoLM: Multi-Modal Language Model of Egocentric Motions
by: Hong, Fangzhou, et al.
Published: (2024)

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
by: Banerjee, Prithviraj, et al.
Published: (2024)

HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
by: Guzov, Vladimir, et al.
Published: (2024)

Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
by: T, Mukund Varma, et al.
Published: (2024)

Lifting Motion to the 3D World via 2D Diffusion
by: Li, Jiaman, et al.
Published: (2024)

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
by: Jia, Yueru, et al.
Published: (2024)

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
by: Yang, Yung-Hsu, et al.
Published: (2025)

4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos
by: Xu, Zhen, et al.
Published: (2025)

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors
by: Chen, Yabo, et al.
Published: (2024)

Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds
by: Meng, Qinghao, et al.
Published: (2025)

PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting
by: Wang, Yu, et al.
Published: (2024)

Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild
by: Ma, Lingni, et al.
Published: (2024)

EgoLifter: Open-world 3D Segmentation for Egocentric Perception
by: Gu, Qiao, et al.
Published: (2024)

MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane
by: Jeon, Changwoo, et al.
Published: (2026)

3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models
by: Zhao, Ruisi, et al.
Published: (2026)

BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
by: Koo, Juil, et al.
Published: (2026)

MPL: Lifting 3D Human Pose from Multi-view 2D Poses
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2024)

S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs
by: Ji, Yuzhou, et al.
Published: (2026)

LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
by: Wang, Jingjing, et al.
Published: (2026)

HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model
by: Liu, Qi, et al.
Published: (2025)

Photoreal Scene Reconstruction from an Egocentric Device
by: Lv, Zhaoyang, et al.
Published: (2025)

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection
by: Zhang, Ruiyang, et al.
Published: (2024)

Aria Gen 2 Pilot Dataset
by: Kong, Chen, et al.
Published: (2025)

Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
by: Schmidt, Tanner, et al.
Published: (2025)

RUMPL: Ray-Based Transformers for Universal Multi-View 2D to 3D Human Pose Lifting
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2025)

Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing
by: Shen, Hongyu, et al.
Published: (2025)

Benchmarking Egocentric Visual-Inertial SLAM at City Scale
by: Krishnan, Anusha, et al.
Published: (2025)

CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction
by: Gupta, Pranav, et al.
Published: (2024)

OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection
by: Huang, Chenxi, et al.
Published: (2022)

WorldFlow3D: Flowing Through 3D Distributions for Unbounded World Generation
by: Joshi, Amogh, et al.
Published: (2026)