Saved in:
| Main Authors: | DeTone, Daniel, Shen, Tianwei, Zhang, Fan, Ma, Lingni, Straub, Julian, Newcombe, Richard, Engel, Jakob |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.05212 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
by: Straub, Julian, et al.
Published: (2024)
by: Straub, Julian, et al.
Published: (2024)
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
by: Yang, Nan, et al.
Published: (2026)
by: Yang, Nan, et al.
Published: (2026)
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026)
by: DeTone, Daniel, et al.
Published: (2026)
Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025)
by: Wu, Xiaoyang, et al.
Published: (2025)
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026)
by: Siddiqui, Yawar, et al.
Published: (2026)
E$^3$C: Video Generation with 3D Environmental Memory and Ego-Exo Human Pose Control
by: Gu, Qiao, et al.
Published: (2026)
by: Gu, Qiao, et al.
Published: (2026)
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
by: Xie, Christopher, et al.
Published: (2025)
by: Xie, Christopher, et al.
Published: (2025)
Introducing HOT3D: An Egocentric Dataset for 3D Hand and Object Tracking
by: Banerjee, Prithviraj, et al.
Published: (2024)
by: Banerjee, Prithviraj, et al.
Published: (2024)
OpenBox: Annotate Any Bounding Boxes in 3D
by: Lee, In-Jae, et al.
Published: (2025)
by: Lee, In-Jae, et al.
Published: (2025)
EgoLM: Multi-Modal Language Model of Egocentric Motions
by: Hong, Fangzhou, et al.
Published: (2024)
by: Hong, Fangzhou, et al.
Published: (2024)
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
by: Banerjee, Prithviraj, et al.
Published: (2024)
by: Banerjee, Prithviraj, et al.
Published: (2024)
HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
by: Guzov, Vladimir, et al.
Published: (2024)
by: Guzov, Vladimir, et al.
Published: (2024)
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D
by: T, Mukund Varma, et al.
Published: (2024)
by: T, Mukund Varma, et al.
Published: (2024)
Lifting Motion to the 3D World via 2D Diffusion
by: Li, Jiaman, et al.
Published: (2024)
by: Li, Jiaman, et al.
Published: (2024)
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
by: Jia, Yueru, et al.
Published: (2024)
by: Jia, Yueru, et al.
Published: (2024)
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
by: Yang, Yung-Hsu, et al.
Published: (2025)
by: Yang, Yung-Hsu, et al.
Published: (2025)
4DGT: Learning a 4D Gaussian Transformer Using Real-World Monocular Videos
by: Xu, Zhen, et al.
Published: (2025)
by: Xu, Zhen, et al.
Published: (2025)
LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors
by: Chen, Yabo, et al.
Published: (2024)
by: Chen, Yabo, et al.
Published: (2024)
Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds
by: Meng, Qinghao, et al.
Published: (2025)
by: Meng, Qinghao, et al.
Published: (2025)
PLGS: Robust Panoptic Lifting with 3D Gaussian Splatting
by: Wang, Yu, et al.
Published: (2024)
by: Wang, Yu, et al.
Published: (2024)
Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild
by: Ma, Lingni, et al.
Published: (2024)
by: Ma, Lingni, et al.
Published: (2024)
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
by: Gu, Qiao, et al.
Published: (2024)
by: Gu, Qiao, et al.
Published: (2024)
MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane
by: Jeon, Changwoo, et al.
Published: (2026)
by: Jeon, Changwoo, et al.
Published: (2026)
3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)
by: Liu, Ruiqi, et al.
Published: (2024)
Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models
by: Zhao, Ruisi, et al.
Published: (2026)
by: Zhao, Ruisi, et al.
Published: (2026)
BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
by: Koo, Juil, et al.
Published: (2026)
by: Koo, Juil, et al.
Published: (2026)
MPL: Lifting 3D Human Pose from Multi-view 2D Poses
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2024)
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2024)
S2D: Sparse to Dense Lifting for 3D Reconstruction with Minimal Inputs
by: Ji, Yuzhou, et al.
Published: (2026)
by: Ji, Yuzhou, et al.
Published: (2026)
LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
by: Wang, Jingjing, et al.
Published: (2026)
by: Wang, Jingjing, et al.
Published: (2026)
HQ-OV3D: A High Box Quality Open-World 3D Detection Framework based on Diffision Model
by: Liu, Qi, et al.
Published: (2025)
by: Liu, Qi, et al.
Published: (2025)
Photoreal Scene Reconstruction from an Egocentric Device
by: Lv, Zhaoyang, et al.
Published: (2025)
by: Lv, Zhaoyang, et al.
Published: (2025)
Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection
by: Zhang, Ruiyang, et al.
Published: (2024)
by: Zhang, Ruiyang, et al.
Published: (2024)
Aria Gen 2 Pilot Dataset
by: Kong, Chen, et al.
Published: (2025)
by: Kong, Chen, et al.
Published: (2025)
Segment This Thing: Foveated Tokenization for Efficient Point-Prompted Segmentation
by: Schmidt, Tanner, et al.
Published: (2025)
by: Schmidt, Tanner, et al.
Published: (2025)
RUMPL: Ray-Based Transformers for Universal Multi-View 2D to 3D Human Pose Lifting
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2025)
by: Ghasemzadeh, Seyed Abolfazl, et al.
Published: (2025)
Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing
by: Shen, Hongyu, et al.
Published: (2025)
by: Shen, Hongyu, et al.
Published: (2025)
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
by: Krishnan, Anusha, et al.
Published: (2025)
by: Krishnan, Anusha, et al.
Published: (2025)
CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction
by: Gupta, Pranav, et al.
Published: (2024)
by: Gupta, Pranav, et al.
Published: (2024)
OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection
by: Huang, Chenxi, et al.
Published: (2022)
by: Huang, Chenxi, et al.
Published: (2022)
WorldFlow3D: Flowing Through 3D Distributions for Unbounded World Generation
by: Joshi, Amogh, et al.
Published: (2026)
by: Joshi, Amogh, et al.
Published: (2026)
Similar Items
-
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models
by: Straub, Julian, et al.
Published: (2024) -
LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World
by: Yang, Nan, et al.
Published: (2026) -
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data
by: DeTone, Daniel, et al.
Published: (2026) -
Sonata: Self-Supervised Learning of Reliable Point Representations
by: Wu, Xiaoyang, et al.
Published: (2025) -
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures
by: Siddiqui, Yawar, et al.
Published: (2026)