| Main Authors: | Zhou, Hongyi, Guo, Yulan, Wang, Xiaogang, Xu, Kai |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11868 |
Similar Items
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
by: Tang, Yijie, et al.
Published: (2025)
MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
by: Koleini, Farnoosh, et al.
Published: (2025)
MonoNPHM: Dynamic Head Reconstruction from Monocular Videos
by: Giebenhain, Simon, et al.
Published: (2023)
Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos
by: Liu, Jinfeng, et al.
Published: (2025)
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
by: Li, Haitian, et al.
Published: (2026)
MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
by: Zhao, Wang, et al.
Published: (2024)
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
by: Li, Wenyu, et al.
Published: (2025)
MonoPhysics: Estimating Geometry, Appearance, and Physical Parameters from Monocular Videos
by: Rho, Daniel, et al.
Published: (2026)
VideoDirector: Precise Video Editing via Text-to-Video Models
by: Wang, Yukun, et al.
Published: (2024)
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
by: Yang, Min, et al.
Published: (2025)
MonoSAOD: Monocular 3D Object Detection with Sparsely Annotated Label
by: Jung, Junyoung, et al.
Published: (2026)
MonoLSS: Learnable Sample Selection For Monocular 3D Detection
by: Li, Zhenjia, et al.
Published: (2023)
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
by: Liu, Hou-I, et al.
Published: (2024)
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
by: Pu, Fanqi, et al.
Published: (2024)
MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos
by: Jin, Daisheng, et al.
Published: (2025)
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
by: Zhang, Renrui, et al.
Published: (2022)
DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
by: Gu, Yuming, et al.
Published: (2023)
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
by: Parihar, Rishubh, et al.
Published: (2025)
MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
by: Wang, Zihan, et al.
Published: (2025)
MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection
by: Yang, Sunghun, et al.
Published: (2025)
MonoDETRNext: Next-Generation Accurate and Efficient Monocular 3D Object Detector
by: Liao, Pan, et al.
Published: (2024)
MonoPRIO: Adaptive Prior Conditioning for Unified Monocular 3D Object Detection
by: Davies, Leon, et al.
Published: (2026)
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
by: Liu, Yifan, et al.
Published: (2025)
Progressive Correspondence Regenerator for Robust 3D Registration
by: Zhao, Guiyu, et al.
Published: (2025)
Zero-Shot Monocular Scene Flow Estimation in the Wild
by: Liang, Yiqing, et al.
Published: (2025)
MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models
by: Meier, Johannes, et al.
Published: (2025)
MonoVQD: Monocular 3D Object Detection with Variational Query Denoising and Self-Distillation
by: Vu, Kiet Dang, et al.
Published: (2025)
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024)
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
by: Oh, Youngmin, et al.
Published: (2024)
MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images
by: Wang, Qirui, et al.
Published: (2025)
ViT-1.58b: Mobile Vision Transformers in the 1-bit Era
by: Yuan, Zhengqing, et al.
Published: (2024)
MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection
by: Fu, Youjia, et al.
Published: (2024)
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
by: Zheng, Yupeng, et al.
Published: (2024)
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
by: Zhang, Wentao, et al.
Published: (2024)
PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments
by: Wang, Changhao, et al.
Published: (2025)
Map-Mono-Ego: Map-Grounded Global Human Pose Estimation from Monocular Egocentric Video
by: Deguchi, Hiroyuki, et al.
Published: (2026)
Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians
by: Shi, Jin-Chuan, et al.
Published: (2025)
MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)