:: Library Catalog

Bibliographic Details
Main Authors:	Zhou, Hongyi; Guo, Yulan; Wang, Xiaogang; Xu, Kai
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.11868

Similar Items

OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
by: Tang, Yijie, et al.
Published: (2025)

MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
by: Koleini, Farnoosh, et al.
Published: (2025)

MonoNPHM: Dynamic Head Reconstruction from Monocular Videos
by: Giebenhain, Simon, et al.
Published: (2023)

Mono4DGS-HDR: High Dynamic Range 4D Gaussian Splatting from Alternating-exposure Monocular Videos
by: Liu, Jinfeng, et al.
Published: (2025)

MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)

MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
by: Li, Haitian, et al.
Published: (2026)

MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)

MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
by: Zhao, Wang, et al.
Published: (2024)

Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction
by: Li, Wenyu, et al.
Published: (2025)

MonoPhysics: Estimating Geometry, Appearance, and Physical Parameters from Monocular Videos
by: Rho, Daniel, et al.
Published: (2026)

VideoDirector: Precise Video Editing via Text-to-Video Models
by: Wang, Yukun, et al.
Published: (2024)

MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
by: Yang, Min, et al.
Published: (2025)

MonoSAOD: Monocular 3D Object Detection with Sparsely Annotated Label
by: Jung, Junyoung, et al.
Published: (2026)

MonoLSS: Learnable Sample Selection For Monocular 3D Detection
by: Li, Zhenjia, et al.
Published: (2023)

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
by: Liu, Hou-I, et al.
Published: (2024)

MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
by: Pu, Fanqi, et al.
Published: (2024)

MonoCloth: Reconstruction and Animation of Cloth-Decoupled Human Avatars from Monocular Videos
by: Jin, Daisheng, et al.
Published: (2025)

MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
by: Zhang, Renrui, et al.
Published: (2022)

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
by: Gu, Yuming, et al.
Published: (2023)

MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
by: Parihar, Rishubh, et al.
Published: (2025)

MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion
by: Wang, Zihan, et al.
Published: (2025)

MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection
by: Yang, Sunghun, et al.
Published: (2025)

MonoDETRNext: Next-Generation Accurate and Efficient Monocular 3D Object Detector
by: Liao, Pan, et al.
Published: (2024)

MonoPRIO: Adaptive Prior Conditioning for Unified Monocular 3D Object Detection
by: Davies, Leon, et al.
Published: (2026)

MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
by: Liu, Yifan, et al.
Published: (2025)

Progressive Correspondence Regenerator for Robust 3D Registration
by: Zhao, Guiyu, et al.
Published: (2025)

Zero-Shot Monocular Scene Flow Estimation in the Wild
by: Liang, Yiqing, et al.
Published: (2025)

MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models
by: Meier, Johannes, et al.
Published: (2025)

MonoVQD: Monocular 3D Object Detection with Variational Query Denoising and Self-Distillation
by: Vu, Kiet Dang, et al.
Published: (2025)

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024)

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
by: Oh, Youngmin, et al.
Published: (2024)

MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images
by: Wang, Qirui, et al.
Published: (2025)

ViT-1.58b: Mobile Vision Transformers in the 1-bit Era
by: Yuan, Zhengqing, et al.
Published: (2024)

MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection
by: Fu, Youjia, et al.
Published: (2024)

MonoOcc: Digging into Monocular Semantic Occupancy Prediction
by: Zheng, Yupeng, et al.
Published: (2024)

Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
by: Zhang, Wentao, et al.
Published: (2024)

PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments
by: Wang, Changhao, et al.
Published: (2025)

Map-Mono-Ego: Map-Grounded Global Human Pose Estimation from Monocular Egocentric Video
by: Deguchi, Hiroyuki, et al.
Published: (2026)

Mono4DEditor: Text-Driven 4D Scene Editing from Monocular Video via Point-Level Localization of Language-Embedded Gaussians
by: Shi, Jin-Chuan, et al.
Published: (2025)

MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)