:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yu-Wei, Han, Tongju, Gao, Lipeng, Wei, Mingqiang, Liu, Hui, Li, Changbao, Zhang, Caiming
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2508.19555
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)

RegTrack: Simplicity Beneath Complexity in Robust Multi-Modal 3D Multi-Object Tracking
by: Gu, Lipeng, et al.
Published: (2024)

I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs
by: Qi, Yu, et al.
Published: (2025)

PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments
by: Wang, Changhao, et al.
Published: (2025)

Unified Representation Space for 3D Visual Grounding
by: Zheng, Yinuo, et al.
Published: (2025)

MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
by: Zhang, Wenyuan, et al.
Published: (2025)

RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment
by: Cheng, Zeyu, et al.
Published: (2025)

MonoOcc: Digging into Monocular Semantic Occupancy Prediction
by: Zheng, Yupeng, et al.
Published: (2024)

3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
by: Le, Nhut, et al.
Published: (2025)

MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
by: Zhang, Renrui, et al.
Published: (2022)

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
by: Li, Zhenyu, et al.
Published: (2024)

MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model
by: Liu, Jian, et al.
Published: (2025)

ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting
by: Ji, Qun, et al.
Published: (2025)

MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network
by: Jiang, Jianfei, et al.
Published: (2025)

W-HMR: Monocular Human Mesh Recovery in World Space with Weak-Supervised Calibration
by: Yao, Wei, et al.
Published: (2023)

CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction
by: Gu, Lipeng, et al.
Published: (2024)

MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
by: Koleini, Farnoosh, et al.
Published: (2025)

MonoNPHM: Dynamic Head Reconstruction from Monocular Videos
by: Giebenhain, Simon, et al.
Published: (2023)

MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM
by: Li, Renwu, et al.
Published: (2025)

Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
by: Wei, Dingkun, et al.
Published: (2026)

VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
by: Wu, Ke, et al.
Published: (2025)

SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis
by: Chen, Zhengqing, et al.
Published: (2025)

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024)

MonoDETRNext: Next-Generation Accurate and Efficient Monocular 3D Object Detector
by: Liao, Pan, et al.
Published: (2024)

OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics
by: Yoo, Jisang, et al.
Published: (2025)

Monocular Mesh Recovery and Body Measurement of Female Saanen Goats
by: Jin, Bo, et al.
Published: (2026)

BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring
by: Zhao, An, et al.
Published: (2025)

MetricHMSR:Metric Human Mesh and Scene Recovery from Monocular Images
by: Song, Chentao, et al.
Published: (2025)

Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal
by: Yu, Wanchang, et al.
Published: (2025)

GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors
by: Li, An, et al.
Published: (2025)

MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)

MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images
by: Wang, Qirui, et al.
Published: (2025)

Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
by: Wang, Yuran, et al.
Published: (2024)

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
by: Liu, Hou-I, et al.
Published: (2024)

MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)

MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection
by: Fu, Youjia, et al.
Published: (2024)

Rethinking Real-world Image Deraining via An Unpaired Degradation-Conditioned Diffusion Model
by: Shen, Yiyang, et al.
Published: (2023)

High-Fidelity Differential-information Driven Binary Vision Transformer
by: Gao, Tian, et al.
Published: (2025)

MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
by: Liu, Yifan, et al.
Published: (2025)

MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
by: Zhao, Wang, et al.
Published: (2024)