Saved in:
| Main Authors: | Zhang, Yu-Wei, Han, Tongju, Gao, Lipeng, Wei, Mingqiang, Liu, Hui, Li, Changbao, Zhang, Caiming |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.19555 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024)
by: Wu, Keyu, et al.
Published: (2024)
RegTrack: Simplicity Beneath Complexity in Robust Multi-Modal 3D Multi-Object Tracking
by: Gu, Lipeng, et al.
Published: (2024)
by: Gu, Lipeng, et al.
Published: (2024)
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs
by: Qi, Yu, et al.
Published: (2025)
by: Qi, Yu, et al.
Published: (2025)
PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments
by: Wang, Changhao, et al.
Published: (2025)
by: Wang, Changhao, et al.
Published: (2025)
Unified Representation Space for 3D Visual Grounding
by: Zheng, Yinuo, et al.
Published: (2025)
by: Zheng, Yinuo, et al.
Published: (2025)
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
by: Zhang, Wenyuan, et al.
Published: (2025)
by: Zhang, Wenyuan, et al.
Published: (2025)
RTS-Mono: A Real-Time Self-Supervised Monocular Depth Estimation Method for Real-World Deployment
by: Cheng, Zeyu, et al.
Published: (2025)
by: Cheng, Zeyu, et al.
Published: (2025)
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
by: Zheng, Yupeng, et al.
Published: (2024)
by: Zheng, Yupeng, et al.
Published: (2024)
3DAeroRelief: The first 3D Benchmark UAV Dataset for Post-Disaster Assessment
by: Le, Nhut, et al.
Published: (2025)
by: Le, Nhut, et al.
Published: (2025)
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
by: Zhang, Renrui, et al.
Published: (2022)
by: Zhang, Renrui, et al.
Published: (2022)
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
by: Li, Zhenyu, et al.
Published: (2024)
by: Li, Zhenyu, et al.
Published: (2024)
MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model
by: Liu, Jian, et al.
Published: (2025)
by: Liu, Jian, et al.
Published: (2025)
ColorGS: High-fidelity Surgical Scene Reconstruction with Colored Gaussian Splatting
by: Ji, Qun, et al.
Published: (2025)
by: Ji, Qun, et al.
Published: (2025)
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network
by: Jiang, Jianfei, et al.
Published: (2025)
by: Jiang, Jianfei, et al.
Published: (2025)
W-HMR: Monocular Human Mesh Recovery in World Space with Weak-Supervised Calibration
by: Yao, Wei, et al.
Published: (2023)
by: Yao, Wei, et al.
Published: (2023)
CrossTracker: Robust Multi-modal 3D Multi-Object Tracking via Cross Correction
by: Gu, Lipeng, et al.
Published: (2024)
by: Gu, Lipeng, et al.
Published: (2024)
MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
by: Koleini, Farnoosh, et al.
Published: (2025)
by: Koleini, Farnoosh, et al.
Published: (2025)
MonoNPHM: Dynamic Head Reconstruction from Monocular Videos
by: Giebenhain, Simon, et al.
Published: (2023)
by: Giebenhain, Simon, et al.
Published: (2023)
MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM
by: Li, Renwu, et al.
Published: (2025)
by: Li, Renwu, et al.
Published: (2025)
Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos
by: Wei, Dingkun, et al.
Published: (2026)
by: Wei, Dingkun, et al.
Published: (2026)
VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes
by: Wu, Ke, et al.
Published: (2025)
by: Wu, Ke, et al.
Published: (2025)
SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis
by: Chen, Zhengqing, et al.
Published: (2025)
by: Chen, Zhengqing, et al.
Published: (2025)
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024)
by: Jiang, Xueying, et al.
Published: (2024)
MonoDETRNext: Next-Generation Accurate and Efficient Monocular 3D Object Detector
by: Liao, Pan, et al.
Published: (2024)
by: Liao, Pan, et al.
Published: (2024)
OpenMonoGS-SLAM: Monocular Gaussian Splatting SLAM with Open-set Semantics
by: Yoo, Jisang, et al.
Published: (2025)
by: Yoo, Jisang, et al.
Published: (2025)
Monocular Mesh Recovery and Body Measurement of Female Saanen Goats
by: Jin, Bo, et al.
Published: (2026)
by: Jin, Bo, et al.
Published: (2026)
BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring
by: Zhao, An, et al.
Published: (2025)
by: Zhao, An, et al.
Published: (2025)
MetricHMSR:Metric Human Mesh and Scene Recovery from Monocular Images
by: Song, Chentao, et al.
Published: (2025)
by: Song, Chentao, et al.
Published: (2025)
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal
by: Yu, Wanchang, et al.
Published: (2025)
by: Yu, Wanchang, et al.
Published: (2025)
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors
by: Li, An, et al.
Published: (2025)
by: Li, An, et al.
Published: (2025)
MonoCD: Monocular 3D Object Detection with Complementary Depths
by: Yan, Longfei, et al.
Published: (2024)
by: Yan, Longfei, et al.
Published: (2024)
MonoSR: Open-Vocabulary Spatial Reasoning from Monocular Images
by: Wang, Qirui, et al.
Published: (2025)
by: Wang, Qirui, et al.
Published: (2025)
Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching
by: Wang, Yuran, et al.
Published: (2024)
by: Wang, Yuran, et al.
Published: (2024)
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
by: Liu, Hou-I, et al.
Published: (2024)
by: Liu, Hou-I, et al.
Published: (2024)
MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)
by: Wang, Shuo, et al.
Published: (2025)
MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection
by: Fu, Youjia, et al.
Published: (2024)
by: Fu, Youjia, et al.
Published: (2024)
Rethinking Real-world Image Deraining via An Unpaired Degradation-Conditioned Diffusion Model
by: Shen, Yiyang, et al.
Published: (2023)
by: Shen, Yiyang, et al.
Published: (2023)
High-Fidelity Differential-information Driven Binary Vision Transformer
by: Gao, Tian, et al.
Published: (2025)
by: Gao, Tian, et al.
Published: (2025)
MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
by: Liu, Yifan, et al.
Published: (2025)
by: Liu, Yifan, et al.
Published: (2025)
MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
by: Zhao, Wang, et al.
Published: (2024)
by: Zhao, Wang, et al.
Published: (2024)
Similar Items
-
MonoHair: High-Fidelity Hair Modeling from a Monocular Video
by: Wu, Keyu, et al.
Published: (2024) -
RegTrack: Simplicity Beneath Complexity in Robust Multi-Modal 3D Multi-Object Tracking
by: Gu, Lipeng, et al.
Published: (2024) -
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs
by: Qi, Yu, et al.
Published: (2025) -
PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments
by: Wang, Changhao, et al.
Published: (2025) -
Unified Representation Space for 3D Visual Grounding
by: Zheng, Yinuo, et al.
Published: (2025)