Saved in:
| Main Authors: | Wang, Haotian, Xiao, Aoran, Zhang, Xiaoqin, Yang, Meng, Lu, Shijian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.07374 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
by: Xiao, Aoran, et al.
Published: (2023)
by: Xiao, Aoran, et al.
Published: (2023)
Scale Propagation Network for Generalizable Depth Completion
by: Wang, Haotian, et al.
Published: (2024)
by: Wang, Haotian, et al.
Published: (2024)
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
by: Xiao, Aoran, et al.
Published: (2024)
by: Xiao, Aoran, et al.
Published: (2024)
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024)
by: Jiang, Xueying, et al.
Published: (2024)
Data-Efficient Generalization for Zero-shot Composed Image Retrieval
by: Chen, Zining, et al.
Published: (2025)
by: Chen, Zining, et al.
Published: (2025)
Scalable and Generalizable Correspondence Pruning via Geometry-Consistent Pre-training
by: Liao, Tangfei, et al.
Published: (2024)
by: Liao, Tangfei, et al.
Published: (2024)
MuSASplat: Efficient Sparse-View 3D Gaussian Splats via Lightweight Multi-Scale Adaptation
by: Xu, Muyu, et al.
Published: (2025)
by: Xu, Muyu, et al.
Published: (2025)
Segment Anything with Multiple Modalities
by: Xiao, Aoran, et al.
Published: (2024)
by: Xiao, Aoran, et al.
Published: (2024)
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
by: Xing, Yun, et al.
Published: (2023)
by: Xing, Yun, et al.
Published: (2023)
L3DR: 3D-aware LiDAR Diffusion and Rectification
by: Liu, Quan, et al.
Published: (2026)
by: Liu, Quan, et al.
Published: (2026)
ToDRE: Effective Visual Token Pruning via Token Diversity and Task Relevance
by: Li, Duo, et al.
Published: (2025)
by: Li, Duo, et al.
Published: (2025)
A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models
by: Li, Duo, et al.
Published: (2025)
by: Li, Duo, et al.
Published: (2025)
Foundation Models for Remote Sensing and Earth Observation: A Survey
by: Xiao, Aoran, et al.
Published: (2024)
by: Xiao, Aoran, et al.
Published: (2024)
Weakly Supervised Monocular 3D Detection with a Single-View Image
by: Jiang, Xueying, et al.
Published: (2024)
by: Jiang, Xueying, et al.
Published: (2024)
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
by: Wei, Yu, et al.
Published: (2025)
by: Wei, Yu, et al.
Published: (2025)
Masked AutoDecoder is Effective Multi-Task Vision Generalist
by: Qiu, Han, et al.
Published: (2024)
by: Qiu, Han, et al.
Published: (2024)
Spatial Preference Rewarding for MLLMs Spatial Understanding
by: Qiu, Han, et al.
Published: (2025)
by: Qiu, Han, et al.
Published: (2025)
Exploring 3D Reasoning-Driven Planning: From Implicit Human Intentions to Route-Aware Activity Planning
by: Jiang, Xueying, et al.
Published: (2025)
by: Jiang, Xueying, et al.
Published: (2025)
STS-Mixer: Spatio-Temporal-Spectral Mixer for 4D Point Cloud Video Understanding
by: Li, Wenhao, et al.
Published: (2026)
by: Li, Wenhao, et al.
Published: (2026)
SVDC: Consistent Direct Time-of-Flight Video Depth Completion with Frequency Selective Fusion
by: Zhu, Xuan, et al.
Published: (2025)
by: Zhu, Xuan, et al.
Published: (2025)
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
by: Nie, Jiahao, et al.
Published: (2024)
by: Nie, Jiahao, et al.
Published: (2024)
Historical Test-time Prompt Tuning for Vision Foundation Models
by: Zhang, Jingyi, et al.
Published: (2024)
by: Zhang, Jingyi, et al.
Published: (2024)
X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation
by: Yang, Yuchen, et al.
Published: (2024)
by: Yang, Yuchen, et al.
Published: (2024)
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
by: Lee, In-Jae, et al.
Published: (2025)
by: Lee, In-Jae, et al.
Published: (2025)
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection
by: Yu, Zhenni, et al.
Published: (2024)
by: Yu, Zhenni, et al.
Published: (2024)
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
by: Qiu, Han, et al.
Published: (2024)
by: Qiu, Han, et al.
Published: (2024)
Progressive Depth Decoupling and Modulating for Flexible Depth Completion
by: Yang, Zhiwen, et al.
Published: (2024)
by: Yang, Zhiwen, et al.
Published: (2024)
OASIS-DC: Generalizable Depth Completion via Output-level Alignment of Sparse-Integrated Monocular Pseudo Depth
by: Cho, Jaehyeon, et al.
Published: (2026)
by: Cho, Jaehyeon, et al.
Published: (2026)
Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image Detection
by: Yang, Yawen, et al.
Published: (2026)
by: Yang, Yawen, et al.
Published: (2026)
Multi-Depth Branch Network for Efficient Image Super-Resolution
by: Tian, Huiyuan, et al.
Published: (2023)
by: Tian, Huiyuan, et al.
Published: (2023)
Pose-Free Omnidirectional Gaussian Splatting for 360-Degree Videos with Consistent Depth Priors
by: Zhuang, Chuanqing, et al.
Published: (2026)
by: Zhuang, Chuanqing, et al.
Published: (2026)
Learning Generalizable Shape Completion with SIM(3) Equivariance
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
by: Zeng, Ziyao, et al.
Published: (2024)
by: Zeng, Ziyao, et al.
Published: (2024)
Navigating Label Ambiguity for Facial Expression Recognition in the Wild
by: Lee, JunGyu, et al.
Published: (2025)
by: Lee, JunGyu, et al.
Published: (2025)
Towards Domain-agnostic Depth Completion
by: Xu, Guangkai, et al.
Published: (2022)
by: Xu, Guangkai, et al.
Published: (2022)
The Demon is in Ambiguity: Revisiting Situation Recognition with Single Positive Multi-Label Learning
by: Lin, Yiming, et al.
Published: (2025)
by: Lin, Yiming, et al.
Published: (2025)
Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion
by: Chen, Shenglun, et al.
Published: (2025)
by: Chen, Shenglun, et al.
Published: (2025)
Transparent Object Depth Completion
by: Zhou, Yifan, et al.
Published: (2024)
by: Zhou, Yifan, et al.
Published: (2024)
Depth Completion as Parameter-Efficient Test-Time Adaptation
by: Ke, Bingxin, et al.
Published: (2026)
by: Ke, Bingxin, et al.
Published: (2026)
SDformer: Efficient End-to-End Transformer for Depth Completion
by: Qian, Jian, et al.
Published: (2024)
by: Qian, Jian, et al.
Published: (2024)
Similar Items
-
A Survey of Label-Efficient Deep Learning for 3D Point Clouds
by: Xiao, Aoran, et al.
Published: (2023) -
Scale Propagation Network for Generalizable Depth Completion
by: Wang, Haotian, et al.
Published: (2024) -
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
by: Xiao, Aoran, et al.
Published: (2024) -
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
by: Jiang, Xueying, et al.
Published: (2024) -
Data-Efficient Generalization for Zero-shot Composed Image Retrieval
by: Chen, Zining, et al.
Published: (2025)