:: Library Catalog

Bibliographic Details
Main Authors:	Zhou, Wentao; Chen, Xuweiyi; Rajagopal, Vignesh; Chen, Jeffrey; Chandra, Rohan; Cheng, Zezhou
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.10956

Similar Items

WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments
by: Chen, Xuweiyi, et al.
Published: (2026)

Probing the Mid-level Vision Capabilities of Self-Supervised Learning
by: Chen, Xuweiyi, et al.
Published: (2024)

Semantic-Free Procedural 3D Shapes Are Surprisingly Good Teachers
by: Chen, Xuweiyi, et al.
Published: (2024)

Point-MoE: Large-Scale Multi-Dataset Training with Mixture-of-Experts for 3D Semantic Segmentation
by: Chen, Xuweiyi, et al.
Published: (2025)

Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
by: Wang, Boyang, et al.
Published: (2025)

Open Vocabulary Monocular 3D Object Detection
by: Yao, Jin, et al.
Published: (2024)

SAB3R: Semantic-Augmented Backbone in 3D Reconstruction
by: Chen, Xuweiyi, et al.
Published: (2025)

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
by: Xia, Tian, et al.
Published: (2024)

StereoVGGT: A Training-Free Visual Geometry Transformer for Stereo Vision
by: Chen, Ziyang, et al.
Published: (2026)

All-in-One: Transferring Vision Foundation Models into Stereo Matching
by: Zhou, Jingyi, et al.
Published: (2024)

Next-Embedding Prediction Makes Strong Vision Learners
by: Xu, Sihan, et al.
Published: (2025)

Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation
by: Wang, Zihan, et al.
Published: (2025)

VDNeRF: Vision-only Dynamic Neural Radiance Field for Urban Scenes
by: Zou, Zhengyu, et al.
Published: (2025)

Eyes on the Streets: Leveraging Street-Level Imaging to Model Urban Crime Dynamics
by: Qi, Zhixuan, et al.
Published: (2024)

SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing
by: Chen, Chen, et al.
Published: (2024)

ZeroStereo: Zero-shot Stereo Matching from Single Images
by: Wang, Xianqi, et al.
Published: (2025)

Dual-Level Precision Edges Guided Multi-View Stereo with Accurate Planarization
by: Chen, Kehua, et al.
Published: (2024)

Multi-Object Hallucination in Vision-Language Models
by: Chen, Xuweiyi, et al.
Published: (2024)

The Role of Cyclopean-Eye in Stereo Vision
by: da Silva, Sherlon Almeida, et al.
Published: (2025)

WakeupUrban: Unsupervised Semantic Segmentation of Mid-20$^{th}$ century Urban Landscapes with Satellite Imagery
by: Hao, Tianxiang, et al.
Published: (2025)

MLG-Stereo: ViT Based Stereo Matching with Multi-Stage Local-Global Enhancement
by: Zhang, Haoyu, et al.
Published: (2026)

Cross-Modal Urban Sensing: Evaluating Sound-Vision Alignment Across Street-Level and Aerial Imagery
by: Chen, Pengyu, et al.
Published: (2025)

StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
by: Li, Haodong, et al.
Published: (2025)

Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching
by: Jing, Junpeng, et al.
Published: (2024)

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
by: Shen, Guibao, et al.
Published: (2025)

Fisheye Stereo Vision: Depth and Range Error
by: Jiang, Leaf, et al.
Published: (2026)

Non-Learning Low-Light Stereo Vision
by: Wang, Jason, et al.
Published: (2026)

Playing to Vision Foundation Model's Strengths in Stereo Matching
by: Liu, Chuang-Wei, et al.
Published: (2024)

CityPulse: Fine-Grained Assessment of Urban Change with Street View Time Series
by: Huang, Tianyuan, et al.
Published: (2024)

SMFormer: Empowering Self-supervised Stereo Matching via Foundation Models and Data Augmentation
by: Wang, Yun, et al.
Published: (2026)

Pip-Stereo: Progressive Iterations Pruner for Iterative Optimization based Stereo Matching
by: Zheng, Jintu, et al.
Published: (2026)

DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos
by: Huang, Yuan, et al.
Published: (2026)

Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction
by: Huang, Xiufeng, et al.
Published: (2025)

Adapting Stereo Vision From Objects To 3D Lunar Surface Reconstruction with the StereoLunar Dataset
by: Grethen, Clementine, et al.
Published: (2025)

MoCha-Stereo: Motif Channel Attention Network for Stereo Matching
by: Chen, Ziyang, et al.
Published: (2024)

Vision-Based Autonomous UAV Navigation and Landing for Urban Search and Rescue
by: Mittal, Mayank, et al.
Published: (2019)

A Large Vision-Language Model based Environment Perception System for Visually Impaired People
by: Chen, Zezhou, et al.
Published: (2025)

RayMap3R: Inference-Time RayMap for Dynamic 3D Reconstruction
by: Wang, Feiran, et al.
Published: (2026)

Affine Correspondences in Stereo Vision: Theory, Practice, and Limitations
by: Hajder, Levente
Published: (2026)

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
by: Yu, Songsong, et al.
Published: (2025)