:: Library Catalog

Bibliographic Details
Main Authors:	Li, Chang; Peng, Xingtao
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.05949

Similar Items

The Sampling-Gaussian for stereo matching
by: Pan, Baiyu, et al.
Published: (2024)

An evaluation of Deep Learning based stereo dense matching dataset shift from aerial images and a large scale stereo dataset
by: Wu, Teng, et al.
Published: (2024)

Efficient stereo matching on embedded GPUs with zero-means cross correlation
by: Chang, Qiong, et al.
Published: (2022)

StereoVAE: A lightweight stereo-matching system using embedded GPUs
by: Chang, Qiong, et al.
Published: (2023)

YASMOT: Yet another stereo image multi-object tracker
by: Malde, Ketil
Published: (2025)

Multi-scale interaction network for stereo image super-resolution
by: Xu, Liyi, et al.
Published: (2026)

Transformer-based stereo-aware 3D object detection from binocular images
by: Sun, Hanqing, et al.
Published: (2023)

Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration
by: Rivas-Villar, David, et al.
Published: (2025)

Analysis of different disparity estimation techniques on aerial stereo image datasets
by: Narayan, Ishan, et al.
Published: (2024)

Restereo: Diffusion stereo video generation and restoration
by: Huang, Xingchang, et al.
Published: (2025)

Detail-aware multi-view stereo network for depth estimation
by: Tian, Haitao, et al.
Published: (2025)

U$^{2}$Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation
by: Sun, Xunpei, et al.
Published: (2026)

Improving Cross-view Object Geo-localization: A Dual Attention Approach with Cross-view Interaction and Multi-Scale Spatial Features
by: Ling, Xingtao, et al.
Published: (2025)

Comprehensive language-image pre-training for 3D medical image understanding
by: Wald, Tassilo, et al.
Published: (2025)

Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT
by: Sengupta, Saurav, et al.
Published: (2023)

CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation
by: Liu, Chenying, et al.
Published: (2024)

Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images
by: Di Via, Roberto, et al.
Published: (2024)

Fast constrained sampling in pre-trained diffusion models
by: Graikos, Alexandros, et al.
Published: (2024)

Positioning radiata pine branches requiring pruning by drone stereo vision
by: Lin, Yida, et al.
Published: (2026)

Online pre-training with long-form videos
by: Kato, Itsuki, et al.
Published: (2024)

A generalised pre-training strategy for deep learning networks in semantic segmentation of remotely sensed images
by: Fang, Yuan, et al.
Published: (2026)

Polygon-mamba: Retinal vessel segmentation using polygon scanning mamba and space-frequency collaborative attention
by: Peng, Yuanyuan, et al.
Published: (2026)

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery
by: Huynh, Andy V., et al.
Published: (2024)

Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models
by: Yang, Yifan, et al.
Published: (2025)

Joint stereo 3D object detection and implicit surface reconstruction
by: Li, Shichao, et al.
Published: (2021)

Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
by: Ling, Xingtao, et al.
Published: (2025)

RouteWinFormer: A Route-Window Transformer for Middle-range Attention in Image Restoration
by: Li, Qifan, et al.
Published: (2025)

Anatomical grounding pre-training for medical phrase grounding
by: Zhang, Wenjun, et al.
Published: (2025)

From pre-training to downstream performance: Does domain-specific pre-training make sense?
by: Krones, Felix
Published: (2026)

Generalized Denoising Diffusion Codebook Models (gDDCM): Tokenizing images using a pre-trained diffusion model
by: Kong, Fei
Published: (2025)

Aligned Unsupervised Pretraining of Object Detectors with Self-training
by: Metaxas, Ioannis Maniadis, et al.
Published: (2023)

Improving fine-grained understanding in image-text pre-training
by: Bica, Ioana, et al.
Published: (2024)

OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
by: Zhao, Tiancheng, et al.
Published: (2022)

StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN
by: Bedychaj, Andrzej, et al.
Published: (2024)

TriDP-PTM: a three-stage distortion-perception tradeoff guides the pre-training model for radar cardiac sensing
by: Li, Jinye, et al.
Published: (2026)

Anatomically-guided masked autoencoder pre-training for aneurysm detection
by: Ceballos-Arroyo, Alberto Mario, et al.
Published: (2025)

MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging
by: Zhou, Jiaying, et al.
Published: (2024)

LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation
by: Wang, Qiyuan, et al.
Published: (2024)

Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
by: Zhang, Dingwen, et al.
Published: (2024)

DIP: Unsupervised Dense In-Context Post-training of Visual Representations
by: Sirko-Galouchenko, Sophia, et al.
Published: (2025)