| Main Authors: | Li, Chang, Peng, Xingtao |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.05949 |
Similar Items
The Sampling-Gaussian for stereo matching
by: Pan, Baiyu, et al.
Published: (2024)
An evaluation of Deep Learning based stereo dense matching dataset shift from aerial images and a large scale stereo dataset
by: Wu, Teng, et al.
Published: (2024)
Efficient stereo matching on embedded GPUs with zero-means cross correlation
by: Chang, Qiong, et al.
Published: (2022)
StereoVAE: A lightweight stereo-matching system using embedded GPUs
by: Chang, Qiong, et al.
Published: (2023)
YASMOT: Yet another stereo image multi-object tracker
by: Malde, Ketil
Published: (2025)
Multi-scale interaction network for stereo image super-resolution
by: Xu, Liyi, et al.
Published: (2026)
Transformer-based stereo-aware 3D object detection from binocular images
by: Sun, Hanqing, et al.
Published: (2023)
Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration
by: Rivas-Villar, David, et al.
Published: (2025)
Analysis of different disparity estimation techniques on aerial stereo image datasets
by: Narayan, Ishan, et al.
Published: (2024)
Restereo: Diffusion stereo video generation and restoration
by: Huang, Xingchang, et al.
Published: (2025)
Detail-aware multi-view stereo network for depth estimation
by: Tian, Haitao, et al.
Published: (2025)
U$^{2}$Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation
by: Sun, Xunpei, et al.
Published: (2026)
Improving Cross-view Object Geo-localization: A Dual Attention Approach with Cross-view Interaction and Multi-Scale Spatial Features
by: Zhu, Xingtao Ling Yingying
Published: (2025)
Comprehensive language-image pre-training for 3D medical image understanding
by: Wald, Tassilo, et al.
Published: (2025)
Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT
by: Sengupta, Saurav, et al.
Published: (2023)
CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation
by: Liu, Chenying, et al.
Published: (2024)
Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images
by: Di Via, Roberto, et al.
Published: (2024)
Fast constrained sampling in pre-trained diffusion models
by: Graikos, Alexandros, et al.
Published: (2024)
Positioning radiata pine branches requiring pruning by drone stereo vision
by: Lin, Yida, et al.
Published: (2026)
Online pre-training with long-form videos
by: Kato, Itsuki, et al.
Published: (2024)
A generalised pre-training strategy for deep learning networks in semantic segmentation of remotely sensed images
by: Fang, Yuan, et al.
Published: (2026)
Polygon-mamba: Retinal vessel segmentation using polygon scanning mamba and space-frequency collaborative attention
by: Peng, Yuanyuan, et al.
Published: (2026)
Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery
by: Huynh, Andy V., et al.
Published: (2024)
Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models
by: Yang, Yifan, et al.
Published: (2025)
Joint stereo 3D object detection and implicit surface reconstruction
by: Li, Shichao, et al.
Published: (2021)
Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association
by: Ling, Xingtao, et al.
Published: (2025)
RouteWinFormer: A Route-Window Transformer for Middle-range Attention in Image Restoration
by: Li, Qifan, et al.
Published: (2025)
Anatomical grounding pre-training for medical phrase grounding
by: Zhang, Wenjun, et al.
Published: (2025)
From pre-training to downstream performance: Does domain-specific pre-training make sense?
by: Krones, Felix
Published: (2026)
Generalized Denoising Diffusion Codebook Models (gDDCM): Tokenizing images using a pre-trained diffusion model
by: Kong, Fei
Published: (2025)
Aligned Unsupervised Pretraining of Object Detectors with Self-training
by: Metaxas, Ioannis Maniadis, et al.
Published: (2023)
Improving fine-grained understanding in image-text pre-training
by: Bica, Ioana, et al.
Published: (2024)
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
by: Zhao, Tiancheng, et al.
Published: (2022)
StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN
by: Bedychaj, Andrzej, et al.
Published: (2024)
TriDP-PTM: a three-stage distortion-perception tradeoff guides the pre-training model for radar cardiac sensing
by: Li, Jinye, et al.
Published: (2026)
Anatomically-guided masked autoencoder pre-training for aneurysm detection
by: Ceballos-Arroyo, Alberto Mario, et al.
Published: (2025)
MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging
by: Zhou, Jiaying, et al.
Published: (2024)
LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation
by: Wang, Qiyuan, et al.
Published: (2024)
Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation
by: Zhang, Dingwen, et al.
Published: (2024)
DIP: Unsupervised Dense In-Context Post-training of Visual Representations
by: Sirko-Galouchenko, Sophia, et al.
Published: (2025)