:: Library Catalog

Bibliographic Details
Main Authors:	Lin, Qingran; Yang, Fengwei; Zhu, Chaolun
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.17390

Similar Items

CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and Classification
by: Yang, Yang, et al.
Published: (2025)

Harnessing the Power of Local Representations for Few-Shot Classification
by: Tang, Shi, et al.
Published: (2024)

Harnessing The Power of Attention For Patch-Based Biomedical Image Classification
by: Habib, Gousia, et al.
Published: (2024)

Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image
by: Miao, Qingran, et al.
Published: (2025)

Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization
by: Tian, Qingyao, et al.
Published: (2025)

Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
by: Wei, Zhixiang, et al.
Published: (2023)

MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation
by: Zhu, Shenhao, et al.
Published: (2024)

Selective, Regularized, and Calibrated: Harnessing Vision Foundation Models for Cross-Domain Few-Shot Semantic Segmentation
by: Ma, Junyuan, et al.
Published: (2026)

MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation
by: Gu, Fuqiang, et al.
Published: (2025)

Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
by: Shi, Yuheng, et al.
Published: (2024)

Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification
by: Li, Sirui, et al.
Published: (2024)

Multi-Scale Transformer Architecture for Accurate Medical Image Classification
by: Hu, Jiacheng, et al.
Published: (2025)

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
by: Peng, Wenshuo, et al.
Published: (2024)

MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
by: Yang, Shurong, et al.
Published: (2024)

Benchmarking Foundation Models for Mitotic Figure Classification
by: Ammeling, Jonas, et al.
Published: (2025)

MIRepNet: A Pipeline and Foundation Model for EEG-Based Motor Imagery Classification
by: Liu, Dingkun, et al.
Published: (2025)

Learning from SAM: Harnessing a Foundation Model for Sim2Real Adaptation by Regularization
by: Bonani, Mayara E., et al.
Published: (2023)

Semantic Smoothing via Novel View Synthesis for Robust SAR Image Classification
by: Brignac, Daniel, et al.
Published: (2026)

AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks
by: Li, You, et al.
Published: (2024)

Harnessing Diffusion-Generated Synthetic Images for Fair Image Classification
by: Basu, Abhipsa, et al.
Published: (2025)

MixerCA: An Efficient and Accurate Model for High-Performance Hyperspectral Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2026)

PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization
by: Li, Honglin, et al.
Published: (2025)

FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAM
by: Wu, Yuchen, et al.
Published: (2025)

Evaluating Vision Foundation Models for Pixel and Object Classification in Microscopy
by: Teuber, Carolin, et al.
Published: (2026)

FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models
by: Zhao, Lin, et al.
Published: (2024)

Foundation Models as Class-Incremental Learners for Dermatological Image Classification
by: Elkhayat, Mohamed, et al.
Published: (2025)

Adapting a Segmentation Foundation Model for Medical Image Classification
by: Gu, Pengfei, et al.
Published: (2025)

Harnessing Large Vision and Language Models in Agriculture: A Review
by: Zhu, Hongyan, et al.
Published: (2024)

Beyond Diagnostic Performance: Revealing and Quantifying Ethical Risks in Pathology Foundation Models
by: Lin, Weiping, et al.
Published: (2025)

Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID
by: Tan, Wentao, et al.
Published: (2024)

Expert Knowledge-Guided Decision Calibration for Accurate Fine-Grained Tree Species Classification
by: Long, Chen, et al.
Published: (2026)

PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis
by: Song, Meiyue, et al.
Published: (2023)

EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
by: Yang, Xiangpeng, et al.
Published: (2024)

Debiased Noise Editing on Foundation Models for Fair Medical Image Classification
by: Jin, Ruinan, et al.
Published: (2024)

Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification
by: Mahbod, Amirreza, et al.
Published: (2025)

Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
by: Yuan, Tianyu, et al.
Published: (2025)

PL-FSCIL: Harnessing the Power of Prompts for Few-Shot Class-Incremental Learning
by: Tian, Songsong, et al.
Published: (2024)

OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts
by: Zhou, Zhen, et al.
Published: (2024)

CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic
by: Sun, Yuxuan, et al.
Published: (2025)

GeoSurDepth: Harnessing Foundation Model for Spatial Geometry Consistency-Oriented Self-Supervised Surround-View Depth Estimation
by: Liu, Weimin, et al.
Published: (2026)