Saved in:
| Main Authors: | Lin, Qingran, Yang, Fengwei, Zhu, Chaolun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.17390 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and Classification
by: Yang, Yang, et al.
Published: (2025)
by: Yang, Yang, et al.
Published: (2025)
Harnessing the Power of Local Representations for Few-Shot Classification
by: Tang, Shi, et al.
Published: (2024)
by: Tang, Shi, et al.
Published: (2024)
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification
by: Habib, Gousia, et al.
Published: (2024)
by: Habib, Gousia, et al.
Published: (2024)
Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image
by: Miao, Qingran, et al.
Published: (2025)
by: Miao, Qingran, et al.
Published: (2025)
Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization
by: Tian, Qingyao, et al.
Published: (2025)
by: Tian, Qingyao, et al.
Published: (2025)
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
by: Wei, Zhixiang, et al.
Published: (2023)
by: Wei, Zhixiang, et al.
Published: (2023)
MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation
by: Zhu, Shenhao, et al.
Published: (2024)
by: Zhu, Shenhao, et al.
Published: (2024)
Selective, Regularized, and Calibrated: Harnessing Vision Foundation Models for Cross-Domain Few-Shot Semantic Segmentation
by: Ma, Junyuan, et al.
Published: (2026)
by: Ma, Junyuan, et al.
Published: (2026)
MambaSeg: Harnessing Mamba for Accurate and Efficient Image-Event Semantic Segmentation
by: Gu, Fuqiang, et al.
Published: (2025)
by: Gu, Fuqiang, et al.
Published: (2025)
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
by: Shi, Yuheng, et al.
Published: (2024)
by: Shi, Yuheng, et al.
Published: (2024)
Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification
by: Li, Sirui, et al.
Published: (2024)
by: Li, Sirui, et al.
Published: (2024)
Multi-Scale Transformer Architecture for Accurate Medical Image Classification
by: Hu, Jiacheng, et al.
Published: (2025)
by: Hu, Jiacheng, et al.
Published: (2025)
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
by: Peng, Wenshuo, et al.
Published: (2024)
by: Peng, Wenshuo, et al.
Published: (2024)
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
by: Yang, Shurong, et al.
Published: (2024)
by: Yang, Shurong, et al.
Published: (2024)
Benchmarking Foundation Models for Mitotic Figure Classification
by: Ammeling, Jonas, et al.
Published: (2025)
by: Ammeling, Jonas, et al.
Published: (2025)
MIRepNet: A Pipeline and Foundation Model for EEG-Based Motor Imagery Classification
by: Liu, Dingkun, et al.
Published: (2025)
by: Liu, Dingkun, et al.
Published: (2025)
Learning from SAM: Harnessing a Foundation Model for Sim2Real Adaptation by Regularization
by: Bonani, Mayara E., et al.
Published: (2023)
by: Bonani, Mayara E., et al.
Published: (2023)
Semantic Smoothing via Novel View Synthesis for Robust SAR Image Classification
by: Brignac, Daniel, et al.
Published: (2026)
by: Brignac, Daniel, et al.
Published: (2026)
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks
by: Li, You, et al.
Published: (2024)
by: Li, You, et al.
Published: (2024)
Harnessing Diffusion-Generated Synthetic Images for Fair Image Classification
by: Basu, Abhipsa, et al.
Published: (2025)
by: Basu, Abhipsa, et al.
Published: (2025)
MixerCA: An Efficient and Accurate Model for High-Performance Hyperspectral Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2026)
by: Alkhatib, Mohammed Q., et al.
Published: (2026)
PathVQ: Reforming Computational Pathology Foundation Model for Whole Slide Image Analysis via Vector Quantization
by: Li, Honglin, et al.
Published: (2025)
by: Li, Honglin, et al.
Published: (2025)
FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAM
by: Wu, Yuchen, et al.
Published: (2025)
by: Wu, Yuchen, et al.
Published: (2025)
Evaluating Vision Foundation Models for Pixel and Object Classification in Microscopy
by: Teuber, Carolin, et al.
Published: (2026)
by: Teuber, Carolin, et al.
Published: (2026)
FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models
by: Zhao, Lin, et al.
Published: (2024)
by: Zhao, Lin, et al.
Published: (2024)
Foundation Models as Class-Incremental Learners for Dermatological Image Classification
by: Elkhayat, Mohamed, et al.
Published: (2025)
by: Elkhayat, Mohamed, et al.
Published: (2025)
Adapting a Segmentation Foundation Model for Medical Image Classification
by: Gu, Pengfei, et al.
Published: (2025)
by: Gu, Pengfei, et al.
Published: (2025)
Harnessing Large Vision and Language Models in Agriculture: A Review
by: Zhu, Hongyan, et al.
Published: (2024)
by: Zhu, Hongyan, et al.
Published: (2024)
Beyond Diagnostic Performance: Revealing and Quantifying Ethical Risks in Pathology Foundation Models
by: Lin, Weiping, et al.
Published: (2025)
by: Lin, Weiping, et al.
Published: (2025)
Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID
by: Tan, Wentao, et al.
Published: (2024)
by: Tan, Wentao, et al.
Published: (2024)
Expert Knowledge-Guided Decision Calibration for Accurate Fine-Grained Tree Species Classification
by: Long, Chen, et al.
Published: (2026)
by: Long, Chen, et al.
Published: (2026)
PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis
by: Song, Meiyue, et al.
Published: (2023)
by: Song, Meiyue, et al.
Published: (2023)
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
by: Yang, Xiangpeng, et al.
Published: (2024)
by: Yang, Xiangpeng, et al.
Published: (2024)
Debiased Noise Editing on Foundation Models for Fair Medical Image Classification
by: Jin, Ruinan, et al.
Published: (2024)
by: Jin, Ruinan, et al.
Published: (2024)
Fusion of Foundation and Vision Transformer Model Features for Dermatoscopic Image Classification
by: Mahbod, Amirreza, et al.
Published: (2025)
by: Mahbod, Amirreza, et al.
Published: (2025)
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
by: Yuan, Tianyu, et al.
Published: (2025)
by: Yuan, Tianyu, et al.
Published: (2025)
PL-FSCIL: Harnessing the Power of Prompts for Few-Shot Class-Incremental Learning
by: Tian, Songsong, et al.
Published: (2024)
by: Tian, Songsong, et al.
Published: (2024)
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts
by: Zhou, Zhen, et al.
Published: (2024)
by: Zhou, Zhen, et al.
Published: (2024)
CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic
by: Sun, Yuxuan, et al.
Published: (2025)
by: Sun, Yuxuan, et al.
Published: (2025)
GeoSurDepth: Harnessing Foundation Model for Spatial Geometry Consistency-Oriented Self-Supervised Surround-View Depth Estimation
by: Liu, Weimin, et al.
Published: (2026)
by: Liu, Weimin, et al.
Published: (2026)
Similar Items
-
CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and Classification
by: Yang, Yang, et al.
Published: (2025) -
Harnessing the Power of Local Representations for Few-Shot Classification
by: Tang, Shi, et al.
Published: (2024) -
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification
by: Habib, Gousia, et al.
Published: (2024) -
Print2Volume: Generating Synthetic OCT-based 3D Fingerprint Volume from 2D Fingerprint Image
by: Miao, Qingran, et al.
Published: (2025) -
Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization
by: Tian, Qingyao, et al.
Published: (2025)