:: Library Catalog

Bibliographic Details
Main Authors:	Miao, Yibo; Lei, Yu; Zhou, Feng; Deng, Zhijie
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.00312

Similar Items

Multimodal Medical Image Classification via Synergistic Learning Pre-training
by: Lin, Qinghua, et al.
Published: (2025)

Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
by: Park, Donwon, et al.
Published: (2024)

Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models
by: Lan, Zhengxing, et al.
Published: (2024)

Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
by: LeCoz, Adrien, et al.
Published: (2024)

Benchmarking the Influence of Pre-training on Explanation Performance in MR Image Classification
by: Oliveira, Marta, et al.
Published: (2023)

SuperCL: Superpixel Guided Contrastive Learning for Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2025)

Siamese Transformer Networks for Few-shot Image Classification
by: Jiang, Weihao, et al.
Published: (2024)

Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training
by: Zhang, Xinsong, et al.
Published: (2025)

A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models
by: Zheng, Haonan, et al.
Published: (2024)

Domain-Specific Pre-training Improves Confidence in Whole Slide Image Classification
by: Chitnis, Soham Rohit, et al.
Published: (2023)

Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset
by: Deng, Ziye, et al.
Published: (2025)

TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps
by: Xie, Qingsong, et al.
Published: (2024)

Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition
by: Lu, Feng, et al.
Published: (2024)

Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
by: Feng, Zhangchi, et al.
Published: (2024)

Pre-training with Random Orthogonal Projection Image Modeling
by: Haghighat, Maryam, et al.
Published: (2023)

Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2023)

Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
by: Qi, Tao, et al.
Published: (2026)

When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification
by: Pang, Zirui, et al.
Published: (2025)

One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training
by: Yu, Jia, et al.
Published: (2025)

PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models
by: Nautiyal, Mayank, et al.
Published: (2025)

DiffRIS: Enhancing Referring Remote Sensing Image Segmentation with Pre-trained Text-to-Image Diffusion Models
by: Dong, Zhe, et al.
Published: (2025)

VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image
by: Hsiao, Teng-Fang, et al.
Published: (2026)

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
by: Wang, Ziyi, et al.
Published: (2025)

Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning
by: Zhang, Zijian, et al.
Published: (2024)

MedFILIP: Medical Fine-grained Language-Image Pre-training
by: Liang, Xinjie, et al.
Published: (2025)

Enhancing Adversarial Transferability in Visual-Language Pre-training Models via Local Shuffle and Sample-based Attack
by: Liu, Xin, et al.
Published: (2025)

Grounded Knowledge-Enhanced Medical Vision-Language Pre-training for Chest X-Ray
by: Deng, Qiao, et al.
Published: (2024)

Augmenting Prototype Network with TransMix for Few-shot Hyperspectral Image Classification
by: Liu, Chun, et al.
Published: (2024)

Libra-MIL: Multimodal Prototypes Stereoscopic Infused with Task-specific Language Priors for Few-shot Whole Slide Image Classification
by: Zhuang, Zhenfeng, et al.
Published: (2025)

Practical Continual Forgetting for Pre-trained Vision Models
by: Zhao, Hongbo, et al.
Published: (2025)

RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
by: Lu, Zhixiu, et al.
Published: (2024)

Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
by: Zheng, Weijie, et al.
Published: (2024)

Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection
by: Ye, Wei, et al.
Published: (2024)

Retrieval-augmented Prompt Learning for Pre-trained Foundation Models
by: Chen, Xiang, et al.
Published: (2025)

A Bayesian Approach to OOD Robustness in Image Classification
by: Kaushik, Prakhar, et al.
Published: (2024)

Zero-shot Building Age Classification from Facade Image Using GPT-4
by: Zeng, Zichao, et al.
Published: (2024)

Demographic User Modeling for Social Robotics with Multimodal Pre-trained Models
by: Rahimi, Hamed, et al.
Published: (2025)

On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)

Improving Pre-trained Segmentation Models using Post-Processing
by: Parida, Abhijeet, et al.
Published: (2025)

Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)