| Main Authors: | Miao, Yibo, Lei, Yu, Zhou, Feng, Deng, Zhijie |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00312 |
Similar Items
Multimodal Medical Image Classification via Synergistic Learning Pre-training
by: Lin, Qinghua, et al.
Published: (2025)
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration
by: Park, Donwon, et al.
Published: (2024)
Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models
by: Lan, Zhengxing, et al.
Published: (2024)
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
by: LeCoz, Adrien, et al.
Published: (2024)
Benchmarking the Influence of Pre-training on Explanation Performance in MR Image Classification
by: Oliveira, Marta, et al.
Published: (2023)
SuperCL: Superpixel Guided Contrastive Learning for Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2025)
Siamese Transformer Networks for Few-shot Image Classification
by: Jiang, Weihao, et al.
Published: (2024)
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training
by: Zhang, Xinsong, et al.
Published: (2025)
A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models
by: Zheng, Haonan, et al.
Published: (2024)
Domain-Specific Pre-training Improves Confidence in Whole Slide Image Classification
by: Chitnis, Soham Rohit, et al.
Published: (2023)
Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset
by: Deng, Ziye, et al.
Published: (2025)
TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps
by: Xie, Qingsong, et al.
Published: (2024)
Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition
by: Lu, Feng, et al.
Published: (2024)
Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
by: Feng, Zhangchi, et al.
Published: (2024)
Pre-training with Random Orthogonal Projection Image Modeling
by: Haghighat, Maryam, et al.
Published: (2023)
Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training
by: Zeng, Shuang, et al.
Published: (2023)
Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
by: Qi, Tao, et al.
Published: (2026)
When VLMs Meet Image Classification: Test Sets Renovation via Missing Label Identification
by: Pang, Zirui, et al.
Published: (2025)
One-shot synthesis of rare gastrointestinal lesions improves diagnostic accuracy and clinical training
by: Yu, Jia, et al.
Published: (2025)
PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models
by: Nautiyal, Mayank, et al.
Published: (2025)
DiffRIS: Enhancing Referring Remote Sensing Image Segmentation with Pre-trained Text-to-Image Diffusion Models
by: Dong, Zhe, et al.
Published: (2025)
VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image
by: Hsiao, Teng-Fang, et al.
Published: (2026)
UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
by: Wang, Ziyi, et al.
Published: (2025)
Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning
by: Zhang, Zijian, et al.
Published: (2024)
MedFILIP: Medical Fine-grained Language-Image Pre-training
by: Liang, Xinjie, et al.
Published: (2025)
Enhancing Adversarial Transferability in Visual-Language Pre-training Models via Local Shuffle and Sample-based Attack
by: Liu, Xin, et al.
Published: (2025)
Grounded Knowledge-Enhanced Medical Vision-Language Pre-training for Chest X-Ray
by: Deng, Qiao, et al.
Published: (2024)
Augmenting Prototype Network with TransMix for Few-shot Hyperspectral Image Classification
by: Liu, Chun, et al.
Published: (2024)
Libra-MIL: Multimodal Prototypes Stereoscopic Infused with Task-specific Language Priors for Few-shot Whole Slide Image Classification
by: Zhuang, Zhenfeng, et al.
Published: (2025)
Practical Continual Forgetting for Pre-trained Vision Models
by: Zhao, Hongbo, et al.
Published: (2025)
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
by: Lu, Zhixiu, et al.
Published: (2024)
Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
by: Zheng, Weijie, et al.
Published: (2024)
Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection
by: Ye, Wei, et al.
Published: (2024)
Retrieval-augmented Prompt Learning for Pre-trained Foundation Models
by: Chen, Xiang, et al.
Published: (2025)
A Bayesian Approach to OOD Robustness in Image Classification
by: Kaushik, Prakhar, et al.
Published: (2024)
Zero-shot Building Age Classification from Facade Image Using GPT-4
by: Zeng, Zichao, et al.
Published: (2024)
Demographic User Modeling for Social Robotics with Multimodal Pre-trained Models
by: Rahimi, Hamed, et al.
Published: (2025)
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
Improving Pre-trained Segmentation Models using Post-Processing
by: Parida, Abhijeet, et al.
Published: (2025)
Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)