:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zheng, Hao, Yang, Shunzhi, He, Zhuoxin, Yang, Jinfeng, Huang, Zhenhua
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.14976
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Multi-modal Attribute Prompting for Vision-Language Models
by: Liu, Xin, et al.
Published: (2024)

Multi-modal Mutual-Guidance Conditional Prompt Learning for Vision-Language Models
by: Yang, Shijun, et al.
Published: (2025)

Cross-modal Prompting for Balanced Incomplete Multi-modal Emotion Recognition
by: He, Wen-Jue, et al.
Published: (2025)

Cross-modal Proxy Evolving for OOD Detection with Vision-Language Models
by: Tang, Hao, et al.
Published: (2026)

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model
by: Ma, Shuailei, et al.
Published: (2023)

Cascade Prompt Learning for Vision-Language Model Adaptation
by: Wu, Ge, et al.
Published: (2024)

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models
by: Li, Zheng, et al.
Published: (2024)

PathoHR: Hierarchical Reasoning for Vision-Language Models in Pathology
by: Huang, Yating, et al.
Published: (2025)

Quantized Prompt for Efficient Generalization of Vision-Language Models
by: Hao, Tianxiang, et al.
Published: (2024)

A Cross-Modal Prompt Injection Attack against Large Vision-Language Models with Image-Only Perturbation
by: Yang, Hao, et al.
Published: (2026)

InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models
by: Zhou, Shuchang, et al.
Published: (2025)

Modular Prompt Learning Improves Vision-Language Models
by: Huang, Zhenhan, et al.
Published: (2025)

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
by: Liu, Haowei, et al.
Published: (2024)

Hierarchical Dual-Subspace Decoupling for Continual Learning in Vision-Language Models
by: Qin, Mengxin, et al.
Published: (2026)

RESTORE: Towards Feature Shift for Vision-Language Prompt Learning
by: Yang, Yuncheng, et al.
Published: (2024)

Concept-Guided Prompt Learning for Generalization in Vision-Language Models
by: Zhang, Yi, et al.
Published: (2024)

Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models
by: Jang, Young Kyun, et al.
Published: (2024)

Deep Reversible Consistency Learning for Cross-modal Retrieval
by: Pu, Ruitao, et al.
Published: (2025)

CAST: Cross-modal Alignment Similarity Test for Vision Language Models
by: Dagan, Gautier, et al.
Published: (2024)

Revisiting Prompt Pretraining of Vision-Language Models
by: Chen, Zhenyuan, et al.
Published: (2024)

Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models
by: Jia, Xiaojun, et al.
Published: (2025)

MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model
by: Wang, Xinyang, et al.
Published: (2024)

PromptEcho: Annotation-Free Reward from Vision-Language Models for Text-to-Image Reinforcement Learning
by: Liu, Jinlong, et al.
Published: (2026)

Active Prompt Learning in Vision Language Models
by: Bang, Jihwan, et al.
Published: (2023)

In the Era of Prompt Learning with Vision-Language Models
by: Jha, Ankit
Published: (2024)

Mixture of Prompt Learning for Vision Language Models
by: Du, Yu, et al.
Published: (2024)

Cross-modal Associations in Vision and Language Models: Revisiting the Bouba-Kiki Effect
by: Kouwenhoven, Tom, et al.
Published: (2025)

Debiased Prompt Tuning in Vision-Language Model without Annotations
by: Jiang, Chaoquan, et al.
Published: (2025)

Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
by: Hong, Haodong, et al.
Published: (2024)

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models
by: Zhang, Yudong, et al.
Published: (2024)

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation
by: Zheng, Xu, et al.
Published: (2024)

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection
by: Lu, Xinhua, et al.
Published: (2025)

Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models
by: Liang, Qiao, et al.
Published: (2025)

SpecVLM: Fast Speculative Decoding in Vision-Language Models
by: Huang, Haiduo, et al.
Published: (2025)

ArGue: Attribute-Guided Prompt Tuning for Vision-Language Models
by: Tian, Xinyu, et al.
Published: (2023)

Integrated Structural Prompt Learning for Vision-Language Models
by: Wang, Jiahui, et al.
Published: (2025)

Active Prompt Learning with Vision-Language Model Priors
by: Kim, Hoyoung, et al.
Published: (2024)

Consistency-guided Prompt Learning for Vision-Language Models
by: Roy, Shuvendu, et al.
Published: (2023)

IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting
by: Fu, Hao, et al.
Published: (2025)

Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)