:: Library Catalog

Bibliographic Details
Main Authors:	Shivika; Bose, Kartik; Gupta, Pankaj
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.13561

Similar Items

Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation
by: Stangl, Kevin, et al.
Published: (2024)

Transductive Zero-Shot and Few-Shot CLIP
by: Martin, Ségolène, et al.
Published: (2024)

Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
by: Kwon, Jihoon, et al.
Published: (2025)

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
by: Qiu, Longtian, et al.
Published: (2024)

ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection
by: Yang, Ziteng, et al.
Published: (2025)

Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion
by: Allgeuer, Philipp, et al.
Published: (2024)

Semantic Relation-Enhanced CLIP Adapter for Domain Adaptive Zero-Shot Learning
by: Yu, Jiaao, et al.
Published: (2025)

Learning Primitive Relations for Compositional Zero-Shot Learning
by: Lee, Insu, et al.
Published: (2025)

ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation
by: Li, Shengze, et al.
Published: (2024)

Prompt-Based Continual Compositional Zero-Shot Learning
by: Maryam, Sauda, et al.
Published: (2025)

ComCLIP: Training-Free Compositional Image and Text Matching
by: Jiang, Kenan, et al.
Published: (2022)

Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP)
by: Gundavarapu, Saaketh Koundinya, et al.
Published: (2024)

LAGO: Language-Guided Adaptive Object-Region Focus for Zero-Shot Visual-Text Alignment
by: Hu, Junyi, et al.
Published: (2026)

Few-Shot Remote Sensing Image Scene Classification with CLIP and Prompt Learning
by: Dimitrovski, Ivica, et al.
Published: (2025)

Topology-Aware CLIP Few-Shot Learning
by: Huang, Dazhi
Published: (2025)

Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification
by: Xu, Wenjia, et al.
Published: (2024)

FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
by: Hu, Ming, et al.
Published: (2026)

CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
by: Zhang, Mingkun, et al.
Published: (2025)

Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning
by: Yan, Xudong, et al.
Published: (2024)

CT-IDP: Segmentation-Derived Quantitative Phenotypes for Interpretable Abdominal CT Disease Classification
by: Dahal, Lavsen, et al.
Published: (2026)

Hybrid Discriminative Attribute-Object Embedding Network for Compositional Zero-Shot Learning
by: Liu, Yang, et al.
Published: (2024)

Self-Attention with State-Object Weighted Combination for Compositional Zero Shot Learning
by: Chang, Cheng-Hong, et al.
Published: (2025)

Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss
by: Shipard, Jordan, et al.
Published: (2024)

TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability
by: Ma, Fengji, et al.
Published: (2024)

Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
by: Che, Chang, et al.
Published: (2024)

Interpreting CLIP's Image Representation via Text-Based Decomposition
by: Gandelsman, Yossi, et al.
Published: (2023)

CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation
by: Ibrahim, Mahmoud, et al.
Published: (2026)

DGTRSD & DGTRS-CLIP: A Dual-Granularity Remote Sensing Image-Text Dataset and Vision Language Foundation Model for Alignment
by: Chen, Weizhi, et al.
Published: (2025)

ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection
by: Ma, Ke, et al.
Published: (2025)

AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
by: Metzen, Jan Hendrik, et al.
Published: (2023)

Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS's LLM-CLIP Framework for Image Captioning
by: Benhammou, Yassir, et al.
Published: (2025)

Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
by: Sinhamahapatra, Poulami, et al.
Published: (2024)

Decoupling Endpoint and Semantic Transition Learning for Zero-Shot Composed Image Retrieval
by: Liu, Mingyu, et al.
Published: (2026)

Pseudo-label Based Domain Adaptation for Zero-Shot Text Steganalysis
by: Luo, Yufei, et al.
Published: (2024)

AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
by: Ma, Wenxin, et al.
Published: (2025)

SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval
by: Jawade, Bhavin, et al.
Published: (2025)

SalientFusion: Context-Aware Compositional Zero-Shot Food Recognition
by: Song, Jiajun, et al.
Published: (2025)

CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding
by: Xu, Wenhao, et al.
Published: (2024)

Efficient Zero-Shot AI-Generated Image Detection
by: Sonoda, Ryosuke, et al.
Published: (2026)

CLIP Unreasonable Potential in Single-Shot Face Recognition
by: Luu, Nhan T.
Published: (2024)