Saved in:
| Main Authors: | Shivika, Bose, Kartik, Gupta, Pankaj |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.13561 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation
by: Stangl, Kevin, et al.
Published: (2024)
by: Stangl, Kevin, et al.
Published: (2024)
Transductive Zero-Shot and Few-Shot CLIP
by: Martin, Ségolène, et al.
Published: (2024)
by: Martin, Ségolène, et al.
Published: (2024)
Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
by: Kwon, Jihoon, et al.
Published: (2025)
by: Kwon, Jihoon, et al.
Published: (2025)
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
by: Qiu, Longtian, et al.
Published: (2024)
by: Qiu, Longtian, et al.
Published: (2024)
ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection
by: Yang, Ziteng, et al.
Published: (2025)
by: Yang, Ziteng, et al.
Published: (2025)
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion
by: Allgeuer, Philipp, et al.
Published: (2024)
by: Allgeuer, Philipp, et al.
Published: (2024)
Semantic Relation-Enhanced CLIP Adapter for Domain Adaptive Zero-Shot Learning
by: Yu, Jiaao, et al.
Published: (2025)
by: Yu, Jiaao, et al.
Published: (2025)
Learning Primitive Relations for Compositional Zero-Shot Learning
by: Lee, Insu, et al.
Published: (2025)
by: Lee, Insu, et al.
Published: (2025)
ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation
by: Li, Shengze, et al.
Published: (2024)
by: Li, Shengze, et al.
Published: (2024)
Prompt-Based Continual Compositional Zero-Shot Learning
by: Maryam, Sauda, et al.
Published: (2025)
by: Maryam, Sauda, et al.
Published: (2025)
ComCLIP: Training-Free Compositional Image and Text Matching
by: Jiang, Kenan, et al.
Published: (2022)
by: Jiang, Kenan, et al.
Published: (2022)
Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP)
by: Gundavarapu, Saaketh Koundinya, et al.
Published: (2024)
by: Gundavarapu, Saaketh Koundinya, et al.
Published: (2024)
LAGO: Language-Guided Adaptive Object-Region Focus for Zero-Shot Visual-Text Alignment
by: Hu, Junyi, et al.
Published: (2026)
by: Hu, Junyi, et al.
Published: (2026)
Few-Shot Remote Sensing Image Scene Classification with CLIP and Prompt Learning
by: Dimitrovski, Ivica, et al.
Published: (2025)
by: Dimitrovski, Ivica, et al.
Published: (2025)
Topology-Aware CLIP Few-Shot Learning
by: Huang, Dazhi
Published: (2025)
by: Huang, Dazhi
Published: (2025)
Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification
by: Xu, Wenjia, et al.
Published: (2024)
by: Xu, Wenjia, et al.
Published: (2024)
FB-CLIP: Fine-Grained Zero-Shot Anomaly Detection with Foreground-Background Disentanglement
by: Hu, Ming, et al.
Published: (2026)
by: Hu, Ming, et al.
Published: (2026)
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
by: Zhang, Mingkun, et al.
Published: (2025)
by: Zhang, Mingkun, et al.
Published: (2025)
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning
by: Yan, Xudong, et al.
Published: (2024)
by: Yan, Xudong, et al.
Published: (2024)
CT-IDP: Segmentation-Derived Quantitative Phenotypes for Interpretable Abdominal CT Disease Classification
by: Dahal, Lavsen, et al.
Published: (2026)
by: Dahal, Lavsen, et al.
Published: (2026)
Hybrid Discriminative Attribute-Object Embedding Network for Compositional Zero-Shot Learning
by: Liu, Yang, et al.
Published: (2024)
by: Liu, Yang, et al.
Published: (2024)
Self-Attention with State-Object Weighted Combination for Compositional Zero Shot Learning
by: Chang, Cheng-Hong, et al.
Published: (2025)
by: Chang, Cheng-Hong, et al.
Published: (2025)
Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss
by: Shipard, Jordan, et al.
Published: (2024)
by: Shipard, Jordan, et al.
Published: (2024)
TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability
by: Ma, Fengji, et al.
Published: (2024)
by: Ma, Fengji, et al.
Published: (2024)
Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
by: Che, Chang, et al.
Published: (2024)
by: Che, Chang, et al.
Published: (2024)
Interpreting CLIP's Image Representation via Text-Based Decomposition
by: Gandelsman, Yossi, et al.
Published: (2023)
by: Gandelsman, Yossi, et al.
Published: (2023)
CompDiff: Hierarchical Compositional Diffusion for Fair and Zero-Shot Intersectional Medical Image Generation
by: Ibrahim, Mahmoud, et al.
Published: (2026)
by: Ibrahim, Mahmoud, et al.
Published: (2026)
DGTRSD & DGTRS-CLIP: A Dual-Granularity Remote Sensing Image-Text Dataset and Vision Language Foundation Model for Alignment
by: Chen, Weizhi, et al.
Published: (2025)
by: Chen, Weizhi, et al.
Published: (2025)
ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection
by: Ma, Ke, et al.
Published: (2025)
by: Ma, Ke, et al.
Published: (2025)
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
by: Metzen, Jan Hendrik, et al.
Published: (2023)
by: Metzen, Jan Hendrik, et al.
Published: (2023)
Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS's LLM-CLIP Framework for Image Captioning
by: Benhammou, Yassir, et al.
Published: (2025)
by: Benhammou, Yassir, et al.
Published: (2025)
Finding Dino: A Plug-and-Play Framework for Zero-Shot Detection of Out-of-Distribution Objects Using Prototypes
by: Sinhamahapatra, Poulami, et al.
Published: (2024)
by: Sinhamahapatra, Poulami, et al.
Published: (2024)
Decoupling Endpoint and Semantic Transition Learning for Zero-Shot Composed Image Retrieval
by: Liu, Mingyu, et al.
Published: (2026)
by: Liu, Mingyu, et al.
Published: (2026)
Pseudo-label Based Domain Adaptation for Zero-Shot Text Steganalysis
by: Luo, Yufei, et al.
Published: (2024)
by: Luo, Yufei, et al.
Published: (2024)
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
by: Ma, Wenxin, et al.
Published: (2025)
by: Ma, Wenxin, et al.
Published: (2025)
SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval
by: Jawade, Bhavin, et al.
Published: (2025)
by: Jawade, Bhavin, et al.
Published: (2025)
SalientFusion: Context-Aware Compositional Zero-Shot Food Recognition
by: Song, Jiajun, et al.
Published: (2025)
by: Song, Jiajun, et al.
Published: (2025)
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding
by: Xu, Wenhao, et al.
Published: (2024)
by: Xu, Wenhao, et al.
Published: (2024)
Efficient Zero-Shot AI-Generated Image Detection
by: Sonoda, Ryosuke, et al.
Published: (2026)
by: Sonoda, Ryosuke, et al.
Published: (2026)
CLIP Unreasonable Potential in Single-Shot Face Recognition
by: Luu, Nhan T.
Published: (2024)
by: Luu, Nhan T.
Published: (2024)
Similar Items
-
Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation
by: Stangl, Kevin, et al.
Published: (2024) -
Transductive Zero-Shot and Few-Shot CLIP
by: Martin, Ségolène, et al.
Published: (2024) -
Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
by: Kwon, Jihoon, et al.
Published: (2025) -
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
by: Qiu, Longtian, et al.
Published: (2024) -
ViP$^2$-CLIP: Visual-Perception Prompting with Unified Alignment for Zero-Shot Anomaly Detection
by: Yang, Ziteng, et al.
Published: (2025)