| Main Authors: | Han, Seungdae, Kim, Joohee |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.14944 |
Similar Items
ComCLIP: Training-Free Compositional Image and Text Matching
by: Jiang, Kenan, et al.
Published: (2022)
Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP
by: Li, Yayuan, et al.
Published: (2024)
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
by: Kim, Seoyeon, et al.
Published: (2023)
Long-CLIP: Unlocking the Long-Text Capability of CLIP
by: Zhang, Beichen, et al.
Published: (2024)
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
by: Bai, Sule, et al.
Published: (2024)
ABE-CLIP: Training-Free Attribute Binding Enhancement for Compositional Image-Text Matching
by: Zhang, Qi, et al.
Published: (2025)
VTD-CLIP: Video-to-Text Discretization via Prompting CLIP
by: Zhu, Wencheng, et al.
Published: (2025)
CLIP-IT: CLIP-based Pairing for Histology Images Classification
by: Karimian, Banafsheh, et al.
Published: (2025)
Raising the Bar of AI-generated Image Detection with CLIP
by: Cozzolino, Davide, et al.
Published: (2023)
Text-to-Image Generation Via Energy-Based CLIP
by: Ganz, Roy, et al.
Published: (2024)
DetailCLIP: Injecting Image Details into CLIP's Feature Space
by: Zhang, Zilun, et al.
Published: (2022)
Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection
by: De Rosa, Vincenzo, et al.
Published: (2024)
TiC-CLIP: Continual Training of CLIP Models
by: Garg, Saurabh, et al.
Published: (2023)
CLIP-KD: An Empirical Study of CLIP Model Distillation
by: Yang, Chuanguang, et al.
Published: (2023)
IPAD-CLIP: Teaching CLIP to Detect Image Local Perceptual Artifacts
by: Wang, Juan, et al.
Published: (2026)
HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets and CLIP Models
by: Wei, Zhixiang, et al.
Published: (2025)
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias
by: Jo, Sanghyun, et al.
Published: (2024)
CLIP in Medical Imaging: A Survey
by: Zhao, Zihao, et al.
Published: (2023)
A Training-Free Framework for Open-Vocabulary Image Segmentation and Recognition with EfficientNet and CLIP
by: Dai, Ying, et al.
Published: (2025)
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation
by: Shao, Tong, et al.
Published: (2024)
CLIP-Guided Source-Free Object Detection in Aerial Images
by: Liu, Nanqing, et al.
Published: (2024)
PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space
by: Miya, Ryutaro, et al.
Published: (2026)
MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection
by: Zhang, Ximiao, et al.
Published: (2024)
CLIP-AGIQA: Boosting the Performance of AI-Generated Image Quality Assessment with CLIP
by: Tang, Zhenchen, et al.
Published: (2024)
Extract Free Dense Misalignment from CLIP
by: Nam, JeongYeon, et al.
Published: (2024)
GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection
by: Kim, Donghyeong, et al.
Published: (2025)
SuperCLIP: CLIP with Simple Classification Supervision
by: Zhao, Weiheng, et al.
Published: (2025)
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
by: Zhang, Qian, et al.
Published: (2024)
SPACE-CLIP: Spatial Perception via Adaptive CLIP Embeddings for Monocular Depth Estimation
by: Cho, Taewan, et al.
Published: (2026)
DouC: Dual-Branch CLIP for Training-Free Open-Vocabulary Segmentation
by: Zamini, Mohamad, et al.
Published: (2026)
Improving Visual Discriminability of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
by: Zhou, Jinxin, et al.
Published: (2025)
MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training
by: Vasu, Pavan Kumar Anasosalu, et al.
Published: (2023)
Erasing CLIP Memories: Non-Destructive, Data-Free Zero-Shot class Unlearning in CLIP Models
by: Mishra, Ashish, et al.
Published: (2025)
TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP
by: Cai, Yuliang, et al.
Published: (2025)
NeuCLIP: Efficient Large-Scale CLIP Training with Neural Normalizer Optimization
by: Wei, Xiyuan, et al.
Published: (2025)
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
by: Cao, Anh-Quan, et al.
Published: (2024)
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation
by: Ali, Muhammad, et al.
Published: (2024)
NeuroCLIP: Neuromorphic Data Understanding by CLIP and SNN
by: Guo, Yufei, et al.
Published: (2023)
Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection
by: Jung, Min Jae, et al.
Published: (2023)
Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
by: Che, Chang, et al.
Published: (2024)