Saved in:
| Main Authors: | Lahajal, Naresh Kumar, S, Harini |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.13613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding
by: Xu, Jingqi
Published: (2026)
by: Xu, Jingqi
Published: (2026)
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries
by: Bhattacharyya, Sree, et al.
Published: (2025)
by: Bhattacharyya, Sree, et al.
Published: (2025)
Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
by: Che, Chang, et al.
Published: (2024)
by: Che, Chang, et al.
Published: (2024)
Evaluating Contextual Intelligence in Recyclability: A Comprehensive Study of Image-Based Reasoning Systems
by: Park, Eliot, et al.
Published: (2025)
by: Park, Eliot, et al.
Published: (2025)
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval
by: Pan, Jiancheng, et al.
Published: (2024)
by: Pan, Jiancheng, et al.
Published: (2024)
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training
by: Lu, Zhixiu, et al.
Published: (2024)
by: Lu, Zhixiu, et al.
Published: (2024)
IDEA: Image Description Enhanced CLIP-Adapter
by: Ye, Zhipeng, et al.
Published: (2025)
by: Ye, Zhipeng, et al.
Published: (2025)
FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection
by: Chen, Yulin, et al.
Published: (2025)
by: Chen, Yulin, et al.
Published: (2025)
ViCLIP-OT: The First Foundation Vision-Language Model for Vietnamese Image-Text Retrieval with Optimal Transport
by: Tran, Quoc-Khang, et al.
Published: (2026)
by: Tran, Quoc-Khang, et al.
Published: (2026)
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
by: Ma, Wenxin, et al.
Published: (2025)
by: Ma, Wenxin, et al.
Published: (2025)
CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval
by: Liu, Yating, et al.
Published: (2023)
by: Liu, Yating, et al.
Published: (2023)
CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP
by: Zeng, Yirui, et al.
Published: (2025)
by: Zeng, Yirui, et al.
Published: (2025)
HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion
by: Zhang, Shiyi, et al.
Published: (2025)
by: Zhang, Shiyi, et al.
Published: (2025)
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
by: Zeng, Gangyan, et al.
Published: (2024)
by: Zeng, Gangyan, et al.
Published: (2024)
Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering
by: Prakash, Nirmalendu, et al.
Published: (2026)
by: Prakash, Nirmalendu, et al.
Published: (2026)
A Comprehensive Dataset for Human vs. AI Generated Image Detection
by: Roy, Rajarshi, et al.
Published: (2026)
by: Roy, Rajarshi, et al.
Published: (2026)
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)
by: Silva, Sathira, et al.
Published: (2025)
Frame-Difference Guided Dynamic Region Perception for CLIP Adaptation in Text-Video Retrieval
by: Yu, Jiaao, et al.
Published: (2025)
by: Yu, Jiaao, et al.
Published: (2025)
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
by: Lin, Haokun, et al.
Published: (2025)
by: Lin, Haokun, et al.
Published: (2025)
A Comprehensive Survey on Composed Image Retrieval
by: Song, Xuemeng, et al.
Published: (2025)
by: Song, Xuemeng, et al.
Published: (2025)
Knowledge-Base based Semantic Image Transmission Using CLIP
by: Li, Chongyang, et al.
Published: (2025)
by: Li, Chongyang, et al.
Published: (2025)
Interpreting CLIP's Image Representation via Text-Based Decomposition
by: Gandelsman, Yossi, et al.
Published: (2023)
by: Gandelsman, Yossi, et al.
Published: (2023)
InterCLIP-MEP: Interactive CLIP and Memory-Enhanced Predictor for Multi-modal Sarcasm Detection
by: Chen, Junjie, et al.
Published: (2024)
by: Chen, Junjie, et al.
Published: (2024)
GEA: Generation-Enhanced Alignment for Text-to-Image Person Retrieval
by: Zou, Hao, et al.
Published: (2025)
by: Zou, Hao, et al.
Published: (2025)
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2025)
by: Gao, Bin-Bin, et al.
Published: (2025)
DesignCLIP: Multimodal Learning with CLIP for Design Patent Understanding
by: Wang, Zhu, et al.
Published: (2025)
by: Wang, Zhu, et al.
Published: (2025)
Enhancing Compositional Reasoning in CLIP via Reconstruction and Alignment of Text Descriptions
by: Kwon, Jihoon, et al.
Published: (2025)
by: Kwon, Jihoon, et al.
Published: (2025)
MedProbCLIP: Probabilistic Adaptation of Vision-Language Foundation Model for Reliable Radiograph-Report Retrieval
by: Elallaf, Ahmad, et al.
Published: (2026)
by: Elallaf, Ahmad, et al.
Published: (2026)
RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
by: Gaintseva, Tatiana, et al.
Published: (2024)
by: Gaintseva, Tatiana, et al.
Published: (2024)
CLIP Model for Images to Textual Prompts Based on Top-k Neighbors
by: Zhang, Xin, et al.
Published: (2024)
by: Zhang, Xin, et al.
Published: (2024)
PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval
by: Xu, Tianyi, et al.
Published: (2026)
by: Xu, Tianyi, et al.
Published: (2026)
Semantic Relation-Enhanced CLIP Adapter for Domain Adaptive Zero-Shot Learning
by: Yu, Jiaao, et al.
Published: (2025)
by: Yu, Jiaao, et al.
Published: (2025)
Few-Shot Remote Sensing Image Scene Classification with CLIP and Prompt Learning
by: Dimitrovski, Ivica, et al.
Published: (2025)
by: Dimitrovski, Ivica, et al.
Published: (2025)
Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification
by: Sun, Xiangyu, et al.
Published: (2025)
by: Sun, Xiangyu, et al.
Published: (2025)
Beyond Cosine Similarity: Magnitude-Aware CLIP for No-Reference Image Quality Assessment
by: Liao, Zhicheng, et al.
Published: (2025)
by: Liao, Zhicheng, et al.
Published: (2025)
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP
by: Cai, Yuliang, et al.
Published: (2025)
by: Cai, Yuliang, et al.
Published: (2025)
Aquila: A Hierarchically Aligned Visual-Language Model for Enhanced Remote Sensing Image Comprehension
by: Lu, Kaixuan, et al.
Published: (2024)
by: Lu, Kaixuan, et al.
Published: (2024)
GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining
by: Mohan, Deen Dayal, et al.
Published: (2026)
by: Mohan, Deen Dayal, et al.
Published: (2026)
Transformers Meet Hyperspectral Imaging: A Comprehensive Study of Models, Challenges and Open Problems
by: Zhang, Guyang, et al.
Published: (2025)
by: Zhang, Guyang, et al.
Published: (2025)
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation
by: Zhong, Siru, et al.
Published: (2024)
by: Zhong, Siru, et al.
Published: (2024)
Similar Items
-
Omni-NegCLIP: Enhancing CLIP with Front-Layer Contrastive Fine-Tuning for Comprehensive Negation Understanding
by: Xu, Jingqi
Published: (2026) -
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries
by: Bhattacharyya, Sree, et al.
Published: (2025) -
Enhancing Multimodal Understanding with CLIP-Based Image-to-Text Transformation
by: Che, Chang, et al.
Published: (2024) -
Evaluating Contextual Intelligence in Recyclability: A Comprehensive Study of Image-Based Reasoning Systems
by: Park, Eliot, et al.
Published: (2025) -
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval
by: Pan, Jiancheng, et al.
Published: (2024)