:: Library Catalog

Bibliographic Details
Main Authors:	Zhou, Qiongyi; Du, Changde; Wang, Shengpei; He, Huiguang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2402.08994

Similar Items

Versatile Framework with Semantic and Structural guidance for Image Reconstruction from Brain Activity
by: Lu, Yizhuo, et al.
Published: (2026)

NeuralOOD: Improving Out-of-Distribution Generalization Performance with Brain-machine Fusion Learning Framework
by: Zhao, Shuangchen, et al.
Published: (2024)

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2025)

Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity
by: Lu, Yizhuo, et al.
Published: (2024)

CLIP-Guided Unsupervised Semantic-Aware Exposure Correction
by: Wu, Puzhen, et al.
Published: (2026)

VeCLIP: Improving CLIP Training via Visual-enriched Captions
by: Lai, Zhengfeng, et al.
Published: (2023)

Knowledge-Base based Semantic Image Transmission Using CLIP
by: Li, Chongyang, et al.
Published: (2025)

CLIPin: A Non-contrastive Plug-in to CLIP for Multimodal Semantic Alignment
by: Yang, Shengzhu, et al.
Published: (2025)

DesignCLIP: Multimodal Learning with CLIP for Design Patent Understanding
by: Wang, Zhu, et al.
Published: (2025)

Human-like object concept representations emerge naturally in multimodal large language models
by: Du, Changde, et al.
Published: (2024)

Position-aware Guided Point Cloud Completion with CLIP Model
by: Zhou, Feng, et al.
Published: (2024)

Color in Visual-Language Models: CLIP deficiencies
by: Arias, Guillem, et al.
Published: (2025)

FG-CLIP: Fine-Grained Visual and Textual Alignment
by: Xie, Chunyu, et al.
Published: (2025)

CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP
by: Zeng, Yirui, et al.
Published: (2025)

TriCLIP-3D: A Unified Parameter-Efficient Framework for Tri-Modal 3D Visual Grounding based on CLIP
by: Li, Fan, et al.
Published: (2025)

Implicit Inversion turns CLIP into a Decoder
by: D'Orazio, Antonio, et al.
Published: (2025)

AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP
by: Ma, Wenxin, et al.
Published: (2025)

Concept Visualization: Explaining the CLIP Multi-modal Embedding Using WordNet
by: Giulivi, Loris, et al.
Published: (2024)

Parrot Captions Teach CLIP to Spot Text
by: Lin, Yiqi, et al.
Published: (2023)

InterCLIP-MEP: Interactive CLIP and Memory-Enhanced Predictor for Multi-modal Sarcasm Detection
by: Chen, Junjie, et al.
Published: (2024)

FoCLIP: A Feature-Space Misalignment Framework for CLIP-Based Image Manipulation and Detection
by: Chen, Yulin, et al.
Published: (2025)

DiffCLIP: Differential Attention Meets CLIP
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)

TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP
by: Cai, Yuliang, et al.
Published: (2025)

HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion
by: Zhang, Shiyi, et al.
Published: (2025)

VL-CLIP: Enhancing Multimodal Recommendations via Visual Grounding and LLM-Augmented CLIP Embeddings
by: Giahi, Ramin, et al.
Published: (2025)

PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval
by: Pan, Jiancheng, et al.
Published: (2024)

Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation
by: Stangl, Kevin, et al.
Published: (2024)

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling
by: Zhang, Jihai, et al.
Published: (2024)

Video CLIP Model for Multi-View Echocardiography Interpretation
by: Takizawa, Ryo, et al.
Published: (2025)

CountCLIP -- [Re] Teaching CLIP to Count to Ten
by: Mestha, Harshvardhan, et al.
Published: (2024)

CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP
by: Yang, Tianyu, et al.
Published: (2024)

RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
by: Gaintseva, Tatiana, et al.
Published: (2024)

MR-CLIP: Efficient Metadata-Guided Learning of MRI Contrast Representations
by: Avci, Mehmet Yigit, et al.
Published: (2025)

SAPL: Semantic-Agnostic Prompt Learning in CLIP for Weakly Supervised Image Manipulation Localization
by: Wang, Xinghao, et al.
Published: (2026)

CLIP is All You Need for Human-like Semantic Representations in Stable Diffusion
by: Braunstein, Cameron, et al.
Published: (2025)

The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
by: Nam, Kahyeon, et al.
Published: (2026)

Semantic Relation-Enhanced CLIP Adapter for Domain Adaptive Zero-Shot Learning
by: Yu, Jiaao, et al.
Published: (2025)

LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
by: Cao, Anh-Quan, et al.
Published: (2024)

Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records
by: He, Yili, et al.
Published: (2025)

Classifier-guided CLIP Distillation for Unsupervised Multi-label Classification
by: Kim, Dongseob, et al.
Published: (2025)