:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Yiping, Chen, Yifang, Yan, Wendan, Fang, Alex, Zhou, Wenjing, Jamieson, Kevin, Du, Simon Shaolei
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.19547
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning
by: Wang, Yiping, et al.
Published: (2024)

LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
by: Zhang, Jifan, et al.
Published: (2023)

Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
by: Wang, Yiping, et al.
Published: (2024)

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
by: Zhang, Shaolei, et al.
Published: (2025)

RADARSAT Constellation Mission Compact Polarisation SAR Data for Burned Area Mapping with Deep Learning
by: Zhao, Yu, et al.
Published: (2024)

Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval
by: Li, Siting, et al.
Published: (2025)

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
by: Zhang, Shaolei, et al.
Published: (2025)

Exploring How Generative MLLMs Perceive More Than CLIP with the Same Vision Encoder
by: Li, Siting, et al.
Published: (2024)

DDFP: Data-dependent Frequency Prompt for Source Free Domain Adaptation of Medical Image Segmentation
by: Yin, Siqi, et al.
Published: (2025)

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning
by: Zhao, Zihua, et al.
Published: (2025)

CSE: Surface Anomaly Detection with Contrastively Selected Embedding
by: Thomine, Simon, et al.
Published: (2024)

PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection
by: Bi, Jinhe, et al.
Published: (2025)

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
by: Jiang, Chaoya, et al.
Published: (2023)

Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
by: Hu, Runze, et al.
Published: (2024)

DFU: scale-robust diffusion model for zero-shot super-resolution image generation
by: Havrilla, Alex, et al.
Published: (2023)

ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference
by: Huang, Zhaohong, et al.
Published: (2026)

Low-Rank Adaptation of Geospatial Foundation Models for Wildfire Mapping Using Sentinel-2 Data
by: Shibli, Ali, et al.
Published: (2026)

Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
by: Xu, Yifang, et al.
Published: (2025)

Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration
by: Hafner, Sebastian, et al.
Published: (2024)

Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
by: Tong, Bingkui, et al.
Published: (2025)

A Multi-view Mask Contrastive Learning Graph Convolutional Neural Network for Age Estimation
by: Zhang, Yiping, et al.
Published: (2024)

GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
by: Sun, Yunzhuo, et al.
Published: (2024)

Vision+X: A Survey on Multimodal Learning in the Light of Data
by: Zhu, Ye, et al.
Published: (2022)

Contrastive Learning for Multimodal Human Activity Recognition with Limited Labeled Data
by: Jing, Long, et al.
Published: (2026)

Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion
by: Chen, Peiyuan, et al.
Published: (2024)

Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding
by: Li, Yueyang, et al.
Published: (2024)

SCL: Towards Domain Generalization via Single-Temporal Multimodal Contrastive Learning for Remote Sensing Change Detection
by: Du, Qiangang, et al.
Published: (2024)

Robots Autonomously Detecting People: A Multimodal Deep Contrastive Learning Method Robust to Intraclass Variations
by: Fung, Angus, et al.
Published: (2022)

Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification
by: Cai, Jianfeng, et al.
Published: (2024)

FaceSnap: Enhanced ID-fidelity Network for Tuning-free Portrait Customization
by: Zhai, Benxiang, et al.
Published: (2026)

Pyramid Feature Attention Network for Monocular Depth Prediction
by: Xu, Yifang, et al.
Published: (2024)

BadCLIP++: Stealthy and Persistent Backdoors in Multimodal Contrastive Learning
by: Liang, Siyuan, et al.
Published: (2026)

RegionMed-CLIP: A Region-Aware Multimodal Contrastive Learning Pre-trained Model for Medical Image Understanding
by: Fang, Tianchen, et al.
Published: (2025)

Diverse Subset Selection via Norm-Based Sampling and Orthogonality
by: Bar, Noga, et al.
Published: (2024)

Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation
by: Chen, Haipeng, et al.
Published: (2025)

JSCDS: A Core Data Selection Method with Jason-Shannon Divergence for Caries RGB Images-Efficient Learning
by: Zhang, Peiliang, et al.
Published: (2024)

MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging
by: Zhou, Jiaying, et al.
Published: (2024)

Generalized Contrastive Learning for Universal Multimodal Retrieval
by: Lee, Jungsoo, et al.
Published: (2025)

Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification
by: Du, Siyi, et al.
Published: (2026)

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
by: Ye, Junyan, et al.
Published: (2024)