| Main Authors: | Wang, Yiping, Chen, Yifang, Yan, Wendan, Fang, Alex, Zhou, Wenjing, Jamieson, Kevin, Du, Simon Shaolei |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.19547 |
Similar Items
Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning
by: Wang, Yiping, et al.
Published: (2024)
LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
by: Zhang, Jifan, et al.
Published: (2023)
Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
by: Wang, Yiping, et al.
Published: (2024)
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
by: Zhang, Shaolei, et al.
Published: (2025)
RADARSAT Constellation Mission Compact Polarisation SAR Data for Burned Area Mapping with Deep Learning
by: Zhao, Yu, et al.
Published: (2024)
Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval
by: Li, Siting, et al.
Published: (2025)
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
by: Zhang, Shaolei, et al.
Published: (2025)
Exploring How Generative MLLMs Perceive More Than CLIP with the Same Vision Encoder
by: Li, Siting, et al.
Published: (2024)
DDFP: Data-dependent Frequency Prompt for Source Free Domain Adaptation of Medical Image Segmentation
by: Yin, Siqi, et al.
Published: (2025)
Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning
by: Zhao, Zihua, et al.
Published: (2025)
CSE: Surface Anomaly Detection with Contrastively Selected Embedding
by: Thomine, Simon, et al.
Published: (2024)
PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection
by: Bi, Jinhe, et al.
Published: (2025)
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
by: Jiang, Chaoya, et al.
Published: (2023)
Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
by: Hu, Runze, et al.
Published: (2024)
DFU: scale-robust diffusion model for zero-shot super-resolution image generation
by: Havrilla, Alex, et al.
Published: (2023)
ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference
by: Huang, Zhaohong, et al.
Published: (2026)
Low-Rank Adaptation of Geospatial Foundation Models for Wildfire Mapping Using Sentinel-2 Data
by: Shibli, Ali, et al.
Published: (2026)
Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
by: Xu, Yifang, et al.
Published: (2025)
Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration
by: Hafner, Sebastian, et al.
Published: (2024)
Mitigating Hallucination in Multimodal LLMs with Layer Contrastive Decoding
by: Tong, Bingkui, et al.
Published: (2025)
A Multi-view Mask Contrastive Learning Graph Convolutional Neural Network for Age Estimation
by: Zhang, Yiping, et al.
Published: (2024)
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
by: Sun, Yunzhuo, et al.
Published: (2024)
Vision+X: A Survey on Multimodal Learning in the Light of Data
by: Zhu, Ye, et al.
Published: (2022)
Contrastive Learning for Multimodal Human Activity Recognition with Limited Labeled Data
by: Jing, Long, et al.
Published: (2026)
Enhancing Visual Question Answering through Ranking-Based Hybrid Training and Multimodal Fusion
by: Chen, Peiyuan, et al.
Published: (2024)
Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding
by: Li, Yueyang, et al.
Published: (2024)
SCL: Towards Domain Generalization via Single-Temporal Multimodal Contrastive Learning for Remote Sensing Change Detection
by: Du, Qiangang, et al.
Published: (2024)
Robots Autonomously Detecting People: A Multimodal Deep Contrastive Learning Method Robust to Intraclass Variations
by: Fung, Angus, et al.
Published: (2022)
Heterogeneous Network Based Contrastive Learning Method for PolSAR Land Cover Classification
by: Cai, Jianfeng, et al.
Published: (2024)
FaceSnap: Enhanced ID-fidelity Network for Tuning-free Portrait Customization
by: Zhai, Benxiang, et al.
Published: (2026)
Pyramid Feature Attention Network for Monocular Depth Prediction
by: Xu, Yifang, et al.
Published: (2024)
BadCLIP++: Stealthy and Persistent Backdoors in Multimodal Contrastive Learning
by: Liang, Siyuan, et al.
Published: (2026)
RegionMed-CLIP: A Region-Aware Multimodal Contrastive Learning Pre-trained Model for Medical Image Understanding
by: Fang, Tianchen, et al.
Published: (2025)
Diverse Subset Selection via Norm-Based Sampling and Orthogonality
by: Bar, Noga, et al.
Published: (2024)
Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation
by: Chen, Haipeng, et al.
Published: (2025)
JSCDS: A Core Data Selection Method with Jason-Shannon Divergence for Caries RGB Images-Efficient Learning
by: Zhang, Peiliang, et al.
Published: (2024)
MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging
by: Zhou, Jiaying, et al.
Published: (2024)
Generalized Contrastive Learning for Universal Multimodal Retrieval
by: Lee, Jungsoo, et al.
Published: (2025)
Inference-Time Dynamic Modality Selection for Incomplete Multimodal Classification
by: Du, Siyi, et al.
Published: (2026)
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
by: Ye, Junyan, et al.
Published: (2024)