| Main Authors: | Jordan, Jason, Lor, Mohammadreza Akbari, Koulen, Peter, Shyu, Mei-Ling, Chen, Shu-Ching |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.21358 |
Similar Items
Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment
by: Zhao, Pengfei, et al.
Published: (2025)
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
by: Jiang, Songtao, et al.
Published: (2024)
Fusion of Deep Learning and GIS for Advanced Remote Sensing Image Analysis
by: Afroosheh, Sajjad, et al.
Published: (2024)
Primordial Black Holes and the First Stars
by: Koulen, Julia Monika, et al.
Published: (2025)
Constraints on Primordial Black Holes from $N$-body simulations of the Eridanus II Stellar Cluster
by: Koulen, Julia Monika, et al.
Published: (2024)
Fundoscopic Cameras Associated with Shorter Emergency Department Length of Stay for Patients with Vision Loss
by: Dana R. Sax, et al.
Published: (2026)
SIMMER: Cross-Modal Food Image–Recipe Retrieval via MLLM-Based Embedding
by: Gomi, Keisuke, et al.
Published: (2026)
FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
by: Xu, Binqian, et al.
Published: (2024)
Deep Spatiotemporal Clutter Filtering of Transthoracic Echocardiographic Images: Leveraging Contextual Attention and Residual Learning
by: Tabassian, Mahdi, et al.
Published: (2024)
MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM
by: Kalan, Saqi Hussain, et al.
Published: (2025)
FDCT: Frequency-Aware Decomposition and Cross-Modal Token-Alignment for Multi-Sensor Target Classification
by: Sami, Shoaib Meraj, et al.
Published: (2025)
Gesture Classification in Artworks Using Contextual Image Features
by: Hussian, Azhar, et al.
Published: (2024)
MDF: A Dynamic Fusion Model for Multi-modal Fake News Detection
by: Lv, Hongzhen, et al.
Published: (2024)
MLLM-Driven Semantic Identifier Generation for Generative Cross-Modal Retrieval
by: Li, Tianyuan, et al.
Published: (2025)
Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification
by: Zhou, Meng, et al.
Published: (2023)
CrossBench: Generalized Crosstalk Benchmark Generation for Quantum Computers
by: Hawley, Jaden, et al.
Published: (2026)
DeepIPCv3: Event-Aware Multi-Modal Sensor Fusion for Sudden Pedestrian Crossing Avoidance
by: Natan, Oskar, et al.
Published: (2026)
Multiscale Feature Fusion Method for Liver Cirrhosis Classification
by: Shanshan Wang, et al.
Published: (2024)
Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision
by: Sun, Tianyao, et al.
Published: (2025)
TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion
by: Guo, Xiaodong, et al.
Published: (2025)
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
by: Huang, Hailang, et al.
Published: (2024)
CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation
by: Chen, ZhenQi, et al.
Published: (2025)
Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification
by: Mok, Dahyun, et al.
Published: (2024)
Interactive CNN and Transformer‐Based Cross‐Attention Fusion Network for Medical Image Classification
by: Shu Cai, et al.
Published: (2025)
Cross-Modal Synergies: Unveiling the Potential of Motion-Aware Fusion Networks in Handling Dynamic and Static ReID Scenarios
by: Ling, Fuxi, et al.
Published: (2025)
AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS
by: Ling, Hai, et al.
Published: (2025)
Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques
by: Liu, Weide, et al.
Published: (2025)
Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
by: Luo, Yuqing, et al.
Published: (2025)
SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2024)
Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance
by: Li, Jun, et al.
Published: (2024)
Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification
by: Goswami, Dipam, et al.
Published: (2026)
Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
by: Lan, Tian, et al.
Published: (2025)
Cross-Modal Diffusion for Biomechanical Dynamical Systems Through Local Manifold Alignment
by: Dey, Sharmita, et al.
Published: (2025)
Judge Anything: MLLM as a Judge Across Any Modality
by: Pu, Shu, et al.
Published: (2025)
Multi-Modal Image Fusion via Intervention-Stable Feature Learning
by: Wang, Xue, et al.
Published: (2026)
Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification
by: Yang, Xi, et al.
Published: (2024)
Existence Result for Difference Equations on Non-Uniform Grids via Upper and Lower Solution Method
by: Bandyopadhyay, Shalmali, et al.
Published: (2025)
EfficientECG: Cross-Attention with Feature Fusion for Efficient Electrocardiogram Classification
by: Deng, Hanhui, et al.
Published: (2025)
A Lightweight Attention-based Deep Network via Multi-Scale Feature Fusion for Multi-View Facial Expression Recognition
by: Ezati, Ali, et al.
Published: (2024)
Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering
by: Prakash, Nirmalendu, et al.
Published: (2026)