:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jordan, Jason, Lor, Mohammadreza Akbari, Koulen, Peter, Shyu, Mei-Ling, Chen, Shu-Ching
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.21358
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Guiding Cross-Modal Representations with MLLM Priors via Preference Alignment
by: Zhao, Pengfei, et al.
Published: (2025)

Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
by: Jiang, Songtao, et al.
Published: (2024)

Fusion of Deep Learning and GIS for Advanced Remote Sensing Image Analysis
by: Afroosheh, Sajjad, et al.
Published: (2024)

Primordial Black Holes and the First Stars
by: Koulen, Julia Monika, et al.
Published: (2025)

Constraints on Primordial Black Holes from $N$-body simulations of the Eridanus II Stellar Cluster
by: Koulen, Julia Monika, et al.
Published: (2024)

Fundoscopic Cameras Associated with Shorter Emergency Department Length of Stay for Patients with Vision Loss
by: Dana R. Sax, et al.
Published: (2026)

SIMMER: Cross-Modal Food Image--Recipe Retrieval via MLLM-Based Embedding
by: Gomi, Keisuke, et al.
Published: (2026)

FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
by: Xu, Binqian, et al.
Published: (2024)

Deep Spatiotemporal Clutter Filtering of Transthoracic Echocardiographic Images: Leveraging Contextual Attention and Residual Learning
by: Tabassian, Mahdi, et al.
Published: (2024)

MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM
by: Kalan, Saqi Hussain, et al.
Published: (2025)

FDCT: Frequency-Aware Decomposition and Cross-Modal Token-Alignment for Multi-Sensor Target Classification
by: Sami, Shoaib Meraj, et al.
Published: (2025)

Gesture Classification in Artworks Using Contextual Image Features
by: Hussian, Azhar, et al.
Published: (2024)

MDF: A Dynamic Fusion Model for Multi-modal Fake News Detection
by: Lv, Hongzhen, et al.
Published: (2024)

MLLM-Driven Semantic Identifier Generation for Generative Cross-Modal Retrieval
by: Li, Tianyuan, et al.
Published: (2025)

Domain Transfer Through Image-to-Image Translation for Uncertainty-Aware Prostate Cancer Classification
by: Zhou, Meng, et al.
Published: (2023)

CrossBench: Generalized Crosstalk Benchmark Generation for Quantum Computers
by: Hawley, Jaden, et al.
Published: (2026)

DeepIPCv3: Event-Aware Multi-Modal Sensor Fusion for Sudden Pedestrian Crossing Avoidance
by: Natan, Oskar, et al.
Published: (2026)

Multiscale Feature Fusion Method for Liver Cirrhosis Classification
by: Shanshan Wang, et al.
Published: (2024)

Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision
by: Sun, Tianyao, et al.
Published: (2025)

TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion
by: Guo, Xiaodong, et al.
Published: (2025)

Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
by: Huang, Hailang, et al.
Published: (2024)

CritiFusion: Semantic Critique and Spectral Alignment for Faithful Text-to-Image Generation
by: Chen, ZhenQi, et al.
Published: (2025)

Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification
by: Mok, Dahyun, et al.
Published: (2024)

Interactive CNN and Transformer‐Based Cross‐Attention Fusion Network for Medical Image Classification
by: Shu Cai, et al.
Published: (2025)

Cross-Modal Synergies: Unveiling the Potential of Motion-Aware Fusion Networks in Handling Dynamic and Static ReID Scenarios
by: Ling, Fuxi, et al.
Published: (2025)

AUV-Fusion: Cross-Modal Adversarial Fusion of User Interactions and Visual Perturbations Against VARS
by: Ling, Hai, et al.
Published: (2025)

Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques
by: Liu, Weide, et al.
Published: (2025)

Cross-Modal Scene Semantic Alignment for Image Complexity Assessment
by: Luo, Yuqing, et al.
Published: (2025)

SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification
by: Alkhatib, Mohammed Q., et al.
Published: (2024)

Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance
by: Li, Jun, et al.
Published: (2024)

Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification
by: Goswami, Dipam, et al.
Published: (2026)

Diffusion-Based Cross-Modal Feature Extraction for Multi-Label Classification
by: Lan, Tian, et al.
Published: (2025)

Cross-Modal Diffusion for Biomechanical Dynamical Systems Through Local Manifold Alignment
by: Dey, Sharmita, et al.
Published: (2025)

Judge Anything: MLLM as a Judge Across Any Modality
by: Pu, Shu, et al.
Published: (2025)

Multi-Modal Image Fusion via Intervention-Stable Feature Learning
by: Wang, Xue, et al.
Published: (2026)

Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification
by: Yang, Xi, et al.
Published: (2024)

Existence Result for Difference Equations on Non-Uniform Grids via Upper and Lower Solution Method
by: Bandyopadhyay, Shalmali, et al.
Published: (2025)

EfficientECG: Cross-Attention with Feature Fusion for Efficient Electrocardiogram Classification
by: Deng, Hanhui, et al.
Published: (2025)

A Lightweight Attention-based Deep Network via Multi-Scale Feature Fusion for Multi-View Facial Expression Recognition
by: Ezati, Ali, et al.
Published: (2024)

Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering
by: Prakash, Nirmalendu, et al.
Published: (2026)