| Main Authors: | Gordon, Lucia; Lang, Nico; Ressijac, Catherine; Davies, Andrew |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.04833 |
Similar Items
Find Rhinos without Finding Rhinos: Active Learning with Multimodal Imagery of South African Rhino Habitats
by: Gordon, Lucia, et al.
Published: (2024)
Contrasting local and global modeling with machine learning and satellite data: A case study estimating tree canopy height in African savannas
by: Rolf, Esther, et al.
Published: (2024)
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features
by: Li, Po-han, et al.
Published: (2024)
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
by: Monninger, Thomas, et al.
Published: (2025)
MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction
by: Song, Jingyu, et al.
Published: (2024)
A Multimodal Fusion Model Leveraging MLP Mixer and Handcrafted Features-based Deep Learning Networks for Facial Palsy Detection
by: Oo, Heng Yim Nicole, et al.
Published: (2025)
Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary
by: Florea, Alexandru, et al.
Published: (2026)
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning
by: Nedungadi, Vishal, et al.
Published: (2024)
Automatic Image Annotation for Mapped Features Detection
by: Noizet, Maxime, et al.
Published: (2024)
Data-Efficient Multimodal Fusion on a Single GPU
by: Vouitsis, Noël, et al.
Published: (2023)
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
by: Ding, Dexuan, et al.
Published: (2024)
Unsupervised Active Learning via Natural Feature Progressive Framework
by: Liu, Yuxi, et al.
Published: (2025)
Token Activation Map to Visually Explain Multimodal LLMs
by: Li, Yi, et al.
Published: (2025)
FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
by: Tan, Min, et al.
Published: (2026)
Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion
by: Li, Jiawei, et al.
Published: (2024)
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
by: Wang, Zehan, et al.
Published: (2024)
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation
by: Li, Bingyu, et al.
Published: (2024)
Feature Fusion for Improved Classification: Combining Dempster-Shafer Theory and Multiple CNN Architectures
by: Alzahem, Ayyub, et al.
Published: (2024)
VIFO: Visual Feature Empowered Multivariate Time Series Forecasting with Cross-Modal Fusion
by: Wang, Yanlong, et al.
Published: (2025)
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation
by: Yao, Wenfang, et al.
Published: (2024)
Exploring a Multimodal Fusion-based Deep Learning Network for Detecting Facial Palsy
by: Oo, Heng Yim Nicole, et al.
Published: (2024)
Robult: Leveraging Redundancy and Modality Specific Features for Robust Multimodal Learning
by: Nguyen, Duy A., et al.
Published: (2025)
Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment
by: Guthula, Venkanna Babu, et al.
Published: (2024)
The Lie Derivative for Measuring Learned Equivariance
by: Gruver, Nate, et al.
Published: (2022)
Visual Explanations of Image-Text Representations via Multi-Modal Information Bottleneck Attribution
by: Wang, Ying, et al.
Published: (2023)
MapIQ: Evaluating Multimodal Large Language Models for Map Question Answering
by: Srivastava, Varun, et al.
Published: (2025)
FusionEnsemble-Net: An Attention-Based Ensemble of Spatiotemporal Networks for Multimodal Sign Language Recognition
by: Islam, Md. Milon, et al.
Published: (2025)
Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation
by: Neha, Fnu, et al.
Published: (2024)
Multi-scale Quaternion CNN and BiGRU with Cross Self-attention Feature Fusion for Fault Diagnosis of Bearing
by: Liu, Huanbai, et al.
Published: (2024)
MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection
by: Wang, Yuxiang, et al.
Published: (2025)
FILS: Self-Supervised Video Feature Prediction In Semantic Language Space
by: Ahmadian, Mona, et al.
Published: (2024)
WiTUnet: A U-Shaped Architecture Integrating CNN and Transformer for Improved Feature Alignment and Local Information Fusion
by: Wang, Bin, et al.
Published: (2024)
VLSU: Mapping the Limits of Joint Multimodal Understanding for AI Safety
by: Palaskar, Shruti, et al.
Published: (2025)
EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration
by: Song, Zhifan, et al.
Published: (2025)
LiMTR: Time Series Motion Prediction for Diverse Road Users through Multimodal Feature Integration
by: Oerlemans, Camiel, et al.
Published: (2024)
MoPE: Mixture of Prompt Experts for Parameter-Efficient and Scalable Multimodal Fusion
by: Jiang, Ruixiang, et al.
Published: (2024)
From Data to Insights: A Covariate Analysis of the IARPA BRIAR Dataset for Multimodal Biometric Recognition Algorithms at Altitude and Range
by: Bolme, David S., et al.
Published: (2024)
From Radiologist Report to Image Label: Assessing Latent Dirichlet Allocation in Training Neural Networks for Orthopedic Radiograph Classification
by: Olczak, Jakub, et al.
Published: (2024)
Spatiotemporal Air Quality Mapping in Urban Areas Using Sparse Sensor Data, Satellite Imagery, Meteorological Factors, and Spatial Features
by: Ahmad, Osama, et al.
Published: (2025)
GAF-FusionNet: Multimodal ECG Analysis via Gramian Angular Fields and Split Attention
by: Qin, Jiahao, et al.
Published: (2024)