Saved in:
Bibliographic Details
Main Authors: Liu, Quan, Yao, Jiawen, Yao, Lisha, Chen, Xin, Zhou, Jingren, Lu, Le, Zhang, Ling, Liu, Zaiyi, Huo, Yuankai
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.07854
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914642250432512
author Liu, Quan
Yao, Jiawen
Yao, Lisha
Chen, Xin
Zhou, Jingren
Lu, Le
Zhang, Ling
Liu, Zaiyi
Huo, Yuankai
author_facet Liu, Quan
Yao, Jiawen
Yao, Lisha
Chen, Xin
Zhou, Jingren
Lu, Le
Zhang, Ling
Liu, Zaiyi
Huo, Yuankai
contents Colorectal cancer (CRC) micro-satellite instability (MSI) prediction on histopathology images is a challenging weakly supervised learning task that involves multi-instance learning on gigapixel images. To date, radiology images have proven to have CRC MSI information and efficient patient imaging techniques. Different data modalities integration offers the opportunity to increase the accuracy and robustness of MSI prediction. Despite the progress in representation learning from the whole slide images (WSI) and exploring the potential of making use of radiology data, CRC MSI prediction remains a challenge to fuse the information from multiple data modalities (e.g., pathology WSI and radiology CT image). In this paper, we propose $M^{2}$Fusion: a Bayesian-based multimodal multi-level fusion pipeline for CRC MSI. The proposed fusion model $M^{2}$Fusion is capable of discovering more novel patterns within and across modalities that are beneficial for predicting MSI than using a single modality alone, as well as other fusion methods. The contribution of the paper is three-fold: (1) $M^{2}$Fusion is the first pipeline of multi-level fusion on pathology WSI and 3D radiology CT image for MSI prediction; (2) CT images are the first time integrated into multimodal fusion for CRC MSI prediction; (3) feature-level fusion strategy is evaluated on both Transformer-based and CNN-based method. Our approach is validated on cross-validation of 352 cases and outperforms either feature-level (0.8177 vs. 0.7908) or decision-level fusion strategy (0.8177 vs. 0.7289) on AUC score.
format Preprint
id arxiv_https___arxiv_org_abs_2401_07854
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle $M^{2}$Fusion: Bayesian-based Multimodal Multi-level Fusion on Colorectal Cancer Microsatellite Instability Prediction
Liu, Quan
Yao, Jiawen
Yao, Lisha
Chen, Xin
Zhou, Jingren
Lu, Le
Zhang, Ling
Liu, Zaiyi
Huo, Yuankai
Computer Vision and Pattern Recognition
Colorectal cancer (CRC) micro-satellite instability (MSI) prediction on histopathology images is a challenging weakly supervised learning task that involves multi-instance learning on gigapixel images. To date, radiology images have proven to have CRC MSI information and efficient patient imaging techniques. Different data modalities integration offers the opportunity to increase the accuracy and robustness of MSI prediction. Despite the progress in representation learning from the whole slide images (WSI) and exploring the potential of making use of radiology data, CRC MSI prediction remains a challenge to fuse the information from multiple data modalities (e.g., pathology WSI and radiology CT image). In this paper, we propose $M^{2}$Fusion: a Bayesian-based multimodal multi-level fusion pipeline for CRC MSI. The proposed fusion model $M^{2}$Fusion is capable of discovering more novel patterns within and across modalities that are beneficial for predicting MSI than using a single modality alone, as well as other fusion methods. The contribution of the paper is three-fold: (1) $M^{2}$Fusion is the first pipeline of multi-level fusion on pathology WSI and 3D radiology CT image for MSI prediction; (2) CT images are the first time integrated into multimodal fusion for CRC MSI prediction; (3) feature-level fusion strategy is evaluated on both Transformer-based and CNN-based method. Our approach is validated on cross-validation of 352 cases and outperforms either feature-level (0.8177 vs. 0.7908) or decision-level fusion strategy (0.8177 vs. 0.7289) on AUC score.
title $M^{2}$Fusion: Bayesian-based Multimodal Multi-level Fusion on Colorectal Cancer Microsatellite Instability Prediction
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2401.07854