Saved in:
Bibliographic Details
Main Authors: Wang, Xuan, Xu, Zhongling, Kannedhara, Gopi, Nguyen, Joakim, Yu, Jian, Fang, Jinrui, Baghdadi, Abdurrahmaan, Chen, Tianlong, Naeem, Awais, Krishnan, Chandra, Castillo, Edward, Song, Andrew H., Shukla, Ankita, Ding, Ying, Konz, Nicholas, Wang, Hairong
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.24399
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916057686474752
author Wang, Xuan
Xu, Zhongling
Kannedhara, Gopi
Nguyen, Joakim
Yu, Jian
Fang, Jinrui
Baghdadi, Abdurrahmaan
Chen, Tianlong
Naeem, Awais
Krishnan, Chandra
Castillo, Edward
Song, Andrew H.
Shukla, Ankita
Ding, Ying
Konz, Nicholas
Wang, Hairong
author_facet Wang, Xuan
Xu, Zhongling
Kannedhara, Gopi
Nguyen, Joakim
Yu, Jian
Fang, Jinrui
Baghdadi, Abdurrahmaan
Chen, Tianlong
Naeem, Awais
Krishnan, Chandra
Castillo, Edward
Song, Andrew H.
Shukla, Ankita
Ding, Ying
Konz, Nicholas
Wang, Hairong
contents Healthcare models are transitioning from unimodal prediction toward multimodal reasoning over heterogeneous diagnostic inputs. In computational pathology, for complex tumor subtypes where morphology alone can be challenging to distinguish, pathology reports and molecular measurements may provide additional diagnostic evidence alongside whole-slide images, yet existing models often fail to clarify how diverse signals assemble into recognizable diagnostic concepts. We propose ConceptM$^3$oE (Concept Multimodal MoE), which embeds concept formation directly within interaction-aware mixture-of-experts (MoE) pathways. The architecture decomposes evidence into modality-specific, redundant, and synergistic experts, which are then projected into structured concept bottlenecks mapping latent features to a hierarchy of morphology and biomarker concepts. To prevent the information loss typical of interpretable bottlenecks, we utilize residual pathways within each expert to allow task-relevant signals to flow both through the concepts and directly to the final task prediction, so that high performance is maintained alongside interpretability. Across an institutional pediatric brain tumor cohort and a public glioma cohort, the framework delivers competitive performance to unconstrained models while producing reasoning traces validated by an independent neuropathologist. In data-limited regimes, ConceptM$^3$oE improves limited-data performance, increasing macro-F1 from 56.41% to 66.70% at small training sizes compared to non-concept-informed baselines, while also showing faster training convergence consistent with the regularizing effect of concept learning. This work offers a scalable path toward high-performance medical AI that is inherently verifiable and better aligned with the complex decision-making of clinical practice.
format Preprint
id arxiv_https___arxiv_org_abs_2605_24399
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle ConceptM$^3$oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology
Wang, Xuan
Xu, Zhongling
Kannedhara, Gopi
Nguyen, Joakim
Yu, Jian
Fang, Jinrui
Baghdadi, Abdurrahmaan
Chen, Tianlong
Naeem, Awais
Krishnan, Chandra
Castillo, Edward
Song, Andrew H.
Shukla, Ankita
Ding, Ying
Konz, Nicholas
Wang, Hairong
Artificial Intelligence
Healthcare models are transitioning from unimodal prediction toward multimodal reasoning over heterogeneous diagnostic inputs. In computational pathology, for complex tumor subtypes where morphology alone can be challenging to distinguish, pathology reports and molecular measurements may provide additional diagnostic evidence alongside whole-slide images, yet existing models often fail to clarify how diverse signals assemble into recognizable diagnostic concepts. We propose ConceptM$^3$oE (Concept Multimodal MoE), which embeds concept formation directly within interaction-aware mixture-of-experts (MoE) pathways. The architecture decomposes evidence into modality-specific, redundant, and synergistic experts, which are then projected into structured concept bottlenecks mapping latent features to a hierarchy of morphology and biomarker concepts. To prevent the information loss typical of interpretable bottlenecks, we utilize residual pathways within each expert to allow task-relevant signals to flow both through the concepts and directly to the final task prediction, so that high performance is maintained alongside interpretability. Across an institutional pediatric brain tumor cohort and a public glioma cohort, the framework delivers competitive performance to unconstrained models while producing reasoning traces validated by an independent neuropathologist. In data-limited regimes, ConceptM$^3$oE improves limited-data performance, increasing macro-F1 from 56.41% to 66.70% at small training sizes compared to non-concept-informed baselines, while also showing faster training convergence consistent with the regularizing effect of concept learning. This work offers a scalable path toward high-performance medical AI that is inherently verifiable and better aligned with the complex decision-making of clinical practice.
title ConceptM$^3$oE: Concept-Guided Multimodal Mixture of Experts for Interpretable Computational Pathology
topic Artificial Intelligence
url https://arxiv.org/abs/2605.24399