Saved in:
Bibliographic Details
Main Authors: Luu, Minh Sao Khue, Tuchinov, Bair N.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.03014
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911249024942080
author Luu, Minh Sao Khue
Tuchinov, Bair N.
author_facet Luu, Minh Sao Khue
Tuchinov, Bair N.
contents We present a foundation model for brain MRI that can work with different combinations of imaging sequences. The model uses one encoder with learnable modality embeddings, conditional layer normalization, and a masked autoencoding objective that accounts for missing modalities. A variance-covariance regularizer is applied to stabilize feature learning and improve representation diversity. This design removes the need for separate models for each modality and allows the network to adapt when some sequences are missing or unseen. It is trained on about 60,000 multi-center MRIs using self-supervised reconstruction and modality imputation to learn flexible representations. A learnable modality embedding guides feature extraction so the encoder can adjust to different inputs. We describe our planned evaluation on brain tumor and multiple sclerosis segmentation, as well as lesion classification, under various modality settings. Preliminary results show that the method works feasibly, and further experiments are planned to study its performance in more detail. All code and pretrained models are available at https://github.com/BrainFM/brainfm
format Preprint
id arxiv_https___arxiv_org_abs_2511_03014
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Foundation Model for Brain MRI with Dynamic Modality Integration
Luu, Minh Sao Khue
Tuchinov, Bair N.
Computer Vision and Pattern Recognition
We present a foundation model for brain MRI that can work with different combinations of imaging sequences. The model uses one encoder with learnable modality embeddings, conditional layer normalization, and a masked autoencoding objective that accounts for missing modalities. A variance-covariance regularizer is applied to stabilize feature learning and improve representation diversity. This design removes the need for separate models for each modality and allows the network to adapt when some sequences are missing or unseen. It is trained on about 60,000 multi-center MRIs using self-supervised reconstruction and modality imputation to learn flexible representations. A learnable modality embedding guides feature extraction so the encoder can adjust to different inputs. We describe our planned evaluation on brain tumor and multiple sclerosis segmentation, as well as lesion classification, under various modality settings. Preliminary results show that the method works feasibly, and further experiments are planned to study its performance in more detail. All code and pretrained models are available at https://github.com/BrainFM/brainfm
title A Foundation Model for Brain MRI with Dynamic Modality Integration
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2511.03014