Saved in:
Bibliographic Details
Main Authors: Pu, Yao, Shi, Yiming, Zhang, Zhenxi, Yu, Peixin, Zhuang, Yitao, Wang, Xiang, Chen, Hongzhao, Cai, Jing, Ren, Ge
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.08822
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911434842046464
author Pu, Yao
Shi, Yiming
Zhang, Zhenxi
Yu, Peixin
Zhuang, Yitao
Wang, Xiang
Chen, Hongzhao
Cai, Jing
Ren, Ge
author_facet Pu, Yao
Shi, Yiming
Zhang, Zhenxi
Yu, Peixin
Zhuang, Yitao
Wang, Xiang
Chen, Hongzhao
Cai, Jing
Ren, Ge
contents Magnetic resonance imaging (MRI) is essential for nasopharyngeal carcinoma (NPC) radiotherapy (RT), but practical constraints, such as patient discomfort, long scan times, and high costs often lead to incomplete modalities in clinical practice, compromising RT planning accuracy. Traditional MRI synthesis methods are modality-specific, limited in anatomical adaptability, and lack clinical interpretability-failing to meet NPC's RT needs. Here, we developed a unified foundation model integrating contrastive visual representation learning and vision-language alignment (VLA) to enable any-to-all MRI synthesis. The model uses a contrastive encoder for modality-invariant representations and a CLIP-based text-informed decoder for semantically consistent synthesis, supporting any-to-all MRI synthesis via one unified foundation model. Trained on 40,825 images from 13 institutions, it achieves consistently high performance (average SSIM 0.90, PSNR 27) across 26 internal/external validation sites (15,748 images), with superior synthesis fidelity and robustness to noise and domain shifts. Meanwhile, its unified representation enhances downstream RT-relevant tasks (e.g., segmentation). This work advances digital medicine solutions for NPC care by leveraging foundation models to bridge technical synthesis and clinical utility.
format Preprint
id arxiv_https___arxiv_org_abs_2602_08822
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Any-to-All MRI Synthesis: A Unified Foundation Model for Nasopharyngeal Carcinoma and Its Downstream Applications
Pu, Yao
Shi, Yiming
Zhang, Zhenxi
Yu, Peixin
Zhuang, Yitao
Wang, Xiang
Chen, Hongzhao
Cai, Jing
Ren, Ge
Computer Vision and Pattern Recognition
Magnetic resonance imaging (MRI) is essential for nasopharyngeal carcinoma (NPC) radiotherapy (RT), but practical constraints, such as patient discomfort, long scan times, and high costs often lead to incomplete modalities in clinical practice, compromising RT planning accuracy. Traditional MRI synthesis methods are modality-specific, limited in anatomical adaptability, and lack clinical interpretability-failing to meet NPC's RT needs. Here, we developed a unified foundation model integrating contrastive visual representation learning and vision-language alignment (VLA) to enable any-to-all MRI synthesis. The model uses a contrastive encoder for modality-invariant representations and a CLIP-based text-informed decoder for semantically consistent synthesis, supporting any-to-all MRI synthesis via one unified foundation model. Trained on 40,825 images from 13 institutions, it achieves consistently high performance (average SSIM 0.90, PSNR 27) across 26 internal/external validation sites (15,748 images), with superior synthesis fidelity and robustness to noise and domain shifts. Meanwhile, its unified representation enhances downstream RT-relevant tasks (e.g., segmentation). This work advances digital medicine solutions for NPC care by leveraging foundation models to bridge technical synthesis and clinical utility.
title Any-to-All MRI Synthesis: A Unified Foundation Model for Nasopharyngeal Carcinoma and Its Downstream Applications
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.08822