Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pu, Yao, Shi, Yiming, Zhang, Zhenxi, Yu, Peixin, Zhuang, Yitao, Wang, Xiang, Chen, Hongzhao, Cai, Jing, Ren, Ge
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.08822
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911434842046464
author	Pu, Yao Shi, Yiming Zhang, Zhenxi Yu, Peixin Zhuang, Yitao Wang, Xiang Chen, Hongzhao Cai, Jing Ren, Ge
author_facet	Pu, Yao Shi, Yiming Zhang, Zhenxi Yu, Peixin Zhuang, Yitao Wang, Xiang Chen, Hongzhao Cai, Jing Ren, Ge
contents	Magnetic resonance imaging (MRI) is essential for nasopharyngeal carcinoma (NPC) radiotherapy (RT), but practical constraints, such as patient discomfort, long scan times, and high costs often lead to incomplete modalities in clinical practice, compromising RT planning accuracy. Traditional MRI synthesis methods are modality-specific, limited in anatomical adaptability, and lack clinical interpretability-failing to meet NPC's RT needs. Here, we developed a unified foundation model integrating contrastive visual representation learning and vision-language alignment (VLA) to enable any-to-all MRI synthesis. The model uses a contrastive encoder for modality-invariant representations and a CLIP-based text-informed decoder for semantically consistent synthesis, supporting any-to-all MRI synthesis via one unified foundation model. Trained on 40,825 images from 13 institutions, it achieves consistently high performance (average SSIM 0.90, PSNR 27) across 26 internal/external validation sites (15,748 images), with superior synthesis fidelity and robustness to noise and domain shifts. Meanwhile, its unified representation enhances downstream RT-relevant tasks (e.g., segmentation). This work advances digital medicine solutions for NPC care by leveraging foundation models to bridge technical synthesis and clinical utility.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_08822
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Any-to-All MRI Synthesis: A Unified Foundation Model for Nasopharyngeal Carcinoma and Its Downstream Applications Pu, Yao Shi, Yiming Zhang, Zhenxi Yu, Peixin Zhuang, Yitao Wang, Xiang Chen, Hongzhao Cai, Jing Ren, Ge Computer Vision and Pattern Recognition Magnetic resonance imaging (MRI) is essential for nasopharyngeal carcinoma (NPC) radiotherapy (RT), but practical constraints, such as patient discomfort, long scan times, and high costs often lead to incomplete modalities in clinical practice, compromising RT planning accuracy. Traditional MRI synthesis methods are modality-specific, limited in anatomical adaptability, and lack clinical interpretability-failing to meet NPC's RT needs. Here, we developed a unified foundation model integrating contrastive visual representation learning and vision-language alignment (VLA) to enable any-to-all MRI synthesis. The model uses a contrastive encoder for modality-invariant representations and a CLIP-based text-informed decoder for semantically consistent synthesis, supporting any-to-all MRI synthesis via one unified foundation model. Trained on 40,825 images from 13 institutions, it achieves consistently high performance (average SSIM 0.90, PSNR 27) across 26 internal/external validation sites (15,748 images), with superior synthesis fidelity and robustness to noise and domain shifts. Meanwhile, its unified representation enhances downstream RT-relevant tasks (e.g., segmentation). This work advances digital medicine solutions for NPC care by leveraging foundation models to bridge technical synthesis and clinical utility.
title	Any-to-All MRI Synthesis: A Unified Foundation Model for Nasopharyngeal Carcinoma and Its Downstream Applications
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2602.08822

Similar Items