NFDI4DS | UHH-SEMS - Publication Details

Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation

FOS: Computer and information sciences Computer Vision and Pattern Recognition (cs.CV) Image and Video Processing (eess.IV) Computer Science - Computer Vision and Pattern Recognition FOS: Electrical engineering, electronic engineering, information engineering Electrical Engineering and Systems Science - Image and Video Processing

DOI: 10.48550/arxiv.2409.16818 Publication Date: 2024-01-01

Abstract Supplemental Material References Cited by

AUTHORS (10)

Wang, Yulin

Xiong, Honglin

Sun, Kaicong

Bai, Shuwei

Dai, Ling

Ding, Zhongxiang

Liu, Jiameng

Wang, Qian

Liu, Qian

Shen, Dinggang

ABSTRACT

Multimodal brain magnetic resonance (MR) imaging is indispensable in neuroscience and neurology. However, due to the accessibility of MRI scanners and their lengthy acquisition time, multimodal MR images are not commonly available. Current MR image synthesis approaches are typically trained on independent datasets for specific tasks, leading to suboptimal performance when applied to novel datasets and tasks. Here, we present TUMSyn, a Text-guided Universal MR image Synthesis generalist model, which can flexibly generate brain MR images with demanded imaging metadata from routinely acquired scans guided by text prompts. To ensure TUMSyn's image synthesis precision, versatility, and generalizability, we first construct a brain MR database comprising 31,407 3D images with 7 MRI modalities from 13 centers. We then pre-train an MRI-specific text encoder using contrastive learning to effectively control MR image synthesis based on text prompts. Extensive experiments on diverse datasets and physician assessments indicate that TUMSyn can generate clinically meaningful MR images with specified imaging metadata in supervised and zero-shot scenarios. Therefore, TUMSyn can be utilized along with acquired MR scan(s) to facilitate large-scale MRI-based screening and diagnosis of brain diseases.<br/>23 pages, 9 figures<br/>

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....