Mensing et al. build first joint latent diffusion model for MRI and patient data
A multimodal model that co-generates brain scans and clinical metrics like age and sex via cross-attention...
Daniel Mensing and colleagues from the University of Bremen and Fraunhofer MEVIS introduce a latent diffusion model that jointly synthesizes volumetric brain MRI and mixed-type tabular clinical data (age, sex, body measurements, ethnicity) in a single framework. The model uses a variational autoencoder to fuse both modalities into a shared latent space, then applies diffusion-based synthesis with cross-attention to ensure coherence. Separate decoders reconstruct the MRI volumes and tabular features from the joint representation, enabling modality-appropriate outputs that are mutually consistent—e.g., generating an MRI with brain anatomy that matches the synthesized patient's age and BMI.
Evaluated on the German National Cohort (NAKO), which includes over 10,000 participants, the generated MRI volumes show high anatomical plausibility, validated by Fréchet distance and precision-recall metrics. For the tabular modality, the model outperforms CTGAN and achieves results comparable to TVAE across standard metrics. The authors claim this is the first demonstration of joint multimodal latent diffusion for MRI and clinical data, positioning it as a key step toward creating digital twins in healthcare—detailed synthetic patient representations that could accelerate research and personalized medicine.
- First multimodal latent diffusion model to jointly synthesize volumetric MRI and mixed-type tabular clinical data using cross-attention in a shared latent space.
- Trained on over 10,000 participants from the German National Cohort (NAKO), generating anatomically plausible MRI volumes consistent with synthesized clinical attributes.
- Outperformed CTGAN on tabular data metrics and achieved high-fidelity image generation, serving as a proof-of-concept for healthcare digital twins.
Why It Matters
Enables realistic, coherent synthetic patient data, advancing digital twin technology for personalized medicine and medical research.