TY - JOUR
T1 - Generating multi-pathological and multi-modal images and labels for brain MRI
AU - Fernandez, Virginia
AU - Pinaya, Walter Hugo Lopez
AU - Borges, Pedro
AU - Graham, Mark S
AU - Tudosiu, Petru-Daniel
AU - Vercauteren, Tom
AU - Cardoso, M Jorge
N1 - Copyright © 2024 The Authors. Published by Elsevier B.V. All rights reserved.
PY - 2024/10
Y1 - 2024/10
N2 - The last few years have seen a boom in using generative models to augment real datasets, as synthetic data can effectively model real data distributions and provide privacy-preserving, shareable datasets that can be used to train deep learning models. However, most of these methods are 2D and provide synthetic datasets that come, at most, with categorical annotations. The generation of paired images and segmentation samples that can be used in downstream, supervised segmentation tasks remains fairly uncharted territory. This work proposes a two-stage generative model capable of producing 2D and 3D semantic label maps and corresponding multi-modal images. We use a latent diffusion model for label synthesis and a VAE-GAN for semantic image synthesis. Synthetic datasets provided by this model are shown to work in a wide variety of segmentation tasks, supporting small, real datasets or fully replacing them while maintaining good performance. We also demonstrate its ability to improve downstream performance on out-of-distribution data.
AB - The last few years have seen a boom in using generative models to augment real datasets, as synthetic data can effectively model real data distributions and provide privacy-preserving, shareable datasets that can be used to train deep learning models. However, most of these methods are 2D and provide synthetic datasets that come, at most, with categorical annotations. The generation of paired images and segmentation samples that can be used in downstream, supervised segmentation tasks remains fairly uncharted territory. This work proposes a two-stage generative model capable of producing 2D and 3D semantic label maps and corresponding multi-modal images. We use a latent diffusion model for label synthesis and a VAE-GAN for semantic image synthesis. Synthetic datasets provided by this model are shown to work in a wide variety of segmentation tasks, supporting small, real datasets or fully replacing them while maintaining good performance. We also demonstrate its ability to improve downstream performance on out-of-distribution data.
UR - http://www.scopus.com/inward/record.url?scp=85199350492&partnerID=8YFLogxK
U2 - 10.1016/j.media.2024.103278
DO - 10.1016/j.media.2024.103278
M3 - Article
C2 - 39059240
SN - 1361-8415
VL - 97
SP - 103278
JO - Medical Image Analysis
JF - Medical Image Analysis
M1 - 103278
ER -