Abstract

The last few years have seen a boom in using generative models to augment real datasets, as synthetic data can effectively model real data distributions and provide privacy-preserving, shareable datasets for training deep learning models. However, most of these methods are 2D and provide synthetic datasets that come, at most, with categorical annotations. The generation of paired images and segmentation samples that can be used in downstream, supervised segmentation tasks remains fairly uncharted territory. This work proposes a two-stage generative model capable of producing 2D and 3D semantic label maps and corresponding multi-modal images. We use a latent diffusion model for label synthesis and a VAE-GAN for semantic image synthesis. Synthetic datasets produced by this model are shown to work in a wide variety of segmentation tasks, augmenting small real datasets or fully replacing them while maintaining good performance. We also demonstrate its ability to improve downstream performance on out-of-distribution data.
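The two-stage pipeline described above can be sketched schematically: a first stage samples a discrete semantic label map (the role the paper assigns to a latent diffusion model), and a second stage synthesizes a modality-specific image conditioned on that label map (the role of the VAE-GAN). The sketch below is a toy illustration only; the function names, class count, modality list, and the trivial "denoising" loop are assumptions for exposition, not the paper's implementation.

```python
import random

# Toy sketch of the two-stage label-then-image pipeline.
# All names and shapes here are illustrative assumptions; the paper uses
# a trained latent diffusion model (stage 1) and a VAE-GAN (stage 2).

NUM_CLASSES = 4                      # assumed: background + tissue classes
MODALITIES = ["T1", "T2", "FLAIR"]   # assumed example MRI modalities


def stage1_sample_label_map(height, width, steps=10, seed=0):
    """Stand-in for diffusion-based label synthesis: start from noise and
    iteratively refine, then quantize into discrete class indices."""
    rng = random.Random(seed)
    latent = [[rng.gauss(0.0, 1.0) for _ in range(width)] for _ in range(height)]
    for _ in range(steps):
        # A real model would apply a learned denoiser; here we just damp noise.
        latent = [[0.5 * v for v in row] for row in latent]
    # Quantize deterministically into class labels 0..NUM_CLASSES-1.
    return [[abs(hash((i, j, seed))) % NUM_CLASSES for j in range(width)]
            for i in range(height)]


def stage2_synthesize_image(label_map, modality):
    """Stand-in for VAE-GAN semantic image synthesis: the output image is
    conditioned on the label map, with modality-specific intensities."""
    base = {m: 40 * (k + 1) for k, m in enumerate(MODALITIES)}[modality]
    return [[base + 10 * c for c in row] for row in label_map]


# A paired sample: one label map plus one image per modality.
labels = stage1_sample_label_map(4, 4)
pair = {m: stage2_synthesize_image(labels, m) for m in MODALITIES}
```

The key design point the sketch mirrors is that the label map is generated first and then shared across all modalities, so the resulting images and segmentation are paired by construction and can supervise downstream segmentation training.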

Original language: English
Article number: 103278
Pages (from-to): 103278
Journal: Medical Image Analysis
Volume: 97
Early online date: 18 Jul 2024
Publication status: Published - Oct 2024

Title: Generating multi-pathological and multi-modal images and labels for brain MRI
