Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/220771
Title: Medigan: A Python library of pretrained generative models for medical image synthesis
Author: Osuala, Richard
Skorupko, Grzegorz
Lazrak, Noussair
Garrucho, Lidia
García, Eloy
Joshi, Smriti
Jouide, Socayna
Rutherford, Michael
Prior, Fred
Kushibar, Kaisar
Díaz, Oliver
Lekadir, Karim, 1977-
Keywords: Imatges mèdiques
Aprenentatge automàtic
Dades massives
Imaging systems in medicine
Machine learning
Big data
Issue Date: 20-Feb-2023
Publisher: Society of Photo-Optical Instrumentation Engineers (SPIE)
Abstract: Purpose: Deep learning has shown great promise as the backbone of clinical decision support systems. Synthetic data generated by generative models can enhance the performance and capabilities of data-hungry deep learning models. However, there is (1) limited availability of (synthetic) datasets and (2) generative models are complex to train, which hinders their adoption in research and clinical applications. To reduce this entry barrier, we explore generative model sharing to allow more researchers to access, generate, and benefit from synthetic data. Approach: We propose medigan, a one-stop shop for pretrained generative models imple- mented as an open-source framework-agnostic Python library. After gathering end-user requirements, design decisions based on usability, technical feasibility, and scalability are formulated. Subsequently, we implement medigan based on modular components for generative model (i) execution, (ii) visualization, (iii) search & ranking, and (iv) contribution. We integrate pre- trained models with applications across modalities such as mammography, endoscopy, x-ray, and MRI. Results: The scalability and design of the library are demonstrated by its growing number of integrated and readily-usable pretrained generative models, which include 21 models utilizing nine different generative adversarial network architectures trained on 11 different datasets. We further analyze three medigan applications, which include (a) enabling community-wide sharing of restricted data, (b) investigating generative model evaluation metrics, and (c) improving clinical downstream tasks. In (b), we extract Fréchet inception distances (FID) demonstrating FID variability based on image normalization and radiology-specific feature extractors. Conclusion: medigan allows researchers and developers to create, increase, and domain-adapt their training data in just a few lines of code. Capable of enriching and accelerating the development of clinical machine learning models, we show medigan’s viability as platform for generative model sharing. Our multimodel synthetic data experiments uncover standards for assessing and reporting metrics, such as FID, in image synthesis studies.
Note: Reproducció del document publicat a: https://doi.org/10.1117/1.JMI.10.6.061403
It is part of: Journal of Medical Imaging, 2023, vol. 10, num.6
URI: https://hdl.handle.net/2445/220771
Related resource: https://doi.org/10.1117/1.JMI.10.6.061403
ISSN: 2329-4302
Appears in Collections:Articles publicats en revistes (Matemàtiques i Informàtica)

Files in This Item:
File Description SizeFormat 
841885.pdf4.47 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons