Please use this identifier to cite or link to this item:
https://hdl.handle.net/2445/220771
Title: | Medigan: A Python library of pretrained generative models for medical image synthesis |
Author: | Osuala, Richard Skorupko, Grzegorz Lazrak, Noussair Garrucho, Lidia García, Eloy Joshi, Smriti Jouide, Socayna Rutherford, Michael Prior, Fred Kushibar, Kaisar Díaz, Oliver Lekadir, Karim, 1977- |
Keywords: | Imatges mèdiques Aprenentatge automàtic Dades massives Imaging systems in medicine Machine learning Big data |
Issue Date: | 20-Feb-2023 |
Publisher: | Society of Photo-Optical Instrumentation Engineers (SPIE) |
Abstract: | Purpose: Deep learning has shown great promise as the backbone of clinical decision support systems. Synthetic data generated by generative models can enhance the performance and capabilities of data-hungry deep learning models. However, there is (1) limited availability of (synthetic) datasets and (2) generative models are complex to train, which hinders their adoption in research and clinical applications. To reduce this entry barrier, we explore generative model sharing to allow more researchers to access, generate, and benefit from synthetic data. Approach: We propose medigan, a one-stop shop for pretrained generative models imple- mented as an open-source framework-agnostic Python library. After gathering end-user requirements, design decisions based on usability, technical feasibility, and scalability are formulated. Subsequently, we implement medigan based on modular components for generative model (i) execution, (ii) visualization, (iii) search & ranking, and (iv) contribution. We integrate pre- trained models with applications across modalities such as mammography, endoscopy, x-ray, and MRI. Results: The scalability and design of the library are demonstrated by its growing number of integrated and readily-usable pretrained generative models, which include 21 models utilizing nine different generative adversarial network architectures trained on 11 different datasets. We further analyze three medigan applications, which include (a) enabling community-wide sharing of restricted data, (b) investigating generative model evaluation metrics, and (c) improving clinical downstream tasks. In (b), we extract Fréchet inception distances (FID) demonstrating FID variability based on image normalization and radiology-specific feature extractors. Conclusion: medigan allows researchers and developers to create, increase, and domain-adapt their training data in just a few lines of code. Capable of enriching and accelerating the development of clinical machine learning models, we show medigan’s viability as platform for generative model sharing. Our multimodel synthetic data experiments uncover standards for assessing and reporting metrics, such as FID, in image synthesis studies. |
Note: | Reproducció del document publicat a: https://doi.org/10.1117/1.JMI.10.6.061403 |
It is part of: | Journal of Medical Imaging, 2023, vol. 10, num.6 |
URI: | https://hdl.handle.net/2445/220771 |
Related resource: | https://doi.org/10.1117/1.JMI.10.6.061403 |
ISSN: | 2329-4302 |
Appears in Collections: | Articles publicats en revistes (Matemàtiques i Informàtica) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
841885.pdf | 4.47 MB | Adobe PDF | View/Open |
This item is licensed under a
Creative Commons License