Synthetic training data generation from a single image for enhanced breast cancer diagnosis

dc.contributor.advisorDíaz, Oliver
dc.contributor.advisorOsuala, Richard
dc.contributor.authorBuetas Arcas, Marta
dc.date.accessioned2024-06-13T09:43:26Z
dc.date.available2024-06-13T09:43:26Z
dc.date.issued2023-06-29
dc.descriptionTreballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Curs: 2022-2023. Tutor: Oliver Díaz i Richard Osualaca
dc.description.abstract[en] According to the World Health Organisation (WHO), breast cancer is one of the cancer types with a high prevalence worldwide. Deep-learning based computeraided detection systems have shown promising potential in improving the curability and reducing mortality rates through early detection in mammography screening. Artificial Intelligence (AI) has become a popular tool in medicine, aiming to reduce costs and assist radiologists in decision-making processes. However, AI in cancer imaging presents significant challenges, including data access and privacy issues, as well as a scarcity of expert-annotated medical imaging. Motivated by these factors, this project aims to enhance the robustness and generalisability of breast cancer classification tools. The study focuses on obtaining a pre-biopsy result of suspicious areas in mammograms, providing a comprehensive assessment of lesion nature. It was observed that the classifier’s performance for the malignant class was inferior to that of the other classes, and the tightness of the annotation mask around the lesion significantly influenced the classifier’s performance. To improve the performance for malignant lesions, the study investigates data augmentation based in single image Generative Adversarial Network (SinGAN) to balance this underrepresented class. To the best of our knowledge, this project represents a novel investigation into the application of single-image generative models for breast cancer, addressing the challenge of expert annotation scarcity. Promising results were observed through the use of SinGAN-based data augmentation. The classification model, trained with SinGAN-augmented training data, demonstrated a higher area under the receiver operating characteristic (AUROC) for the malignant class (0.718 ± 0.044), compared to the same model without augmented data (0.677 ± 0.076). Furthermore, it was also identified an unexpected trend during the experiments. It was observed that using more SinGANs for data augmentation did not always result in a higher enhancement of performance. This project opens up new research possibilities through collaboration with healthcare experts. Its ultimate goal is to analyse and validate a mitigation strategy for improving robustness and, as such, trustworthiness of AI-based applications for adoption in the clinical workflow.ca
dc.format.extent53 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/212960
dc.language.isoengca
dc.rightscc-by-nc-nd (c) Marta Buetas Arcas, 2023
dc.rightscodi: MIT (c) Marta Buetas Arcas, 2023
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.rights.urihttps://opensource.org/license/MIT*
dc.sourceMàster Oficial - Fonaments de la Ciència de Dades
dc.subject.classificationCàncer de mama
dc.subject.classificationAprenentatge automàtic
dc.subject.classificationDiagnòstic per la imatge
dc.subject.classificationTreballs de fi de màster
dc.subject.classificationSistemes classificadors (Intel·ligència artificial)ca
dc.subject.otherBreast cancer
dc.subject.otherMachine learning
dc.subject.otherDiagnostic imaging
dc.subject.otherMaster's thesis
dc.subject.otherLearning classifier systemsen
dc.titleSynthetic training data generation from a single image for enhanced breast cancer diagnosisca
dc.typeinfo:eu-repo/semantics/masterThesisca

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
tfm_buetas_arcas_marta.pdf
Mida:
5.7 MB
Format:
Adobe Portable Document Format
Descripció:
Memòria
Carregant...
Miniatura
Nom:
IWBIconference_EnhancingBreastCancerDiagnosis-main.zip
Mida:
6.47 MB
Format:
ZIP file
Descripció:
Codi font