A Domain Adaptation Framework for Harmonized Representation Learning in Medical Datasets

dc.contributor.advisorPujol Vila, Oriol
dc.contributor.advisorLobato Delgado, Bárbara
dc.contributor.authorVara Mira, Alejandro
dc.date.accessioned2026-04-01T14:42:05Z
dc.date.available2026-04-01T14:42:05Z
dc.date.issued2026-01-17
dc.descriptionTreballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Any: 2026. Tutor: Oriol Pujol Vila i Bárbara Lobato Delgado
dc.description.abstractThis Master’s Thesis addresses the critical challenge of clinical data fragmentation and the prohibitive costs of medical data acquisition by proposing a deep learning architecture for cross-dataset knowledge transfer. While the medical community possesses vast amounts of data, it remains largely trapped in isolated silos characterized by structural heterogeneity and measurement bias. To bridge these gaps, this research introduces a multi-branch neural framework that leverages a large-scale auxiliary dataset, MIMIC-III, to enrich the latent representations of smaller, specialized target datasets. The methodology centers on a dual-encoding strategy where a shared encoder extracts robust statistical patterns from common clinical attributes across populations, while independent private encoders preserve domain-specific niche variables. Empirical validation in the context of ICU mortality prediction demonstrates that this harmonized representation learning consistently improves Precision-Recall and AUC-ROC metrics. By employing a rigorous methodology upon sequential experiments, the study confirms that these performance gains are statistically significant and directly attributable to the enhanced feature representation, rather than artifacts of stochasticity or overfitting. Ultimately, this work provides a scalable blueprint for clinical data codification, proving that common attributes can serve as a functional bridge to maximize the utility of existing medical records in data-constrained environments.
dc.format.extent25 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/228662
dc.language.isoeng
dc.rightscc-by-nc-nd (c) Alejandro Vara Mira, 2026
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.sourceMàster Oficial - Fonaments de la Ciència de Dades
dc.subject.classificationInformàtica mèdica
dc.subject.classificationAprenentatge per transferència
dc.subject.classificationAprenentatge profund
dc.subject.classificationMedicina basada en l'evidència
dc.subject.classificationAlejandro Vara Mira
dc.subject.classificationTreballs de fi de màster
dc.subject.otherMedical informatics
dc.subject.otherTransfer learning (Machine learning)
dc.subject.otherDeep learning (Machine learning)
dc.subject.otherEvidence-based medicine
dc.subject.otherMaster's thesis
dc.titleA Domain Adaptation Framework for Harmonized Representation Learning in Medical Datasets
dc.typeinfo:eu-repo/semantics/masterThesis

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
TFM_Vara_Mira_Alejandro.pdf
Mida:
1.73 MB
Format:
Adobe Portable Document Format
Carregant...
Miniatura
Nom:
TFM Alejandro Vara.zip
Mida:
20.49 MB
Format:
ZIP file