Use of Machine Learning and SNOMED CT Encoded Health Problems to Predict Hospital Discharge Diagnoses

dc.contributor.advisorJ. C. Barrios Montenegro
dc.contributor.authorChen, Cindy
dc.date.accessioned2025-06-27T16:27:37Z
dc.date.available2025-06-27T16:27:37Z
dc.date.issued2025-06-11
dc.descriptionTreballs Finals de Grau d'Enginyeria Biomèdica. Facultat de Medicina i Ciències de la Salut. Universitat de Barcelona. Curs: 2024-2025. Tutor: Barrios Montenegro, J. C. ; Director: Santiago Fridca
dc.description.abstractThe accurate classification of discharge diagnoses is a critical step in clinical decision-making, as it has direct effect on patient care, hospital management, and administrative tasks. Traditionally, diagnostic coding has been a manual and time-consuming process, typically done after a patient is discharged, which could lead to delays for subsequent processes such as billing, reporting, and care optimization. Recently, the Hospital Clínic de Barcelona has integrated a structured list of health problems coded in SNOMED CT into the Electronic Health Record (EHR) from the beginning of the patient’s hospitalization. This development has enabled the reuse of structured clinical data throughout the care process and has opened the door for predictive tools using Machine Learning (ML). The goal of this research is to determine whether there’s a significant relationship between reported health problems and the final ICD-10 discharge diagnoses. To explore this, data obtained from the Hospital Clínic de Barcelona was analysed, incorporating information from various clinical sources, such as demographics, laboratory results, prescriptions, and admissions records. Feature engineering was also carried out and methods based on decision trees, along with ANOVA tests, were used to identify the most relevant input variables. Subsequently, several supervised ML models, including Decision Trees (DTs), Random Forest (RF), and XGBoost were trained and evaluated. The best performing model, a Decision Tree classifier, achieved an accuracy of 69.8%, with a recall and F1-score of 0.68, and an AUC of 0.83. While no single variable served as a dominant predictor, the results show that health problems coded in SNOMED CT, combined with other clinical andca
dc.format.extent80 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/221864
dc.language.isoengca
dc.rightscc-by-nc-nd (c) autor, 202X
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.sourceTreballs Finals de Grau (TFG) - Enginyeria Biomèdica
dc.subject.classificationEnginyeria biomèdica
dc.subject.classificationAprenentatge automàtic
dc.subject.classificationTreballs de fi de grau
dc.subject.otherBiomedical engineering
dc.subject.otherMachine learning
dc.subject.otherBachelor's theses
dc.titleUse of Machine Learning and SNOMED CT Encoded Health Problems to Predict Hospital Discharge Diagnosesca
dc.typeinfo:eu-repo/semantics/bachelorThesisca

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
TFG_GEB_Chen_Cindy.pdf
Mida:
7.58 MB
Format:
Adobe Portable Document Format
Descripció: