Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/213240
Full metadata record
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.advisor | Barrios, Juan Ignacio | - |
| dc.contributor.author | Marsol Torrent, Sergi | - |
| dc.date.accessioned | 2024-06-14T15:38:00Z | - |
| dc.date.available | 2024-06-14T15:38:00Z | - |
| dc.date.issued | 2024-06-05 | - |
| dc.identifier.uri | http://hdl.handle.net/2445/213240 | - |
| dc.description | Bachelor's Final Projects in Biomedical Engineering. Faculty of Medicine and Health Sciences. Universitat de Barcelona. Academic year: 2023-2024. Tutor/Director: Juan Ignacio Barrios; Directors: Luis Gascó, Martin Krallinger | ca |
| dc.description.abstract | The medical sector generates vast amounts of unstructured data, which, if processed correctly, can significantly enhance medical processes and their outcomes. This thesis presents the development of KeyCARE, a Python library for keyword extraction, term categorization, and relation extraction that addresses this need. Using mainly unsupervised and few-shot methods, KeyCARE efficiently extracts classified keywords from medical records with a recall of up to 98% and an F-score of up to 61%, with partial overlaps considered correct. While these scores are not comparable to those of supervised Named Entity Recognition systems, they set a high standard for an unsupervised alternative in scenarios of data scarcity. Moreover, the library incorporates relation extractors that identify hierarchical relationships among biomedical keywords and with terminologies, achieving a precision and recall of 93%. This has a clear application in terminology enrichment, data generation and information extraction, particularly in specific domains and low-resource languages such as Catalan. This thesis covers the full development of KeyCARE, including an in-depth evaluation of the implemented models as well as basic use cases demonstrating its practical applications. | ca |
| dc.format.extent | 76 p. | - |
| dc.format.mimetype | application/pdf | - |
| dc.language.iso | eng | ca |
| dc.rights | cc-by-nc-nd (c) Sergi Marsol Torrent, 2024 | - |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
| dc.source | Treballs Finals de Grau (TFG) - Enginyeria Biomèdica | - |
| dc.subject.classification | Enginyeria biomèdica | - |
| dc.subject.classification | Materials biomèdics | - |
| dc.subject.classification | Treballs de fi de grau | - |
| dc.subject.other | Biomedical engineering | - |
| dc.subject.other | Biomedical materials | - |
| dc.subject.other | Bachelor's theses | - |
| dc.title | KeyCARE: a framework for biomedical Keyword Extraction, term Categorization, and semantic Relation | ca |
| dc.type | info:eu-repo/semantics/bachelorThesis | ca |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca |
Appears in Collections: Treballs Finals de Grau (TFG) - Enginyeria Biomèdica

Files in This Item:
| File | Description | Size | Format |
| --- | --- | --- | --- |
| TFG_ Marsol_Torrent_Sergi.pdf | | 3.74 MB | Adobe PDF |


This item is licensed under a Creative Commons License.