KeyCARE: a framework for biomedical Keyword Extraction, term Categorization, and semantic Relation

dc.contributor.advisor: Barrios, Juan Ignacio
dc.contributor.author: Marsol Torrent, Sergi
dc.date.accessioned: 2024-06-14T15:38:00Z
dc.date.available: 2024-06-14T15:38:00Z
dc.date.issued: 2024-06-05
dc.description: Bachelor's theses in Biomedical Engineering. Faculty of Medicine and Health Sciences. Universitat de Barcelona. Academic year: 2023-2024. Tutor/Director: Juan Ignacio Barrios; Director: Luis Gascó, Martin Krallinger
dc.description.abstract: The medical sector generates vast amounts of unstructured data, which, if processed correctly, can significantly enhance medical processes and their outcomes. This thesis presents the development of KeyCARE, a Python library for keyword extraction, term categorization, and relation extraction that tackles this need. Utilizing mainly unsupervised and few-shot methods, KeyCARE efficiently extracts classified keywords from medical records with a recall of up to 98% and an F-score of up to 61%, with partial overlaps considered as correct. While these scores are not comparable to those of supervised Named Entity Recognition systems, they set a high standard for an unsupervised alternative in scenarios of data scarcity. Moreover, the library incorporates relation extractors that identify hierarchical relationships among biomedical keywords and with terminologies, achieving a precision and recall of 93%. This has a clear application in terminology enrichment, data generation and information extraction, particularly in specific domains and low-resource languages such as Catalan. This thesis encompasses the comprehensive development of KeyCARE, including an in-depth evaluation of the implemented models as well as basic use cases demonstrating its practical applications.
dc.format.extent: 76 p.
dc.format.mimetype: application/pdf
dc.identifier.uri: https://hdl.handle.net/2445/213240
dc.language.iso: eng
dc.rights: cc-by-nc-nd (c) Sergi Marsol Torrent, 2024
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.source: Treballs Finals de Grau (TFG) - Enginyeria Biomèdica
dc.subject.classification: Enginyeria biomèdica
dc.subject.classification: Materials biomèdics
dc.subject.classification: Treballs de fi de grau
dc.subject.other: Biomedical engineering
dc.subject.other: Biomedical materials
dc.subject.other: Bachelor's theses
dc.title: KeyCARE: a framework for biomedical Keyword Extraction, term Categorization, and semantic Relation
dc.type: info:eu-repo/semantics/bachelorThesis

Files

Original bundle

Name: TFG_ Marsol_Torrent_Sergi.pdf
Size: 3.65 MB
Format: Adobe Portable Document Format