Distance-based copying of machine learning classifiers

dc.contributor.advisorPujol Vila, Oriol
dc.contributor.authorJiménez Lumbreras, Rubén
dc.date.accessioned2026-04-09T13:32:32Z
dc.date.available2026-04-09T13:32:32Z
dc.date.issued2026-01-10
dc.descriptionTreballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Any: 2026. Tutor: Oriol Pujol Vila
dc.description.abstractCopying machine learning black box classifiers is a key framework that allows practitioners to upgrade their old models, enriching them with new properties, changing their architectures or adapting them to comply with the current AI legislations. Thanks to the copying techniques and assumptions, these improvements can be done even in settings where retraining the original system from scratch is not possible, due to resource, protocol or availability constraints. In this work, we propose the use of signed distances to the decision boundary as a replacement of the black box hard labels used to build the copies, and introduce two different algorithms to compute these distances. In addition, we observe that distance-based copying could behave as a model-agnostic regularization technique and develop a flexible framework to reduce the generalization error of the copies. Then, we validate these proposals through a series of experiments on synthetic datasets and real problems. Results show that distance-based copying is successful across multiple relevant settings and evaluation metrics. Furthermore, results also validate the quality of the predicted distances and their potential as uncertainty measures.
dc.format.extent54 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/228768
dc.language.isoeng
dc.rightscc-by-nc-nd (c) Rubén Jiménez Lumbreras, 2026
dc.rightscodi: GPL (c) nom, 2026
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.rights.urihttp://www.gnu.org/licenses/gpl-3.0.ca.html
dc.sourceMàster Oficial - Fonaments de la Ciència de Dades
dc.subject.classificationAprenentatge per transferència
dc.subject.classificationSistemes classificadors (Intel·ligència artificial)
dc.subject.classificationAprenentatge automàtic
dc.subject.classificationAprenentatge profund
dc.subject.classificationRubén Jiménez Lumbreras
dc.subject.classificationTreballs de fi de màster
dc.subject.otherTransfer learning (Machine learning)
dc.subject.otherLearning classifier systems
dc.subject.otherMachine learning
dc.subject.otherDeep learning (Machine learning)
dc.subject.otherMaster's thesis
dc.titleDistance-based copying of machine learning classifiers
dc.typeinfo:eu-repo/semantics/masterThesis

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
TFM_Jimenez_Lumbreras_Ruben.pdf
Mida:
49.34 MB
Format:
Adobe Portable Document Format