Distance-based copying of machine learning classifiers
| dc.contributor.advisor | Pujol Vila, Oriol | |
| dc.contributor.author | Jiménez Lumbreras, Rubén | |
| dc.date.accessioned | 2026-04-09T13:32:32Z | |
| dc.date.available | 2026-04-09T13:32:32Z | |
| dc.date.issued | 2026-01-10 | |
| dc.description | Master's final projects, Master's in Fundamentals of Data Science, Faculty of Mathematics, Universitat de Barcelona. Year: 2026. Advisor: Oriol Pujol Vila | |
| dc.description.abstract | Copying black-box machine learning classifiers is a key framework that allows practitioners to upgrade their old models by enriching them with new properties, changing their architectures, or adapting them to comply with current AI legislation. Thanks to the copying techniques and assumptions, these improvements can be made even in settings where retraining the original system from scratch is not possible due to resource, protocol, or availability constraints. In this work, we propose the use of signed distances to the decision boundary as a replacement for the black-box hard labels used to build the copies, and introduce two different algorithms to compute these distances. In addition, we observe that distance-based copying could behave as a model-agnostic regularization technique, and we develop a flexible framework to reduce the generalization error of the copies. We then validate these proposals through a series of experiments on synthetic datasets and real problems. The results show that distance-based copying is successful across multiple relevant settings and evaluation metrics. They also validate the quality of the predicted distances and their potential as uncertainty measures. | |
| dc.format.extent | 54 p. | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.uri | https://hdl.handle.net/2445/228768 | |
| dc.language.iso | eng | |
| dc.rights | cc-by-nc-nd (c) Rubén Jiménez Lumbreras, 2026 | |
| dc.rights | code: GPL (c) nom, 2026 | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | |
| dc.rights.uri | http://www.gnu.org/licenses/gpl-3.0.ca.html | |
| dc.source | Official Master's Degree - Fundamentals of Data Science | |
| dc.subject.classification | Transfer learning | |
| dc.subject.classification | Classifier systems (Artificial intelligence) | |
| dc.subject.classification | Machine learning | |
| dc.subject.classification | Deep learning | |
| dc.subject.classification | Rubén Jiménez Lumbreras | |
| dc.subject.classification | Master's theses | |
| dc.subject.other | Transfer learning (Machine learning) | |
| dc.subject.other | Learning classifier systems | |
| dc.subject.other | Machine learning | |
| dc.subject.other | Deep learning (Machine learning) | |
| dc.subject.other | Master's thesis | |
| dc.title | Distance-based copying of machine learning classifiers | |
| dc.type | info:eu-repo/semantics/masterThesis |
Files
Original bundle
- Name: TFM_Jimenez_Lumbreras_Ruben.pdf
- Size: 49.34 MB
- Format: Adobe Portable Document Format