Machine learning copies as a means for black box model evaluation

Rovira Esteva, Muriel

Machine learning copies as a means for black box model evaluation

dc.contributor.advisor	Zeber, David
dc.contributor.advisor	Pujol Vila, Oriol
dc.contributor.author	Rovira Esteva, Muriel
dc.date.accessioned	2022-05-25T06:45:14Z
dc.date.available	2022-05-25T06:45:14Z
dc.date.issued	2021-09-02
dc.description	Treballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Curs: 2020-2021. Tutor: David Zeber i Oriol Pujol Vila	ca
dc.description.abstract	[en] The use of propietary black-box machine learning models and APIs in the form of Machine Learning as a Service, makes it very difficult to control and mitigate their potential harmful effects (such as lack of transparency, privacy safeguards, robustness, reusability or fairness). The state-of-the-art technique of Machine Learning Classifier Copying allows us to build a new model that replicates the decision behaviour of an existing one without the need of knowing its architecture nor having access to the original training data. PRESC (Performance and Robustness Evaluation for Statistical Classifiers) is an existing free software tool for the evaluation of machine learning classifiers, which is maintained by Mozilla’s Data Science team. It aims to provide a toolkit to analyze model performance beyond the standard accuracy-based methods and into areas which tend to be underexplored in data science practice. Among the multiple applications of Machine Learning Classifier Copying, a systematic construction and examination of model copies has the potential to be an universally accessible and inexpensive approach to study and evaluate a rich variety of original models, and to help understand its behavior. In this work, an implementation of Machine Learning Classifier Copying has been contributed to the PRESC project, so that this tool becomes readily accessible to researchers and practitioners, and its applicability and performance in a synthetic problem has been explored to understand the copying process. The solution provides a model agnostic sampling strategy and an automated copy process for a number of fundamentally different hypothesis spaces, so that the set of achievable copy-model-fidelity measures can be used as a diagnostic measure of the original model characteristics.	ca
dc.format.extent	48 p.
dc.format.mimetype	application/pdf
dc.identifier.uri	https://hdl.handle.net/2445/186003
dc.language.iso	eng	ca
dc.rights	cc-by-nc-nd (c) Muriel Rovira Esteva, 2021
dc.rights	codi: Mozilla Public License 2.0 (c) Muriel Rovira Esteva, 2021
dc.rights.accessRights	info:eu-repo/semantics/openAccess	ca
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	*
dc.rights.uri	https://www.mozilla.org/en-US/MPL/2.0/	*
dc.source	Màster Oficial - Fonaments de la Ciència de Dades
dc.subject.classification	Aprenentatge automàtic
dc.subject.classification	Sistemes classificadors (Intel·ligència artificial)
dc.subject.classification	Treballs de fi de màster
dc.subject.other	Machine learning
dc.subject.other	Learning classifier systems
dc.subject.other	Master's theses
dc.title	Machine learning copies as a means for black box model evaluation	ca
dc.type	info:eu-repo/semantics/masterThesis	ca

Fitxers

Paquet original

Mostrant 1 - 2 de 2

Nom:: tfg_rovira_esteva_muriel.pdf
Mida:: 5.35 MB
Format:: Adobe Portable Document Format
Descripció:: Memòria

Descarregar

Nom:: PRESC-master_thesis.zip
Mida:: 32.08 MB
Format:: ZIP file
Descripció:: Codi font

Descarregar

Col·leccions

Màster Oficial - Fonaments de la Ciència de Dades
Programari - Treballs de l'alumnat