Application of ethical reinforcement learning to a resource gathering scenario

dc.contributor.advisorLópez Sánchez, Maite
dc.contributor.authorHuerta Climent, Martí
dc.date.accessioned2019-11-25T08:54:25Z
dc.date.available2019-11-25T08:54:25Z
dc.date.issued2019-05-27
dc.descriptionTreballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2019, Director: Maite López Sánchezca
dc.description.abstract[en] In this project we present an application of a formal framework for defining moral values to a multi-agent system simulation of a society facing a social dilemma. First, a description of the framework and the motivation and key concepts for the understanding of this project are explained. Then we describe the case study: A resource gathering scenario, where agents have to face a dilemma between being benevolent and helping others or not, which has an obvious impact in the survival rate of their society. We use a Python 3 framework for agent-based modelling, MESA, and describe its structure along with which classes will be used in this project. We will also describe the class design for the implementation of the project as well as any other design decision. Our goal is to successfully add a moral dimension to learning agents by modifying its learning process, through the usage of norms, in order to instill our desired moral values. The results are discussed and compared to what we expect to be the optimal performance of a society facing said dilemma. We are interested in measuring its cooperation, which impacts directly in its survival rate, with and without the application of moral values. An improvement is expected to be seen in those measures when moral values are applied. Last, further work and possible projects derived from this one are also discussed as well as possible improvements to this project.ca
dc.format.extent38 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/145297
dc.language.isoengca
dc.rightsmemòria: cc-nc-nd (c) Martí Huerta Climent, 2019
dc.rightscodi: GPL (c) Martí Huerta Climent, 2019
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.rights.urihttp://www.gnu.org/licenses/gpl-3.0.ca.html*
dc.sourceTreballs Finals de Grau (TFG) - Enginyeria Informàtica
dc.subject.classificationIntel·ligència artificialca
dc.subject.classificationAprenentatge automàticca
dc.subject.classificationProgramarica
dc.subject.classificationTreballs de fi de grauca
dc.subject.classificationAlgorismes computacionalsca
dc.subject.classificationAspectes moralsca
dc.subject.otherArtificial intelligenceen
dc.subject.otherMachine learningen
dc.subject.otherComputer softwareen
dc.subject.otherComputer algorithmsen
dc.subject.otherMoral aspectsen
dc.subject.otherBachelor's thesesen
dc.titleApplication of ethical reinforcement learning to a resource gathering scenarioca
dc.typeinfo:eu-repo/semantics/bachelorThesisca

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
codi.zip
Mida:
157.62 MB
Format:
ZIP file
Descripció:
Codi font
Carregant...
Miniatura
Nom:
memoria.pdf
Mida:
1.48 MB
Format:
Adobe Portable Document Format
Descripció:
Memòria