Please use this identifier to cite or link to this item:
Title: Application of ethical reinforcement learning to a resource gathering scenario
Author: Huerta Climent, Martí
Director/Tutor: López Sánchez, Maite
Keywords: Intel·ligència artificial
Aprenentatge automàtic
Treballs de fi de grau
Algorismes computacionals
Aspectes morals
Artificial intelligence
Machine learning
Computer software
Computer algorithms
Moral aspects
Bachelor's theses
Issue Date: 27-May-2019
Abstract: [en] In this project we present an application of a formal framework for defining moral values to a multi-agent system simulation of a society facing a social dilemma. First, a description of the framework and the motivation and key concepts for the understanding of this project are explained. Then we describe the case study: A resource gathering scenario, where agents have to face a dilemma between being benevolent and helping others or not, which has an obvious impact in the survival rate of their society. We use a Python 3 framework for agent-based modelling, MESA, and describe its structure along with which classes will be used in this project. We will also describe the class design for the implementation of the project as well as any other design decision. Our goal is to successfully add a moral dimension to learning agents by modifying its learning process, through the usage of norms, in order to instill our desired moral values. The results are discussed and compared to what we expect to be the optimal performance of a society facing said dilemma. We are interested in measuring its cooperation, which impacts directly in its survival rate, with and without the application of moral values. An improvement is expected to be seen in those measures when moral values are applied. Last, further work and possible projects derived from this one are also discussed as well as possible improvements to this project.
Note: Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2019, Director: Maite López Sánchez
Appears in Collections:Treballs Finals de Grau (TFG) - Enginyeria Informàtica
Programari - Treballs de l'alumnat

Files in This Item:
File Description SizeFormat 
codi.zipCodi font161.4 MBzipView/Open
memoria.pdfMemòria1.51 MBAdobe PDFView/Open

This item is licensed under a Creative Commons License Creative Commons