Please use this identifier to cite or link to this item:
https://hdl.handle.net/2445/145297
Title: | Application of ethical reinforcement learning to a resource gathering scenario |
Author: | Huerta Climent, Martí |
Director/Tutor: | López Sánchez, Maite |
Keywords: | Intel·ligència artificial Aprenentatge automàtic Programari Treballs de fi de grau Algorismes computacionals Aspectes morals Artificial intelligence Machine learning Computer software Computer algorithms Moral aspects Bachelor's theses |
Issue Date: | 27-May-2019 |
Abstract: | [en] In this project we present an application of a formal framework for defining moral values to a multi-agent system simulation of a society facing a social dilemma. First, a description of the framework and the motivation and key concepts for the understanding of this project are explained. Then we describe the case study: A resource gathering scenario, where agents have to face a dilemma between being benevolent and helping others or not, which has an obvious impact in the survival rate of their society. We use a Python 3 framework for agent-based modelling, MESA, and describe its structure along with which classes will be used in this project. We will also describe the class design for the implementation of the project as well as any other design decision. Our goal is to successfully add a moral dimension to learning agents by modifying its learning process, through the usage of norms, in order to instill our desired moral values. The results are discussed and compared to what we expect to be the optimal performance of a society facing said dilemma. We are interested in measuring its cooperation, which impacts directly in its survival rate, with and without the application of moral values. An improvement is expected to be seen in those measures when moral values are applied. Last, further work and possible projects derived from this one are also discussed as well as possible improvements to this project. |
Note: | Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2019, Director: Maite López Sánchez |
URI: | https://hdl.handle.net/2445/145297 |
Appears in Collections: | Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Enginyeria Informàtica |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
codi.zip | Codi font | 161.4 MB | zip | View/Open |
memoria.pdf | Memòria | 1.51 MB | Adobe PDF | View/Open |
This item is licensed under a
Creative Commons License