Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/186485
Title: | Aprenentatge per reforç aplicat a un cas de recursos compartits |
Author: | Finol Peñalver, Arnau |
Director/Tutor: | López Sánchez, Maite |
Keywords: | Aprenentatge per reforç (Intel·ligència artificial) Treballs de fi de grau Processos de Markov Ciència i ètica Algorismes computacionals Reinforcement learning Bachelor's theses Markov processes Science and ethics Computer algorithms |
Issue Date: | 24-Jan-2022 |
Abstract: | [en] This thesis explores the theoretical concepts needed to generate an ethical embedding, as well as the development of prior theoretical knowledge for understanding. Ethical embedding involves generating a Markow decision process where optimal policies are ethical based on a multi-objective Markow decision process where at least one of them follows an ethical criterion. Finally, it includes the implementation of the knowledge through the adaptation of the Common Game problem proposed by the company DeepMind and its subsequent resolution through the algorithms previously seveloped in a theoretical way. |
Note: | Treballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2022, Director: Maite López Sánchez |
URI: | http://hdl.handle.net/2445/186485 |
Appears in Collections: | Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Matemàtiques |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tfg_finol_peñalver_arnau.pdf | Memòria | 631.84 kB | Adobe PDF | View/Open |
CodiTreball.zip | Codi font | 32.68 kB | zip | View/Open |
This item is licensed under a Creative Commons License