Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/186485
Title: Aprenentatge per reforç aplicat a un cas de recursos compartits
Author: Finol Peñalver, Arnau
Director/Tutor: López Sánchez, Maite
Keywords: Aprenentatge per reforç (Intel·ligència artificial)
Treballs de fi de grau
Processos de Markov
Ciència i ètica
Algorismes computacionals
Reinforcement learning
Bachelor's theses
Markov processes
Science and ethics
Computer algorithms
Issue Date: 24-Jan-2022
Abstract: [en] This thesis explores the theoretical concepts needed to generate an ethical embedding, as well as the development of prior theoretical knowledge for understanding. Ethical embedding involves generating a Markow decision process where optimal policies are ethical based on a multi-objective Markow decision process where at least one of them follows an ethical criterion. Finally, it includes the implementation of the knowledge through the adaptation of the Common Game problem proposed by the company DeepMind and its subsequent resolution through the algorithms previously seveloped in a theoretical way.
Note: Treballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2022, Director: Maite López Sánchez
URI: http://hdl.handle.net/2445/186485
Appears in Collections:Programari - Treballs de l'alumnat
Treballs Finals de Grau (TFG) - Matemàtiques

Files in This Item:
File Description SizeFormat 
tfg_finol_peñalver_arnau.pdfMemòria631.84 kBAdobe PDFView/Open
CodiTreball.zipCodi font32.68 kBzipView/Open


This item is licensed under a Creative Commons License Creative Commons