Aprenentatge per reforç aplicat a un cas de recursos compartits

Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/186485

Title:	Aprenentatge per reforç aplicat a un cas de recursos compartits
Author:	Finol Peñalver, Arnau
Director/Tutor:	López Sánchez, Maite
Keywords:	Aprenentatge per reforç (Intel·ligència artificial) Treballs de fi de grau Processos de Markov Ciència i ètica Algorismes computacionals Reinforcement learning Bachelor's theses Markov processes Science and ethics Computer algorithms
Issue Date:	24-Jan-2022
Abstract:	[en] This thesis explores the theoretical concepts needed to generate an ethical embedding, as well as the development of prior theoretical knowledge for understanding. Ethical embedding involves generating a Markow decision process where optimal policies are ethical based on a multi-objective Markow decision process where at least one of them follows an ethical criterion. Finally, it includes the implementation of the knowledge through the adaptation of the Common Game problem proposed by the company DeepMind and its subsequent resolution through the algorithms previously seveloped in a theoretical way.
Note:	Treballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2022, Director: Maite López Sánchez
URI:	http://hdl.handle.net/2445/186485
Appears in Collections:	Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Matemàtiques

Files in This Item:

File	Description	Size	Format
tfg_finol_peñalver_arnau.pdf	Memòria	631.84 kB	Adobe PDF	View/Open
CodiTreball.zip	Codi font	32.68 kB	zip	View/Open

This item is licensed under a Creative Commons License