Value-Based Reinforcement Learning algorithms in Sparse Distributed Memories to solve the Mountain-Car Problem

Francí i Rodon, Arnau

Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/67390

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Pellegrino, Paolo	-
dc.contributor.author	Francí i Rodon, Arnau	-
dc.date.accessioned	2015-10-21T11:51:14Z	-
dc.date.available	2015-10-21T11:51:14Z	-
dc.date.issued	2015-06	-
dc.identifier.uri	https://hdl.handle.net/2445/67390	-
dc.description	Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: Paolo Pellegrino	ca
dc.description.abstract	In the framework of digital electronics optimization of the memory resources used is a crucial issue. Therefore many Control algorithms are studied in order to improve the trade-off between computational power and memory requirements. In this work we explore some possibilities to improve current state-of-the-art Temporal-Difference (TD) Reinforcement Learning (RL) strategies. We made use of a type of local function approximation structures known as Sparse Distributed Memories (SDMs). The interest of this investigation underlies on the belief that SDMs architectures can help to avoid the exponential increase of memory sizes due to a linear increase in the state’s variables. Because RL doesn´t rely in prior information of the environment this is a frequent problem for these algorithms, as a lot of different features can appear to play a role when in fact only few of them are really relevant for the agent; a sampling of the states along with a method to generalize unseen states’ values becomes a must.The main achievement has been a method capable to distribute the memory locations which ensured that regions in the state space more needed had a more intense coverage, with the purpose to improve approximations’ resolution while keeping low memory requirements and high-dimensional scalability. We gave attention also to another issues as the reduction in the number of parameters.	ca
dc.format.extent	5 p.	-
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	ca
dc.rights	cc-by-nc-nd (c) Francí i Rodon, 2015	-
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	-
dc.source	Treballs Finals de Grau (TFG) - Física	-
dc.subject.classification	Intel·ligència artificial	cat
dc.subject.classification	Simulació per ordinador	cat
dc.subject.classification	Treballs de fi de grau	cat
dc.subject.other	Artificial intelligence	eng
dc.subject.other	Computer simulation	eng
dc.subject.other	Bachelor's theses	eng
dc.title	Value-Based Reinforcement Learning algorithms in Sparse Distributed Memories to solve the Mountain-Car Problem	eng
dc.type	info:eu-repo/semantics/bachelorThesis	ca
dc.rights.accessRights	info:eu-repo/semantics/openAccess	ca
Appears in Collections:	Treballs Finals de Grau (TFG) - Física

Files in This Item:

File	Description	Size	Format
TFG-Franci-Rodon-Arnau.pdf		536.92 kB	Adobe PDF	View/Open

Show simple item record

This item is licensed under a Creative Commons License