Value-Based Reinforcement Learning algorithms in Sparse Distributed Memories to solve the Mountain-Car Problem

Francí i Rodon, Arnau

Files

TFG-Franci-Rodon-Arnau.pdf (536.92 KB)

Document type

Bachelor's thesis

Publication date

2015-06

Publication license

Please always use this identifier to cite or link to this document: https://hdl.handle.net/2445/67390

Authors

Francí i Rodon, Arnau

Advisor/Tutor

Pellegrino, Paolo

Abstract

In digital electronics, optimizing the use of memory resources is a crucial issue, so many control algorithms are studied with the aim of improving the trade-off between computational power and memory requirements. In this work we explore some possibilities for improving current state-of-the-art Temporal-Difference (TD) Reinforcement Learning (RL) strategies. We make use of a type of local function approximation structure known as Sparse Distributed Memories (SDMs). The interest of this investigation lies in the belief that SDM architectures can help avoid the exponential growth in memory size caused by a linear increase in the number of state variables. Because RL does not rely on prior information about the environment, this is a frequent problem for these algorithms: many different features may appear to play a role when in fact only a few of them are really relevant to the agent, so sampling the states and generalizing values to unseen states becomes a must. The main achievement is a method for distributing the memory locations so that the regions of the state space where they are most needed receive denser coverage, with the aim of improving the resolution of the approximation while keeping memory requirements low and preserving scalability to high dimensions. We also paid attention to other issues, such as reducing the number of parameters.
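To make the combination of TD learning and a Sparse Distributed Memory concrete, the following is a minimal illustrative sketch and not the method developed in the thesis. It assumes the standard Mountain-Car dynamics from Sutton and Barto, a fixed random placement of memory locations in the normalized state space (the thesis's adaptive placement is not reproduced), a hard activation radius, and one-step Q-learning. The names `SDM`, `step`, `greedy`, `q_learning` and all parameter values are hypothetical choices made for illustration.

```python
import math
import random

X_MIN, X_MAX = -1.2, 0.5      # position bounds of the standard Mountain-Car task
V_MIN, V_MAX = -0.07, 0.07    # velocity bounds
ACTIONS = (-1, 0, 1)          # reverse, coast, forward


def step(x, v, a):
    """One step of the standard Mountain-Car dynamics; reward is -1 per step."""
    v = min(max(v + 0.001 * a - 0.0025 * math.cos(3 * x), V_MIN), V_MAX)
    x = x + v
    if x < X_MIN:             # inelastic wall on the left
        x, v = X_MIN, 0.0
    return x, v, -1.0, x >= X_MAX


class SDM:
    """Assumed, simplified SDM: random locations, hard activation radius."""

    def __init__(self, n_locations=200, radius=0.15, seed=0):
        rng = random.Random(seed)
        # Locations live in the unit square (normalized position, velocity).
        self.centers = [(rng.random(), rng.random()) for _ in range(n_locations)]
        self.radius = radius
        self.q = [[0.0] * len(ACTIONS) for _ in range(n_locations)]

    def active(self, x, v):
        """Indices of the locations within `radius` of the normalized state."""
        sx = (x - X_MIN) / (X_MAX - X_MIN)
        sv = (v - V_MIN) / (V_MAX - V_MIN)
        return [i for i, (cx, cv) in enumerate(self.centers)
                if math.hypot(sx - cx, sv - cv) <= self.radius]

    def value(self, idx, a):
        """Action value: mean over active locations (0.0 if none are active)."""
        return sum(self.q[i][a] for i in idx) / len(idx) if idx else 0.0

    def update(self, idx, a, td_error, alpha):
        """Shift every active location, so the mean moves by alpha * td_error."""
        for i in idx:
            self.q[i][a] += alpha * td_error


def greedy(mem, idx):
    values = [mem.value(idx, a) for a in range(len(ACTIONS))]
    return values.index(max(values))


def q_learning(mem, episodes=50, max_steps=1000, alpha=0.1, gamma=1.0,
               epsilon=0.1, seed=0):
    """Epsilon-greedy one-step Q-learning; returns the length of each episode."""
    rng = random.Random(seed)
    lengths = []
    for _ in range(episodes):
        x, v = rng.uniform(-0.6, -0.4), 0.0
        idx = mem.active(x, v)
        steps = 0
        for steps in range(1, max_steps + 1):
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = greedy(mem, idx)
            x, v, r, done = step(x, v, ACTIONS[a])
            nxt = mem.active(x, v)
            best_next = max(mem.value(nxt, b) for b in range(len(ACTIONS)))
            target = r if done else r + gamma * best_next
            mem.update(idx, a, target - mem.value(idx, a), alpha)
            idx = nxt
            if done:
                break
        lengths.append(steps)
    return lengths
```

Calling `q_learning(SDM())` returns a list with the number of steps taken in each episode, capped at `max_steps`. In this sketch the memory cost is set by `n_locations` rather than by a grid over every state variable, which is the property the abstract appeals to; how well such a fixed random layout performs is not claimed here.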

Description

Bachelor's theses in Physics, Faculty of Physics, Universitat de Barcelona, Year: 2015, Advisor: Paolo Pellegrino

Subjects

Artificial intelligence, Computer simulation, Bachelor's theses

Collections

Bachelor's Theses (TFG) - Physics

Citation

FRANCÍ I RODON, Arnau. Value-Based Reinforcement Learning algorithms in Sparse Distributed Memories to solve the Mountain-Car Problem. [consulted: 24 of July of 2026]. Available at: https://hdl.handle.net/2445/67390
