Towards sample-efficient policy learning with DAC-ML

dc.contributor.authorFreire, Ismael T.
dc.contributor.authorAmil, Adrián F.
dc.contributor.authorVouloutsi, Vasiliki
dc.contributor.authorVerschure, Paul F. M. J.
dc.date.accessioned2022-02-04T17:03:03Z
dc.date.available2022-02-04T17:03:03Z
dc.date.issued2021-07-01
dc.date.updated2022-02-03T07:10:37Z
dc.description.abstractThe sample-inefficiency problem in Artificial Intelligence refers to the inability of current Deep Reinforcement Learning models to optimize action policies within a small number of episodes. Recent studies have tried to overcome this limitation by adding memory systems and architectural biases to improve learning speed, such as in Episodic Reinforcement Learning. However, despite achieving incremental improvements, their performance is still not comparable to how humans learn behavioral policies. In this paper, we capitalize on the design principles of the Distributed Adaptive Control (DAC) theory of mind and brain to build a novel cognitive architecture (DAC-ML) that, by incorporating a hippocampus-inspired sequential memory system, can rapidly converge to effective action policies that maximize reward acquisition in a challenging foraging task.
dc.format.extent7 p.
dc.format.mimetypeapplication/pdf
dc.identifier.idimarina6524917
dc.identifier.issnFreire IT;Amil AF;Vouloutsi V;Verschure PFMJ. Towards sample-efficient policy learning with DAC-ML. Procedia Computer Science, 2021, 190, 256-262
dc.identifier.issn1877-0509
dc.identifier.urihttps://hdl.handle.net/2445/182959
dc.language.isoeng
dc.publisherElsevier B.V.
dc.relation.isformatofReproducció del document publicat a: https://doi.org/10.1016/j.procs.2021.06.035
dc.relation.ispartofProcedia Computer Science, 2021, vol 190, p. 256-262
dc.relation.urihttps://doi.org/10.1016/j.procs.2021.06.035
dc.rightscc by-nc-nd (c) Freire, Ismael T. et al, 2021
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.sourceArticles publicats en revistes (Institut de Bioenginyeria de Catalunya (IBEC))
dc.subject.classificationAprenentatge automàtic
dc.subject.classificationPresa de decisions
dc.subject.otherMachine learning
dc.subject.otherDecision making
dc.titleTowards sample-efficient policy learning with DAC-ML
dc.typeinfo:eu-repo/semantics/other
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
1-s2.0-S1877050921012795-main.pdf
Mida:
958.17 KB
Format:
Adobe Portable Document Format