Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/186821
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorCorcuera Valverde, José Manuel-
dc.contributor.authorRosell Esau, Keila Ruth-
dc.date.accessioned2022-06-20T08:36:26Z-
dc.date.available2022-06-20T08:36:26Z-
dc.date.issued2022-01-23-
dc.identifier.urihttp://hdl.handle.net/2445/186821-
dc.descriptionTreballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2022, Director: José Manuel Corcuera Valverdeca
dc.description.abstract[en] In this thesis we study the optimization method called Dynamic Programming and how it is implemented to solve sequential problems, that is, those problems in which the solution is to make a series of decisions in many different stages in order to maximize a reward, according to a purpose. Different approaches are analyzed, depending on whether all the data is known for the problem, in the deterministic case, or if the data is determined by a probability distribution, in the stochastic case. A distinction will also be made for cases where time evolves in a discrete way or if it does so continuously. For each case we will develop the Hamilton-Jacobi-Bellman equation, which is a central element of the dynamic programming algorithms and is useful in finding and comparing different strategies for the decision-making agent. Finally, dynamic programming is applied to reinforcement learning, which is an area of artificial intelligence that is focused on determining what actions a software agent must choose in a given environment, in order to find the highest reward.ca
dc.format.extent48 p.-
dc.format.mimetypeapplication/pdf-
dc.language.isospaca
dc.rightscc-by-nc-nd (c) Keila Ruth Rosell Esau, 2022-
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.sourceTreballs Finals de Grau (TFG) - Matemàtiques-
dc.subject.classificationEquacions de Hamilton-Jacobica
dc.subject.classificationTreballs de fi de grau-
dc.subject.classificationProgramació dinàmicaca
dc.subject.classificationOptimització matemàticaca
dc.subject.classificationProcessos de Markovca
dc.subject.otherHamilton-Jacobi equationsen
dc.subject.otherBachelor's theses-
dc.subject.otherDynamic programmingen
dc.subject.otherMathematical optimizationen
dc.subject.otherMarkov processesen
dc.titleOptimización con programación dinámicaca
dc.typeinfo:eu-repo/semantics/bachelorThesisca
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
Appears in Collections:Treballs Finals de Grau (TFG) - Matemàtiques

Files in This Item:
File Description SizeFormat 
tfg_rosell_esau_keila_ruth.pdfMemòria628.57 kBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons