Q-learning in RTS games' micro-management

dc.contributor.advisor: Cerquides Bueno, Jesús
dc.contributor.advisor: Preuss, Mike
dc.contributor.author: Palacios Garzón, Ángel Camilo
dc.date.accessioned: 2015-10-16T08:23:19Z
dc.date.available: 2015-10-16T08:23:19Z
dc.date.issued: 2015-09-10
dc.description: Bachelor's degree final project in Computer Engineering (Treballs Finals de Grau d'Enginyeria Informàtica), Facultat de Matemàtiques, Universitat de Barcelona, Year: 2015, Advisor: Jesús Cerquides Bueno [ca]
dc.description.abstract: The purpose of this project is to implement the one-step Q-Learning algorithm and a similar version using linear function approximation in a combat scenario in the Real-Time Strategy game StarCraft: Brood War™. First, there is a brief description of Real-Time Strategy games, and of StarCraft in particular, together with some of the work done in the field of Reinforcement Learning. After the introduction and previous work are covered, the Reinforcement Learning problem in Real-Time Strategy games is described. Then, the development of the Reinforcement Learning agents using Q-Learning and Approximate Q-Learning is explained. It is divided into three phases: the first phase consists of defining the task that the agents must solve as a Markov Decision Process and implementing the Reinforcement Learning agents. The second phase is the training period: the agents have to learn how to destroy the rival units and avoid being destroyed in a set of training maps. This is done through exploration, because the agents have no prior knowledge of the outcome of the available actions. The third and last phase is testing the knowledge the agents acquired during training on a different set of maps, observing the results and finally comparing which agent has performed better. The expected behavior is that both Q-Learning agents will learn how to kite (attack and flee) in any combat scenario. Ultimately, this behavior could become the micro-management portion of a new bot or could be added to an existing one. [ca]
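The abstract refers to the one-step Q-Learning update and to an approximate variant based on linear function approximation. The following is a minimal Python sketch of both update rules, assuming a standard epsilon-greedy formulation; the action names, feature extractor, and hyperparameter values are illustrative placeholders and are not taken from the thesis or from the accompanying codi_font.zip.

    import random
    from collections import defaultdict

    ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1   # learning rate, discount, exploration rate (illustrative values)
    ACTIONS = ["attack", "flee"]            # hypothetical action set for a kiting agent

    # Tabular one-step Q-Learning: one estimate per (state, action) pair.
    Q = defaultdict(float)

    def choose_action(state):
        # Epsilon-greedy exploration: occasionally try a random action.
        if random.random() < EPSILON:
            return random.choice(ACTIONS)
        return max(ACTIONS, key=lambda a: Q[(state, a)])

    def q_update(state, action, reward, next_state):
        # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
        best_next = max(Q[(next_state, a)] for a in ACTIONS)
        Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])

    # Approximate Q-Learning: Q(s,a) is a linear combination of features.
    weights = defaultdict(float)

    def features(state, action):
        # Placeholder feature extractor; a real combat agent would encode
        # information such as distances, hit points or weapon cooldowns here.
        return {"bias": 1.0, "took_" + action: 1.0}

    def q_value(state, action):
        return sum(weights[f] * v for f, v in features(state, action).items())

    def approx_update(state, action, reward, next_state):
        # w_i <- w_i + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)) * f_i(s,a)
        best_next = max(q_value(next_state, a) for a in ACTIONS)
        diff = reward + GAMMA * best_next - q_value(state, action)
        for f, v in features(state, action).items():
            weights[f] += ALPHA * diff * v

In the tabular case every (state, action) pair keeps its own estimate, while the approximate version shares a small weight vector across all states, which is what allows generalization to combat situations never seen during training.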
dc.format.extent: 31 p.
dc.format.mimetype: application/pdf
dc.identifier.uri: https://hdl.handle.net/2445/67303
dc.language.iso: eng [ca]
dc.rights: report (memòria): cc-by-nc-sa (c) Ángel Camilo Palacios Garzón, 2015
dc.rights: code (codi): GPL (c) Ángel Camilo Palacios Garzón, 2015
dc.rights.accessRights: info:eu-repo/semantics/openAccess [ca]
dc.rights.uri: http://creativecommons.org/licenses/by-sa/3.0/es
dc.rights.uri: http://www.gnu.org/licenses/gpl-3.0.ca.html
dc.source: Treballs Finals de Grau (TFG) - Enginyeria Informàtica
dc.subject.classification: Aprenentatge automàtic [cat]
dc.subject.classification: Aprenentatge per reforç [cat]
dc.subject.classification: Programari [cat]
dc.subject.classification: Treballs de fi de grau [cat]
dc.subject.classification: Disseny de videojocs [ca]
dc.subject.classification: Algorismes computacionals [ca]
dc.subject.classification: Agents intel·ligents (Programes d'ordinador) [ca]
dc.subject.other: Machine learning [eng]
dc.subject.other: Reinforcement learning [eng]
dc.subject.other: Computer software [eng]
dc.subject.other: Bachelor's theses [eng]
dc.subject.other: Video games design [eng]
dc.subject.other: Computer algorithms [eng]
dc.subject.other: Intelligent agents (Computer software) [eng]
dc.title: Q-learning in RTS games' micro-management [ca]
dc.type: info:eu-repo/semantics/bachelorThesis [ca]

Files

Original bundle

Showing 1 - 2 of 2

Name: codi_font.zip
Size: 17.7 MB
Format: ZIP file
Description: Source code

Name: memoria.pdf
Size: 2.31 MB
Format: Adobe Portable Document Format
Description: Thesis report (memòria)