Biblioteca para el apoyo al desarrollo de aproximaciones generales en el aprendizaje por refuerzo

López López, Daniel

dc.contributor.advisor	Fernández Blanco, Enrique
dc.contributor.author	López López, Daniel
dc.contributor.other	Enxeñaría informática, Grao en	es_ES
dc.date.accessioned	2020-08-28T08:26:47Z
dc.date.available	2020-08-28T08:26:47Z
dc.date.issued	2020
dc.identifier.uri	http://hdl.handle.net/2183/26164
dc.description.abstract	[Resumen] Este trabajo aborda la creación de una biblioteca para facilitar la implementación de modelos de aprendizaje por refuerzo usando un nuevo tipo de aproximación basada en la estructura de las partidas de los juegos de mesa. La biblioteca expone una serie de interfaces para su implementación por parte del desarrollador. Esas interfaces son “Juego”, “Jugador”, “Ronda”, “Tablero” y “Acción”. Así cualquier problema que encaje en esas interfaces podrá ser resuelto por un modelo entrenado de esta manera. Con el fin de validar la biblioteca, así como servir de ejemplo de uso, se ha creado un juego de estrategia que implementa las interfaces listadas anteriormente. Sobre dicho juego se realiza un conjunto de pruebas con el fin de comprobar la utilidad de la biblioteca. Las pruebas se realizan en torno a un modelo de red neuronal que se entrena primeramente contra el algoritmo de Monte Carlo Tree Search muy utilizado en videojuegos para implementar la inteligencia artificial. Tras esto, se pasa a realizar de nuevo un entrenamiento, pero esta vez contra otro modelo similar al que se está entrenando. Por último, se comprueba contra un jugador cómo de bien juega el modelo que se ha entrenado. Se recogen datos de cada una de estas fases y se analizan para valorar el comportamiento de la biblioteca.	es_ES
dc.description.abstract	[Abstract] This paper addresses the creation of a library to facilitate the implementation of reinformentlearning models using a new approach that is based on the structure of board games. The library exposes a series of interfaces for its implementation by the developer. Those interfaces are “Game”, “Player”, “Round”, “Board” and “Action”. So any problem that fits in those interfaces can be solved by a model trained this way. In order to validate the library, as well as serve as an example of use, a strategy game has been created that implements the interfaces listed above. A set of tests have been performed on this game in order to check the usefulness of the library. The tests are carried out around a neural network model that is first trained against the Monte Carlo Tree Search algorithm widely used in video games to implement artificial intelligence. After this, training is carried out again, but this time against another model similar to the one being trained. Finally, it is checked against a human player how well the model that has been trained plays. Data has been collected from each of these phases and analyzed to assess the behavior of the library.	es_ES
dc.language.iso	spa	es_ES
dc.rights	Atribución-Atribución-No Comercial-Compartir Igual 4.0 España	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/es/	*
dc.subject	Aprendizaje automático	es_ES
dc.subject	Aprendizaje por refuerzo	es_ES
dc.subject	Juego de mesa	es_ES
dc.subject	Keras	es_ES
dc.subject	Python	es_ES
dc.subject	Tensorflow	es_ES
dc.subject	Machine learning	es_ES
dc.subject	Reinforcement learning	es_ES
dc.subject	Board game	es_ES
dc.title	Biblioteca para el apoyo al desarrollo de aproximaciones generales en el aprendizaje por refuerzo	es_ES
dc.type	info:eu-repo/semantics/bachelorThesis	es_ES
dc.rights.access	info:eu-repo/semantics/openAccess	es_ES

Ficheiros no ítem

Nome:: license_rdf
Tamaño:: 1.346Kb
Formato:: application/rdf+xml

Ver/abrir

Nome:: D.López_López_2020_Biblioteca_ ...
Tamaño:: 1.509Mb
Formato:: PDF

Ver/abrir

Este ítem aparece na(s) seguinte(s) colección(s)

Enxeñaría informática, Grao en [452]

Mostrar o rexistro simple do ítem