Comportamientos visuales sencillos en robots móviles Ranger mediante aprendizaje por refuerzo

García Cerqueiras, Aarón

dc.contributor.advisor	Regueiro, Carlos V
dc.contributor.advisor	Pardo, Xosé Manuel
dc.contributor.author	García Cerqueiras, Aarón
dc.contributor.other	Universidade da Coruña. Facultade de Informática	es_ES
dc.date.accessioned	2022-10-26T15:18:37Z
dc.date.available	2022-10-26T15:18:37Z
dc.date.issued	2022
dc.identifier.uri	http://hdl.handle.net/2183/31884
dc.description.abstract	[Resumen] En este trabajo se desarrolla un sistema que permite generar, mediante aprendizaje por refuerzo, comportamientos basados en las imágenes capturadas por una cámara en un robot capaz de moverse por sí mismo. A parte de un robot real (Formado por una Raspberry Pi 3 y el robot educativo Makeblock Ranger) se usó el simulador de robótica Webots, que facilitó el desarrollo gracias a que presenta un entorno controlado en el que se puede automatizar el aprendizaje. Los robots reciben estímulos mediante sensores, de forma automática, o manualmente, mediante la pulsación de un botón por parte de un operador humano, para así obtener un refuerzo con el que aprender el comportamiento deseado. De esta forma, y centrándonos en la tarea de seguir una línea a través de un circuito cerrado, desarrollamos dos controladores: El primero se sirve del algoritmo de Q-Learning para aprender el comportamiento, y se ayuda de técnicas de visión por computador para detectar la línea y parametrizarla. El segundo controlador se sirve de Deep Q-Learning para, tomando la imagen capturada por la cámara, identificar por si solo los elementos importantes y actuar en consecuencia. Ambos controladores se han desarrollado con una estructura modular, de forma que se puedan editar esos módulos para adaptarlos a otras tareas y otros entornos de ejecución.	es_ES
dc.description.abstract	[Abstract] In this project, a system is developed that allows generating behaviors through reinforcement learning, based on the images captured by a camera in a robot capable of moving by itself. Apart from a real robot (consisting of a Raspberry Pi 3 and the Makeblock Ranger educational robot) the Webots robotics simulator was used, which facilitated development thanks to a controlled environment where learning could also be automated. The robots receive input either automatically, through sensors, or manually, by a human operator pressing a button, all in order to obtain a reinforcement with which to learn the desired behavior. In this way, focusing on the task of following a line through a closed circuit, we developed two controllers: The first uses the Q-Learning algorithm to learn the behavior, and uses computer vision techniques to detect the line and parameterize it. The second controller uses Deep Q-learning and, by taking the image captured by the camera, identifies the important elements on its own and acts accordingly. Both drivers have been developed with a modular structure, so that those modules can be edited to suit other tasks and runtime environments.	es_ES
dc.language.iso	spa	es_ES
dc.rights	Atribución-NoComercial-CompartirIgual 3.0 España	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/es/	*
dc.subject	Aprendizaje máquina	es_ES
dc.subject	Aprendizaje por refuerzo	es_ES
dc.subject	Q-Learning	es_ES
dc.subject	Deep Q-Learning	es_ES
dc.subject	Comportamiento visual	es_ES
dc.subject	Robótica móvil	es_ES
dc.subject	Visión por computador	es_ES
dc.subject	Raspberry Pi	es_ES
dc.subject	Machine learning	es_ES
dc.subject	Reinforcement learning	es_ES
dc.subject	Visual behaviour	es_ES
dc.subject	Mobile robotics	es_ES
dc.subject	Computer vision	es_ES
dc.title	Comportamientos visuales sencillos en robots móviles Ranger mediante aprendizaje por refuerzo	es_ES
dc.type	info:eu-repo/semantics/bachelorThesis	es_ES
dc.rights.access	info:eu-repo/semantics/openAccess	es_ES
dc.description.traballos	Traballo fin de grao (UDC.FIC). Enxeñaría Informática. Curso 2021/2022	es_ES

Ficheiros no ítem

Nome:: license_rdf
Tamaño:: 1.5Kb
Formato:: application/rdf+xml

Ver/abrir

Nome:: GarciaCerqueiras_Aaron_TFG_2022.pdf
Tamaño:: 13.48Mb
Formato:: PDF

Ver/abrir

Este ítem aparece na(s) seguinte(s) colección(s)

Enxeñaría informática, Grao en [447]

Mostrar o rexistro simple do ítem