Use this link to cite:
http://hdl.handle.net/2183/31885 Estudio, análisis y desarrollo de técnicas de “interactive machine learning” y su comparación con el “machine learning” tradicional.
Loading...
Identifiers
Publication date
Authors
Pérez-Sánchez, Alberto
Other responsabilities
Universidade da Coruña. Facultade de Informática
Journal Title
Bibliographic citation
Type of academic work
Academic degree
Abstract
[Resumen]: El Aprendizaje Máquina es usado por gran parte de la comunidad científica como una caja
negra: datos de entrada, ajuste de hiperparámetros, entrenamiento del modelo y obtención de
resultados. Los modelos que se desarrollen bajo este escenario corren el riesgo de no escalar
bien, volverse estáticos o ser difíciles de evaluar.
El Aprendizaje Máquina Interactivo son un conjunto de técnicas que intentan introducir
activamente a los humanos en el bucle de aprendizaje de los modelos, funciona especialmente
bien cuando el dominio es complejo, no hay suficientes datos o son muy costosos de anotar.
El Aprendizaje Máquina Interactivo se divide dependiendo del papel que ocupa el humano
dentro del ciclo de aprendizaje del modelo. En nuestro proyecto el Aprendizaje Activo es la
aproximación desarrollada que consiste en que el modelo consulta al humano (oráculo) los
casos que generan mayor incertidumbre en sus predicciones.
En este proyecto aplicamos las técnicas de Aprendizaje Máquina Interactivo a un problema
real con datos de pacientes oncológicos con cáncer de páncreas, donde los datos y el tiempo
de etiquetado de los médicos son limitados. Para aplicar la solución fue necesario utilizar
técnicas de aumento de datos generando casos sintéticos de cáncer de páncreas mediante una
Red Generativa Antagónica. Los resultados muestran como la incorporación del humano al
bucle de entrenamiento provoca que el modelo converja de manera más rápida y eficiente con
un menor coste humano que las aproximaciones clásicas.
[Abstract]: Machine Learning is used by most of the scientific community as a black box: input data, hyperparameters optimization, training the model and obtaining results. Models developed under this scenario run the risk of not scaling well, becoming static or difficult to evaluate. Interactive Machine Learning are a set of techniques that attempt to actively introduce humans into the learning loop of models, works especially well when the domain is complex, not enough data or is too costly to annotate. Interactive Machine Learning is divided depending on the role of the human within the model learning loop. In our project, Active Learning is the approach developed which consists of the model consulting the human (oracle) for the cases that generate greater uncertainty in its predictions. In this project we applied the Interactive Machine Learning techniques to a real problem with data from oncological patients with pancreatic cancer, where data and labeling time of physicians are limited. To apply the solution it was necessary to use data augmentation techniques generating synthetic cases of pancreatic cancer using an Generative Adversarial Network. The results show how the incorporation of the human into the training loop causes the model to converge faster and more efficiently with a lower human cost than classical approaches.
[Abstract]: Machine Learning is used by most of the scientific community as a black box: input data, hyperparameters optimization, training the model and obtaining results. Models developed under this scenario run the risk of not scaling well, becoming static or difficult to evaluate. Interactive Machine Learning are a set of techniques that attempt to actively introduce humans into the learning loop of models, works especially well when the domain is complex, not enough data or is too costly to annotate. Interactive Machine Learning is divided depending on the role of the human within the model learning loop. In our project, Active Learning is the approach developed which consists of the model consulting the human (oracle) for the cases that generate greater uncertainty in its predictions. In this project we applied the Interactive Machine Learning techniques to a real problem with data from oncological patients with pancreatic cancer, where data and labeling time of physicians are limited. To apply the solution it was necessary to use data augmentation techniques generating synthetic cases of pancreatic cancer using an Generative Adversarial Network. The results show how the incorporation of the human into the training loop causes the model to converge faster and more efficiently with a lower human cost than classical approaches.
Description
Keywords
Aprendizaje máquina Aprendizaje activo Enseñanza máquina Humano en el bucle Aprendizaje curricular Inteligencia artificial explicable Red neuronal artificial Red generativa antagónica Machine learning Active learning Curriculum Learning Human in the loop eXplainable artificial intelligence Artificial neural network Generative adversarial network
Editor version
Rights
Todos os dereitos reservados






