Aprendizaje profundo con predicción monocular de profundidad estéreo para el diagnóstico de glaucoma

Díaz Freire, Miguel Alejandro

Ver/abrir

DiazFreire_MiguelAlejandro_TFG_2021.pdf (8.647Mb)

Use este enlace para citar

http://hdl.handle.net/2183/30957

A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución-NoComercial-SinDerivadas 3.0 España

Coleccións

Enxeñaría informática, Grao en [526]

Metadatos

Mostrar o rexistro completo do ítem

Título

Aprendizaje profundo con predicción monocular de profundidad estéreo para el diagnóstico de glaucoma

Autor(es)

Díaz Freire, Miguel Alejandro

Director(es)

Rouco, J.

Data

2021

Centro/Dpto/Entidade

Enxeñaría informática, Grao en

Descrición

Traballo fin de grao. Enxeñaría Informática. Curso 2020/2021

Resumo

[Resumen] El diagnóstico de afecciones oculares se realiza mediante el apoyo de diferentes modalidades de imágenes oftalmológicas. En concreto, en el diagnóstico de glaucoma, grave enfermedad que apenas presenta síntomas en sus etapas iniciales, es útil la utilización de retinografías para obtener indicadores de la presencia de la enfermedad. En general, las retinografías están ampliamente extendidas en el mundo de la oftalmología por ser de obtención no-invasiva. Sin embargo, existen diferentes tipos de retinografías. Descatan las retinografías monoculares, de fácil obtención, y las estereográficas, más complejas de obtener y menos accesibles en el ámbito hospitalario por necesidad de cámaras especializadas, pero que arrojan información sobre la profundidad del fondo ocular muy relevante a la hora de diagnósticar el glaucoma. En este proyecto se plantea la utilización de metodologías de aprendizaje profundo mediante diferentes aproximaciones para la obtención de la información de profundidad de las retinografías estereográficas pero a partir de retinografías monoculares. Por un lado, se plantea la predicción directa del mapa de profundidad del fondo ocular. Por otro lado, se plantea la predicción del par estereográfico a partir de la otra imagen del par, considerada una retinografía monocular. Finalmente, una vez obtenida esta información de profundidad, se plantea el uso del conocimiento adquirido en esta tarea de pre-entrenamiento para entrenar otro modelo cuyo objetivo sea realizar la tarea de segmentación semántica de disco y copa, regiones del fondo del ojo cuyo ratio es indicador de la presencia de glaucoma. Este procedimiento es conocido como transfer-learning, del cual se quiere demostrar su validez en tareas contenidas dentro del mismo campo semántico. Al comparar los resultados obtenidos por los modelos de segmentación utilizando el conocimiento transferido de las otras tareas con modelos de segmentación entrenados sin este conocimiento, se ha podido demostrar que el transfer-learning mejora los resultados en cuanto a la segmentación de la copa se refiere. En el caso de la segmentación de disco, los resultados se mantienen parejos a los de los modelos de referencia. Por otro lado, se han encontrado indicios de que dependiendo de la naturaleza del problema a resolver, es posible que, si la tarea de pre-entrenamiento aprende detalles demasiado específicos sobre su propia tarea, el conocimiento transferido empeore los resultados de la segmentación, como es el caso de la tarea de predicción del mapa de profundidad. No obstante, la tarea de predicción del par estereográfico muestra una tendencia a mejorar la segmentación cuanto mejor resuelva su tarea el modelo de pre-entrenamiento. Estas posibilidades plantean nuevas líneas de investigación que sería interesante seguir en un futuro.

[Abstract] The diagnosis of ocular conditions is made through the support of different ophthalmological imaging modalities. In particular, in the diagnosis of glaucoma, a serious disease that has nearly no symptoms on its early stages, the use of retinographies is useful in order to obtain indicators of the presence of the disease. In general, retinographies are broadly used on the ophthalmological world because they can be obtained in a non-invasive way. However, there are different types of retinographies, standing out the monocular ones, easily obtained, and the stereographic ones, more complex to obtain and less accesible in the hospital setting due to the need for specialized cameras, but which give information about the depth of the ocular fundus, which is very relevant when diagnosing glaucoma. In this project, the use of deep learning methodologies is proposed through different approaches to obtain the depth information derived from the stereographic retinographies but from monocular retinographies. On one hand, direct prediction of the ocular fundus depth map is proposed. On the other hand, the prediction of the stereographic pair from the other image of the pair is considered. Finally, once the depth information has been obtained, the use of the knowledge acquired during this pre-training tasks is used to train another model whose objective is to perform the task of cup and disc semantic segmentation, regions of the ocular fundus whose ratio is an indicator of the presence of glaucoma. This procedure is known as transfer-learning and the final objetive of the project is to show its validity in the tasks contained in the same semantic field. When comparing the results obtained by the segmentation models using the transferred knowledge from the other tasks with the segmentation models trained without this knowledge, it has been shown that transfer-learning improved the results of the cup segmentation. In the case of disk segmentation, the results do not improve nor get worse compared with the reference ones. On the other hand, indications have been found that depending on the nature of the problem to be solved, it can be that, if the pre-training task learns too specific details about his own task, the transferred knowledge can make the segmentation results worse, which is the case with the depth map prediction task. Nevertheless, the stereo par prediction task shows a tendency to improve the segmentation the better the pre-training model solves its task. These possibilities raise new lines of research that it would be interesting to pursue in the future.

Palabras chave

Oftalmología
Retinografía estereográfica
Retinografía monocular
Glaucoma
Aprendizaje profundo
Transfer-learning
Segmentación semántica

Dereitos

Atribución-NoComercial-SinDerivadas 3.0 España