Machine Learning Analysis of the Human Infant Gut Microbiome Identifies Influential Species in Type 1 Diabetes
Use this link to cite
http://hdl.handle.net/2183/28806Collections
- GI-RNASA - Artigos [195]
Metadata
Show full item recordTitle
Machine Learning Analysis of the Human Infant Gut Microbiome Identifies Influential Species in Type 1 DiabetesDate
2021Citation
FERNÁNDEZ-EDREIRA, Diego, LIÑARES-BLANCO, Jose and FERNANDEZ-LOZANO, Carlos, 2021. Machine Learning analysis of the human infant gut microbiome identifies influential species in type 1 diabetes. Expert Systems with Applications. 15 December 2021. Vol. 185, p. 115648. DOI 10.1016/j.eswa.2021.115648.
Abstract
[Abstract] Diabetes is a disease that is closely linked to genetics and epigenetics, yet mechanisms for clarifying the onset and/or progression of the disease have sometimes not been fully managed. In recent years and due to the large number of recent studies, it is known that changes in the balance of the microbiota can cause a high battery of diseases, including diabetes. Machine Learning (ML) techniques are able to identify complex, non-linear patterns of expression and relationships within the data set to extract intrinsic knowledge without any biological assumptions about the data. At the same time, mass sequencing techniques allow us to obtain the metagenomic profile of an individual, whether it is a body part, organ or tissue, and thus identify the composition of a given microbe. The great increase in the development of both technologies in their respective fields of study leads to the logical union of both to try to identify the bases of a complex disease such as diabetes. To this end, a Random Forest model has been developed at different taxonomic levels, obtaining results above 0.80 in AUC for families and above 0.98 at species level, following a strict experimental design to ensure that results are compared under equal conditions. It is identified how, in infants, the species Bacteroides uniformis, Bacteroides dorei and Bacteroides thetaiotaomicron are reduced in the microbiota of those with T1D, while, the populations of Prevotella copri increase slightly and that of Bacteroides vulgatus is much higher. Finally, thanks to the more specific metagenomic signature at species level, a model has been generated to predict those seroconverted patients not previously diagnosed with diabetes but who have expressed at least two of the autoantibodies analysed.
Keywords
Machine learning
Diabetes
T1D
Microbiota
Metagenomics
Feature selection
Random forest
Generalized linear model
Diabetes
T1D
Microbiota
Metagenomics
Feature selection
Random forest
Generalized linear model
Description
Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG
Editor version
Rights
Atribución 4.0 Internacional