Polymerase Chain Reaction. Perturbation Theory and Machine Learning Artificial Intelligence-Experimental Microbiome Analysis: Applications to Ancient DNA and Tree Soil Metagenomics Cases of Study

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvRedes de Neuronas Artificiais e Sistemas Adaptativos -Informática Médica e Diagnóstico Radiolóxico (RNASA - IMEDIR)
UDC.grupoInvLaboratorio de Enxeñaría do Software (ISLA)
UDC.journalTitleAdvanced Intelligent Systems
UDC.startPagee202500587
dc.contributor.authorRodriguez, Jose L.
dc.contributor.authorAscencio, Estefanía
dc.contributor.authorAgeitos, Jose M.
dc.contributor.authorFeijoo, Lucía
dc.contributor.authorHe, Shan
dc.contributor.authorDaghighi, Amirreza
dc.contributor.authorCasañola-Martín, Gerardo M.
dc.contributor.authorMunteanu, Cristian-Robert
dc.contributor.authorRodríguez-Yáñez, S.
dc.contributor.authorPazos, A.
dc.date.accessioned2026-02-09T10:42:28Z
dc.date.available2026-02-09T10:42:28Z
dc.date.issued2026-01-06
dc.description.abstract[Abstract]: The exponential growth of genomic data has created a pressing need for methods capable of interpreting complex biological information, especially in fields like paleogenomics and environmental metagenomics. Ancient DNA (aDNA) analysis faces challenges such as degradation and contamination, while soil metagenomic DNA (soDNA) analysis is hindered by microbial diversity and incomplete reference databases. To address these limitations, this study proposes the polymerase chain reaction (PCR).Perturbation Theory and Machine Learning (PTML) methodology, which integrates machine learning with Perturbation Theory to analyze genetic sequences without the need for alignment. Two models are developed: the first classifies bacterial aDNA sequences extracted from Miocene amber; the second predicts tree health using microbial gene abundance in forest soils. Both rely on entropy-based descriptors (θk) and structural differences (Δθk) between query and reference sequences, which serve as perturbation operators for supervised learning algorithms. This approach allows the detection of meaningful patterns even without complete genomic references. The aDNA model achieves 99.65% sensitivity and 99.81% specificity, while the soDNA model reaches 98.85% sensitivity and 92.56% specificity. These results confirm the robustness and applicability of PCR.PTML in diverse genomic contexts, presenting it as a valuable tool for ancient DNA classification and environmental metagenomics analysis.
dc.description.sponsorshipThese authors, J.L.R. and E.A., contributed equally to this work. This project was funded by grants INCITE07PXI203141ES (Conselleria de Industria, Xunta de Galicia, Spain) and BFU2009-07745 (MINECO, Spain). Likewise, the authors recognize the support of the National Science Foundation under the NSF MRI award OAC-2019077, and support by the State of North Dakota. CCAST HPC System supercomputing support at NDSU is acknowledged. In addition, the work was partly supported by Basque Government/Eusko Jaurlaritza (IT1558-22) and (IT1648-22), SPRI ELKARTEK grants AIMOFGIF (KK-2022/00032), Ministry of Science and Innovation (PID2022-137365NB-I00), and LANBIDE, INVESTIGO, Eusko Jaurlaritza, Grants, IKERDATA 2022/IKER/000040, funded by NextGenerationEU funds of European Commission, and Xunta de Galicia Consolidated group, grant ED431C 2022/46. JCY and RE are members of the Spanish climate-induced forest decline (ReDec) funded by MCIN/AEI (grant RED2024-153822-T), as well as from the projects PID2020-113244GB-C21 and PID2020-113244GA-C22 projects (both funded by MCIN/ AEI /10.13039/501100011033). Also the authors recognize the funding support of the following research groups and centers of The University of A Coruña (UDC): Software Engineering Lab (ISLA), Artificial Neuron Networks and Adaptative Systems – Medical Imaging and Radiological Diagnosis (RNASA- IMEDIR), the Centre for Information and Communications Technology Research (CITIC) and Center for Technological Innovation in Construction and Civil Engineering (CITEEC).
dc.description.sponsorshipXunta de Galicia; INCITE07PXI203141ES
dc.description.sponsorshipXunta de Galicia; ED431C 2022/46
dc.description.sponsorshipUnited States National Science Foundation; OAC-2019077
dc.description.sponsorshipGobierno Vasco; IT1558-22
dc.description.sponsorshipGobierno Vasco; IT1648-22
dc.description.sponsorshipGobierno Vasco; KK-2022/00032
dc.identifier.citationJ. L. Rodriguez, E. Ascencio, J. M. Ageitos, L. Feijoo, S. He, A. Daghighi, G. M. Casanola-Martin, C. R. Munteanu, S. Rodriguez Yañez, A. Pazos, et al.,"Polymerase Chain Reaction. Perturbation Theory and Machine Learning Artificial Intelligence-Experimental Microbiome Analysis: Applications to Ancient DNA and Tree Soil Metagenomics Cases of Study", Advanced Intelligent Systems, 2026, e202500587, https://doi.org/10.1002/aisy.202500587
dc.identifier.doi10.1002/aisy.202500587open_in_new
dc.identifier.issn2640-4567
dc.identifier.urihttps://hdl.handle.net/2183/47297
dc.language.isoeng
dc.publisherWiley-VCH GmbH
dc.relation.projectIDinfo:eu-repo/grantAgreement/MICINN/Plan Estatal de Investigación Científica y Técnica y de Innovación 2008-2011/BFU2009-07745/ES/Estudios de metagenómica paleontológica: Establecimiento de derivas génicas e implicaciones fisiológicas
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-137365NB-I00/ES/ESTRATEGIAS PARA LA ACTIVACION C-H CATALIZADA POR METALES-3D. APLICACIONES SINTETICAS Y APRENDIZAJE AUTOMATICO EN ESTUDIOS DE REACTIVIDAD QUIMICA Y ACTIVIDAD BIOLOGICA
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2024-2027/RED2024-153822-T/ES/RED IBERICA DE DECAIMIENTO FORESTAL INDUCIDO POR EL CLIMA
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113244GB-C21/ES/DESARROLLO DE NUEVAS PRACTICAS FORESTALES INTELIGENTES (SSF) PARA MEJORAR LA CONSERVACION DEL SUELO Y LA PROVISION A LARGO PLAZO DE SERVICIOS ECOSISTEMICOS CLAVE DE LOS BOSQUE
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113244GA-C22/ES/DESARROLLO DE HERRAMIENTAS PARA EL DIAGNOSTICO TEMPRANO DE SALUD EN EL CONTEXTO DE VULNERABILIDAD FORESTAL
dc.relation.urihttps://doi.org/10.1002/aisy.202500587open_in_new
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectDNA sequences
dc.subjectMachine learning
dc.subjectPerturbation Theory
dc.subjectShannon Information Theory
dc.titlePolymerase Chain Reaction. Perturbation Theory and Machine Learning Artificial Intelligence-Experimental Microbiome Analysis: Applications to Ancient DNA and Tree Soil Metagenomics Cases of Study
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublicationfac98c9d-7cc7-4b09-bbb1-1068637fc73f
relation.isAuthorOfPublication8cd20c83-ea67-4239-ba03-c61eb1d04dbc
relation.isAuthorOfPublicationfa192a4c-bffd-4b23-87ae-e68c29350cdc
relation.isAuthorOfPublication.latestForDiscoveryfac98c9d-7cc7-4b09-bbb1-1068637fc73f

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Munteanu_CristianR_2026_Polymerase_Chain_Reaction.pdf
Size:
3.74 MB
Format:
Adobe Portable Document Format