Nanopore full length 16S rRNA gene sequencing increases species resolution in bacterial biomarker discovery

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.departamentoBioloxía
UDC.departamentoFisioterapia, Medicina e Ciencias Biomédicas
UDC.grupoInvLaboratorio de Bases de Datos (LBD)
UDC.institutoCentroCICA - Centro Interdisciplinar de Química e Bioloxía
UDC.institutoCentroINIBIC - Instituto de Investigacións Biomédicas de A Coruña
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitleScientific Reports
UDC.startPage26486
UDC.volume15
dc.contributor.authorAja-Macaya, Pablo
dc.contributor.authorConde-Pérez, Kelly
dc.contributor.authorNoelia Trigo-Tasende
dc.contributor.authorNasser-Ali, Mohammed
dc.contributor.authorNión, P.
dc.contributor.authorRumbo-Feal, Soraya
dc.contributor.authorLadra, Susana
dc.contributor.authorBou, Germán
dc.contributor.authorVallejo, J. A.
dc.contributor.authorPoza, Margarita
dc.date.accessioned2025-10-09T12:34:12Z
dc.date.available2025-10-09T12:34:12Z
dc.date.issued2025-07-21
dc.descriptionRaw sequence reads generated from 16S rRNA gene sequencing by MiSeq (Illumina-V3V4) and MinION (ONT-V1V9, Dorado sup v4.1.0) are available in the NCBI SRA database under bioproject PRJNA911189. Used scripts and additional documentation are available at https://github.com/Pablo-Aja-Macaya/CRC-ONT16S.
dc.description.abstract[Abstract]: Discovery of disease-related bacterial biomarkers could be a useful approach for early prevention or diagnosis of various afflictions, such as colorectal cancer. This typically involves analyzing small regions of the 16S rRNA gene (e.g. V3V4) through short-read technologies like Illumina, obtaining genus-level results. However, recent developments in third-generation sequencing, such as Oxford Nanopore Technologies (ONT)’s new R10.4.1 chemistry and its improved basecalling models, are beginning to allow for a more complete and accessible species-level analysis through full-length 16S rRNA gene sequencing (spanning regions V1-V9). Thus, the goal of this study was to compare and evaluate both approaches, using colorectal cancer biomarker discovery as a representative case. This was achieved through the analysis of feces from 123 subjects, comparing both methods (Illumina-V3V4 with DADA2 and QIIME2 vs. ONT-V1V9 with Emu), multiple Dorado basecalling models (fast, hac and sup) and multiple databases (SILVA vs. Emu’s Default database). Basecalling models broadly resulted in similar taxonomic output, but had significantly higher observed species and different taxonomic identification the lower the basecalling quality (p-value<0.05). Database choice with Emu influenced the identified species greatly, with Emu’s Default database obtaining significantly higher diversity and identified species than SILVA (p-value<0.05). However, it overconfidently classified at times what should be an unknown species as the closest match due to its database structure. Bacterial abundance between Illumina-V3V4 and ONT-V1V9 at the genus level correlated well (R2 0.8). Nanopore sequencing identified more specific bacterial biomarkers for colorectal cancer than those obtained with Illumina, such as Parvimonas micra, Fusobacterium nucleatum, Peptostreptococcus stomatis, Peptostreptococcus anaerobius, Gemella morbillorum, Clostridium perfringens, Bacteroides fragilis and Sutterella wadsworthensis. Prediction of colorectal cancer through manual feature selection and machine learning resulted in an AUC of 0.87 with 14 species or 0.82 with just 4 species (P. micra, F. nucleatum, B. fragilis and Agathobaculum butyriciproducens). Full 16S rRNA V1V9 sequencing through Oxford Nanopore and its new R10.4.1 chemistry achieved accurate species-level bacterial identification, facilitating the discovery of more precise disease-related biomarkers and increasing the taxonomic fidelity of future microbiome analyses.
dc.description.sponsorshipThis study has been funded by the Instituto de Salud Carlos III (ISCIII), through the projects PI20/00413 and PI23/00696, cofunded by the European Union (EU) to MP. Also, the work was funded by the project IN607A2024/09 from Xunta de Galicia (Spain) to GB. Additionally, it has been supported by the project RTI2018-102032-B-I00 from the Spanish Ministry of Science and Innovation to AM and by CIBER INFEC ISCIII (CB21/13/00055) to GB and MP. KC-P was financially supported by the Spanish Association against Cancer (AECC) from December 2019 to December 2023. Additionally, this work, integrated into the framework of PERTE for Vanguard Health, has been co-financed by the Spanish Ministry of Science, Innovation and Universities with funds from the European Union NextGenerationEU, from the Recovery, Transformation and Resilience Plan (PRTR-C17.I1) and from the Autonomous Community of Galicia within the framework of the Biotechnology Plan Applied to Health to SL.
dc.description.sponsorshipXunta de Galicia; IN607A2024/09
dc.identifier.citationAja-Macaya, P., Conde-Pérez, K., Trigo-Tasende, N. et al. Nanopore full length 16S rRNA gene sequencing increases species resolution in bacterial biomarker discovery. Sci Rep 15, 26486 (2025). https://doi.org/10.1038/s41598-025-10999-8
dc.identifier.doi10.1038/s41598-025-10999-8
dc.identifier.issn2045-2322
dc.identifier.urihttps://hdl.handle.net/2183/45937
dc.language.isoeng
dc.publisherSpringer Nature
dc.relation.projectIDinfo:eu-repo/grantAgreement/ISCIII/Programa Estatal de Generación de Conocimiento y Fortalecimiento del Sistema Español de I+D+I/PI20%2F00413/ES/FARMACOMICROBIOMICA Y MEDICINA PERSONALIZADA EN LA TERAPIA DEL CANCER COLORECTAL
dc.relation.projectIDinfo:eu-repo/grantAgreement/ISCIII/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PI23%2F00696/ES/PAPEL DE BACTERIAS ANAEROBIAS EN EL CÁNCER DE COLON Y VALIDACIÓN DE BIOMARCADORES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Programa Estatal de I+D+i Orientada a los Retos de la Sociedad/RTI2018-102032-B-I00/ES/EFECTOS LOCALES Y SISTEMICOS DE LA MICROBIOTA ORAL
dc.relation.projectIDinfo:eu-repo/grantAgreement/ISCIII/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/CB21%2F13%2F00055/ES/
dc.relation.urihttps://doi.org/10.1038/s41598-025-10999-8
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectIllumina
dc.subjectOxford Nanopore
dc.subjectColorectal cancer
dc.subject16S rRNA
dc.subjectBiomarkers
dc.subjectGut microbiome
dc.subjectMetabarcoding
dc.subjectV1V9
dc.titleNanopore full length 16S rRNA gene sequencing increases species resolution in bacterial biomarker discovery
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication55bfba4e-d15b-4c84-9894-ac53c2278caf
relation.isAuthorOfPublication909e08d1-6ed1-4b99-9e9e-c64eb72e7dea
relation.isAuthorOfPublicationa68d08dc-09ba-453e-8928-7c08e5b14a4b
relation.isAuthorOfPublication.latestForDiscovery55bfba4e-d15b-4c84-9894-ac53c2278caf

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ladra_Susana_2025_Nanopore_full_length_16S_rRNA_gene_sequencing_increases_species_resolution.pdf
Size:
4.56 MB
Format:
Adobe Portable Document Format