Inference of Viral Quasispecies With a Paired de Bruijn Graph

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.endPage481es_ES
UDC.grupoInvLaboratorio de Bases de Datos (LBD)es_ES
UDC.issue4es_ES
UDC.journalTitleBioinformaticses_ES
UDC.startPage473es_ES
UDC.volume37es_ES
dc.contributor.authorFreire, Borja
dc.contributor.authorLadra, Susana
dc.contributor.authorParamá, José R.
dc.contributor.authorSalmela, Leena
dc.date.accessioned2024-11-14T19:16:37Z
dc.date.available2024-11-14T19:16:37Z
dc.date.issued2021-02
dc.descriptionThis is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics, published by Oxford University Press, following peer review. The version of record of the article is available online at: https://doi.org/10.1093/bioinformatics/btaa782.es_ES
dc.descriptionviaDBG is implemented in C++ and it is publicly available at https://bitbucket.org/bfreirec1/viadbg. All datasets used in this article are publicly available at https://bitbucket.org/bfreirec1/data-viadbg/.es_ES
dc.descriptionSupplementary data are available at Bioinformatics online.es_ES
dc.description.abstract[Abstract] Motivation: RNA viruses exhibit a high mutation rate and thus they exist in infected cells as a population of closely related strains called viral quasispecies. The viral quasispecies assembly problem asks to characterize the quasispecies present in a sample from high-throughput sequencing data. We study the de novo version of the problem, where reference sequences of the quasispecies are not available. Current methods for assembling viral quasispecies are either based on overlap graphs or on de Bruijn graphs. Overlap graph-based methods tend to be accurate but slow, whereas de Bruijn graph-based methods are fast but less accurate. Results: We present viaDBG, which is a fast and accurate de Bruijn graph-based tool for de novo assembly of viral quasispecies. We first iteratively correct sequencing errors in the reads, which allows us to use large k-mers in the de Bruijn graph. To incorporate the paired-end information in the graph, we also adapt the paired de Bruijn graph for viral quasispecies assembly. These features enable the use of long-range information in contig construction without compromising the speed of de Bruijn graph-based approaches. Our experimental results show that viaDBG is both accurate and fast, whereas previous methods are either fast or accurate but not both. In particular, viaDBG has comparable or better accuracy than SAVAGE, while being at least nine times faster. Furthermore, the speed of viaDBG is comparable to PEHaplo but viaDBG is able to retrieve also low abundance quasispecies, which are often missed by PEHaplo. Availability and implementation: viaDBG is implemented in C++ and it is publicly available at https://bitbucket.org/bfreirec1/viadbg. All datasets used in this article are publicly available at https://bitbucket.org/bfreirec1/data-viadbg/. Supplementary information: Supplementary data are available at Bioinformatics online.es_ES
dc.description.sponsorshipThis work was supported by EU H2020 MSCA RISE BIRDS: 690941; MCIU [TIN2016-78011-C4-1-R; TIN2016-77158-C4-3-R; FPU17/02742; RTC-2017-5908-7]; Xunta de Galicia [ED431C 2017/58; ED431G/01; IN848D-2017-2350417; IN852A 2018/14]; and from Academy of Finland [308030 and 314170].es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2017/58es_ES
dc.description.sponsorshipXunta de Galicia; ED431G/01es_ES
dc.description.sponsorshipXunta de Galicia; IN848D-2017-2350417es_ES
dc.description.sponsorshipXunta de Galicia; IN852A 2018/14es_ES
dc.description.sponsorshipFinlandia. Academy of Finland; 308030es_ES
dc.description.sponsorshipFinlandia. Academy of Finland; 314170es_ES
dc.description.urihttps://bitbucket.org/bfreirec1/viadbg
dc.description.urihttps://bitbucket.org/bfreirec1/data-viadbg/
dc.identifier.citationBorja Freire, Susana Ladra, Jose R Paramá, Leena Salmela, Inference of viral quasispecies with a paired de Bruijn graph, Bioinformatics, Volume 37, Issue 4, February 2021, Pages 473–481, https://doi.org/10.1093/bioinformatics/btaa782es_ES
dc.identifier.doi10.1093/bioinformatics/btaa782
dc.identifier.issn1367-4811
dc.identifier.urihttp://hdl.handle.net/2183/40132
dc.language.isoenges_ES
dc.publisherOxford University Presses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-78011-C4-1-R/ES/DATOS 4.0: RETOS Y SOLUCIONES-UDC/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-77158-C4-3-R/ES/VELOCITY: PROCESADO EFICIENTE DE BIG DATA ESPAZO-TEMPORAL PARA FLATCITYes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MECD/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/FPU17%2F02742/ES/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTC-2017-5908-7/ES/STEPS. Soluciones Tecnológicas para la Evolución en la Prestación de Servicios en campo/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/690941es_ES
dc.relation.urihttps://doi.org/10.1093/bioinformatics/btaa782es_ES
dc.rights© 2020, © The Author(s) 2020. Published by Oxford University Press. All rights reserved.es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectGenome assemblyes_ES
dc.subjectDe Bruijn graphses_ES
dc.subjectViral quasispecies assemblyes_ES
dc.titleInference of Viral Quasispecies With a Paired de Bruijn Graphes_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication0a956b95-a8d9-42b5-ad5c-62ddebb2b1ff
relation.isAuthorOfPublication55bfba4e-d15b-4c84-9894-ac53c2278caf
relation.isAuthorOfPublication8e2da7aa-f6fb-47b1-baec-9de8dd1a067e
relation.isAuthorOfPublication.latestForDiscovery0a956b95-a8d9-42b5-ad5c-62ddebb2b1ff

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Freire_Borja_2021_Inference_of_viral_quasispecies_with_a_paired_de_Bruijn_graph.pdf
Size:
512.83 KB
Format:
Adobe Portable Document Format
Description: