Mostrar o rexistro simple do ítem

dc.contributor.authorExpósito, Roberto R.
dc.contributor.authorGonzález-Domínguez, Jorge
dc.contributor.authorTouriño, Juan
dc.date.accessioned2019-02-15T18:23:34Z
dc.date.available2019-02-15T18:23:34Z
dc.date.issued2018-07-31
dc.identifier.citationExpósito RR, González-Domínguez J, Touriño J (2018) HSRA: Hadoop-based spliced read aligner for RNA sequencing data. PLoS ONE 13(7): e0201483. https://doi.org/10.1371/journal.pone.0201483es_ES
dc.identifier.issn1932-6203
dc.identifier.urihttp://hdl.handle.net/2183/21813
dc.description.abstract[Abstract] Nowadays, the analysis of transcriptome sequencing (RNA-seq) data has become the standard method for quantifying the levels of gene expression. In RNA-seq experiments, the mapping of short reads to a reference genome or transcriptome is considered a crucial step that remains as one of the most time-consuming. With the steady development of Next Generation Sequencing (NGS) technologies, unprecedented amounts of genomic data introduce significant challenges in terms of storage, processing and downstream analysis. As cost and throughput continue to improve, there is a growing need for new software solutions that minimize the impact of increasing data volume on RNA read alignment. In this work we introduce HSRA, a Big Data tool that takes advantage of the MapReduce programming model to extend the multithreading capabilities of a state-of-the-art spliced read aligner for RNA-seq data (HISAT2) to distributed memory systems such as multi-core clusters or cloud platforms. HSRA has been built upon the Hadoop MapReduce framework and supports both single- and paired-end reads from FASTQ/FASTA datasets, providing output alignments in SAM format. The design of HSRA has been carefully optimized to avoid the main limitations and major causes of inefficiency found in previous Big Data mapping tools, which cannot fully exploit the raw performance of the underlying aligner. On a 16-node multi-core cluster, HSRA is on average 2.3 times faster than previous Hadoop-based tools. Source code in Java as well as a user’s guide are publicly available for download at http://hsra.dec.udc.es.es_ES
dc.description.sponsorshipMinisterio de Economía, Industria y Competitividad; TIN2016-75845-Pes_ES
dc.description.sponsorshipXunta de Galicia; ED431G/01es_ES
dc.language.isoenges_ES
dc.publisherPublic Library of Sciencees_ES
dc.relation.urihttps://doi.org/10.1371/journal.pone.0201483es_ES
dc.subjectSequence alignmentes_ES
dc.subjectData processinges_ES
dc.subjectGenome analysises_ES
dc.subjectRNAes_ES
dc.subjectSequencinges_ES
dc.subjectMemoryes_ES
dc.subjectRNA analysises_ES
dc.subjectPreprocessinges_ES
dc.subjectRNA alignmentes_ES
dc.titleHSRA: Hadoop-based spliced read aligner for RNA sequencing dataes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitlePL o S Onees_ES
UDC.volume13es_ES
UDC.issue7es_ES
dc.identifier.doi10.1371/journal.pone.0201483


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem