MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage2764es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.issue17es_ES
UDC.journalTitleBioinformaticses_ES
UDC.startPage2762es_ES
UDC.volume33es_ES
dc.contributor.authorExpósito, Roberto R.
dc.contributor.authorVeiga, Jorge
dc.contributor.authorGonzález-Domínguez, Jorge
dc.contributor.authorTouriño, Juan
dc.date.accessioned2018-07-04T14:48:36Z
dc.date.embargoEndDate2018-09-02es_ES
dc.date.embargoLift2018-09-02
dc.date.issued2017
dc.descriptionThis is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record Roberto R. Expósito, Jorge Veiga, Jorge González-Domínguez, Juan Touriño; MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud, Bioinformatics, Volume 33, Issue 17, 1 September 2017, Pages 2762–2764 is available online at: https://doi.org/10.1093/bioinformatics/btx307es_ES
dc.description.abstract[Abstract] This article presents MarDRe, a de novo cloud-ready duplicate and near-duplicate removal tool that can process single- and paired-end reads from FASTQ/FASTA datasets. MarDRe takes advantage of the widely adopted MapReduce programming model to fully exploit Big Data technologies on cloud-based infrastructures. Written in Java to maximize cross-platform compatibility, MarDRe is built upon the open-source Apache Hadoop project, the most popular distributed computing framework for scalable Big Data processing. On a 16-node cluster deployed on the Amazon EC2 cloud platform, MarDRe is up to 8.52 times faster than a representative state-of-the-art tool.es_ES
dc.description.sponsorshipMinisterio de Economia y Competitividad; TIN2016-75845-Pes_ES
dc.description.sponsorshipMinisterio de Educación; FPU014/02805es_ES
dc.identifier.citationRoberto R. Expósito, Jorge Veiga, Jorge González-Domínguez, Juan Touriño; MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud, Bioinformatics, Volume 33, Issue 17, 1 September 2017, Pages 2762–2764, https://doi.org/10.1093/bioinformatics/btx307es_ES
dc.identifier.doi10.1093/bioinformatics/btx307
dc.identifier.issn1367-4803
dc.identifier.issn1367-4811
dc.identifier.urihttp://hdl.handle.net/2183/20848
dc.language.isoenges_ES
dc.publisherOxford University Presses_ES
dc.relation.urihttps://doi.org/10.1093/bioinformatics/btx307es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectMarDRees_ES
dc.subjectApache Hadoopes_ES
dc.subjectBig Dataes_ES
dc.subjectCloud platformes_ES
dc.subjectMapReducees_ES
dc.subjectCloud-ready duplicatees_ES
dc.titleMarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloudes_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication6a6967e9-a4f5-4006-afee-4fc9d5f3a658
relation.isAuthorOfPublication0ef9135c-b7c9-48f1-8f06-55c025236916
relation.isAuthorOfPublication84d13059-7f4b-4cb5-ac65-0e07a77271f0
relation.isAuthorOfPublication86e306a5-99a1-4c43-8faa-720f0a9f0a34
relation.isAuthorOfPublication.latestForDiscovery6a6967e9-a4f5-4006-afee-4fc9d5f3a658

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Expósito_R.R._MarDRe_efficient_MapReduce-based_removal_2017.pdf
Size:
92.37 KB
Format:
Adobe Portable Document Format
Description: