High-performance computing selection of models of DNA substitution for multicore clusters

UDC.coleccionInvestigaciónes_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.endPage125es_ES
UDC.grupoInvGrupo de Arquitectura de Computadores (GAC)es_ES
UDC.issue1es_ES
UDC.journalTitleThe International Journal of High Performance Computing Applicationses_ES
UDC.startPage112es_ES
UDC.volume28es_ES
dc.contributor.authorDarriba, Diego
dc.contributor.authorTaboada, Guillermo L.
dc.contributor.authorDoallo, Ramón
dc.contributor.authorPosada, David
dc.date.accessioned2018-08-28T10:05:40Z
dc.date.available2018-08-28T10:05:40Z
dc.date.issued2014
dc.description.abstract[Abstract] This paper presents the high-performance computing (HPC) support of jModelTest2, the most popular bioinformatic tool for the statistical selection of models of DNA substitution. As this can demand vast computational resources, especially in terms of processing power, jModelTest2 implements three parallel algorithms for model selection: (1) a multithreaded implementation for shared memory architectures; (2) a message-passing implementation for distributed memory architectures, such as clusters; and (3) a hybrid shared/distributed memory implementation for clusters of multicore nodes, combining the workload distribution across cluster nodes with a multithreaded model optimization within each node. The main limitation of the shared and distributed versions is the workload imbalance that generally appears when using more than 32 cores, a direct consequence of the heterogeneity in the computational cost of the evaluated models. The hybrid shared/distributed memory version overcomes this issue reducing the workload imbalance through a thread-based decomposition of the most costly model optimization tasks. The performance evaluation of this HPC application on a 40-core shared memory system and on a 528-core cluster has shown high scalability, with speedups of the multithreaded version of up to 32, and up to 257 for the hybrid shared/distributed memory implementation. This can represent a reduction in the execution time of some analyses from 4 days down to barely 20 minutes. The implementation of the three parallel execution strategies of jModelTest2 presented in this paper are available under a GPL license at http://code.google.com/jmodeltest2.es_ES
dc.description.sponsorshipEuropean Research Council; ERC-2007-Stg 203161-PHYGENOM to D.P.es_ES
dc.description.sponsorshipMinisterio de Ciencia y Educación; BFU2009-08611 to D.P.es_ES
dc.description.sponsorshipMinisterio de Ciencia y Educación; TIN2010-16735 to R.D.es_ES
dc.identifier.citationDARRIBA, Diego, et al. High-performance computing selection of models of DNA substitution for multicore clusters. The International Journal of High Performance Computing Applications, 2014, vol. 28, no 1, p. 112-125.es_ES
dc.identifier.doi10.1177/1094342013495095
dc.identifier.issn1094-3420
dc.identifier.issn1741-2846
dc.identifier.urihttp://hdl.handle.net/2183/20978
dc.language.isoenges_ES
dc.publisherSage Publications Ltd.es_ES
dc.relation.urihttps://doi.org/10.1177/1094342013495095es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectHigh-Performance computing (HPC)es_ES
dc.subjectMulticore clusterses_ES
dc.subjectMessage-Passing in Java (MPJ)es_ES
dc.subjectPhylogenyes_ES
dc.subjectNucleotide substitutiones_ES
dc.subjectPerformance evaluationes_ES
dc.titleHigh-performance computing selection of models of DNA substitution for multicore clusterses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication64f4176e-8f06-4807-b964-3c474b876a4d
relation.isAuthorOfPublicationb3302f65-05d3-4b2c-b8b3-8503e58bba5e
relation.isAuthorOfPublication.latestForDiscovery64f4176e-8f06-4807-b964-3c474b876a4d

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Darriba_D._High-Performance_Computing_Selection_of_Models_of_DNA_Substitution_for_Multicore_Clusters_2014.pdf
Size:
950.82 KB
Format:
Adobe Portable Document Format
Description: