MPI-dot2dot: A Parallel Tool to Find DNA Tandem Repeats on Multicore Clusters

Use this link to cite
http://hdl.handle.net/2183/30194
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 4.0 Internacional
Collections
- Investigación (FIC) [1654]
Metadata
Show full item recordTitle
MPI-dot2dot: A Parallel Tool to Find DNA Tandem Repeats on Multicore ClustersDate
2022Citation
González-Domínguez, J., Martín-Martínez, J.M. & Expósito, R.R. MPI-dot2dot: A parallel tool to find DNA tandem repeats on multicore clusters. J Supercomput 78, 4217–4235 (2022). https://doi.org/10.1007/s11227-021-04025-7
Abstract
[Abstract] Tandem Repeats (TRs) are segments that occur several times in a DNA sequence, and each copy is adjacent to other. In the last few years, TRs have gained significant attention as they are thought to be related with certain human diseases. Therefore, identifying and classifying TRs have become a highly important task in bioinformatics in order to analyze their disorders and relationships with illnesses. Dot2dot, a tool recently developed to find TRs, provides more accurate results than the previous state-of-the-art, but it requires a long execution time even when using multiple threads. This work presents MPI-dot2dot, a novel version of this tool that combines MPI and OpenMP so that it can be executed in a cluster of multicore nodes and thus reduces its execution time. The performance of this new parallel implementation has been tested using different real datasets. Depending on the characteristics of the input genomes, it is able to obtain the same biological results as Dot2dot but more than 100 times faster on a 16-node multicore cluster (384 cores). MPI-dot2dot is publicly available to download from https://sourceforge.net/projects/mpi-dot2dot.
Keywords
Tandem repeat
High performance computing
MPI
OpenMP
Bioinformatics
High performance computing
MPI
OpenMP
Bioinformatics
Description
Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG
Editor version
Rights
Atribución-NoComercial-SinDerivadas 4.0 Internacional
ISSN
1573-0484