Parallelizing Epistasis Detection in GWAS on FPGA and GPU-Accelerated Computing Systems

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Wienbrandt, Lars
Kässens, Jan Christian
Ellinghaus, David
Schimmler, Manfred
Schmidt, Bertil

Advisors

Other responsabilities

Journal Title

Bibliographic citation

J. González-Domínguez, L. Wienbrandt, J. C. Kässens, D. Ellinghaus, M. Schimmler and B. Schmidt, "Parallelizing Epistasis Detection in GWAS on FPGA and GPU-Accelerated Computing Systems," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 12, no. 5, pp. 982-994, 1 Sept.-Oct. 2015. doi: 10.1109/TCBB.2015.2389958

Type of academic work

Academic degree

Abstract

[Abstract] High-throughput genotyping technologies (such as SNP-arrays) allow the rapid collection of up to a few million genetic markers of an individual. Detecting epistasis (based on 2-SNP interactions) in Genome-Wide Association Studies is an important but time consuming operation since statistical computations have to be performed for each pair of measured markers. Computational methods to detect epistasis therefore suffer from prohibitively long runtimes; e.g., processing a moderately-sized dataset consisting of about 500,000 SNPs and 5,000 samples requires several days using state-of-the-art tools on a standard 3 GHz CPU. In this paper, we demonstrate how this task can be accelerated using a combination of fine-grained and coarse-grained parallelism on two different computing systems. The first architecture is based on reconfigurable hardware (FPGAs) while the second architecture uses multiple GPUs connected to the same host. We show that both systems can achieve speedups of around four orders-of-magnitude compared to the sequential implementation. This significantly reduces the runtimes for detecting epistasis to only a few minutes for moderatelysized datasets and to a few hours for large-scale datasets.

Description

This is a post-peer-review, pre-copyedit version of an article published in IEEE - ACM Transactions on Computational Biology and Bioinformatics. The final authenticated version is available online at: http://dx.doi.org/10.1109/TCBB.2015.2389958

Rights