CUDA acceleration of MI-based feature selection methods

Beceiro, Bieito; González-Domínguez, Jorge; Morán-Fernández, Laura; Bolón-Canedo, Verónica; Touriño, Juan

Use this link to cite:

http://hdl.handle.net/2183/36386

CUDA acceleration of MI-based feature selection methods

Files

Beceiro_Bieito_2024_CUDA_acceleration_of_MI_based_feature_selection_methods.pdf (2.98 MB)

Identifiers

URI: http://hdl.handle.net/2183/36386

DOI: 10.1016/j.jpdc.2024.104901

Publication date

2024-08

Authors

Beceiro, Bieito

González-Domínguez, Jorge

Morán-Fernández, Laura

Bolón-Canedo, Verónica

Touriño, Juan

Bibliographic citation

Beceiro, B., González-Domínguez, J., Morán-Fernández, L., Bolón-Canedo, V., & Touriño, J. (2024). CUDA acceleration of MI-based feature selection methods. Journal of Parallel and Distributed Computing, 104901. https://doi.org/10.1016/j.jpdc.2024.104901

Abstract

[Abstract]: Feature selection algorithms are necessary nowadays for machine learning as they are capable of removing irrelevant and redundant information to reduce the dimensionality of the data and improve the quality of subsequent analyses. The problem with current feature selection approaches is that they are computationally expensive when processing large datasets. This work presents parallel implementations for Nvidia GPUs of three highly-used feature selection methods based on the Mutual Information (MI) metric: mRMR, JMI and DISR. Publicly available code includes not only CUDA implementations of the general methods, but also an adaptation of them to work with low-precision fixed point in order to further increase their performance on GPUs. The experimental evaluation was carried out on two modern Nvidia GPUs (Turing T4 and Ampere A100) with highly satisfactory results, achieving speedups of up to 283x when compared to state-of-the-art C implementations.