On the scalability of feature selection methods on high-dimensional data
Use este enlace para citar
http://hdl.handle.net/2183/36408Colecciones
- Investigación (FIC) [1576]
Metadatos
Mostrar el registro completo del ítemTítulo
On the scalability of feature selection methods on high-dimensional dataAutor(es)
Fecha
2018Cita bibliográfica
Bolón-Canedo, V., Rego-Fernández, D., Peteiro-Barral, D. et al. On the scalability of feature selection methods on high-dimensional data. Knowl Inf Syst 56, 395–442 (2018). https://doi.org/10.1007/s10115-017-1140-3
Resumen
[Abstract]: Lately, derived from the explosion of high dimensionality, researchers in machine learning became interested not only in accuracy, but also in scalability. Although scalability of learning methods is a trending issue, scalability of feature selection methods has not received the same amount of attention. This research analyzes the scalability of state-of-the-art feature selection methods, belonging to filter, embedded and wrapper approaches. For this purpose, several new measures are presented, based not only on accuracy but also on execution time and stability. The results on seven classical artificial datasets are presented and discussed, as well as two cases study analyzing the particularities of microarray data and the effect of redundancy. Trying to check whether the results can be generalized, we included some experiments with two real datasets. As expected, filters are the most scalable feature selection approach, being INTERACT, ReliefF and mRMR the most accurate methods.
Palabras clave
Feature selection
Scalability
Big data
High-dimensionality
Scalability
Big data
High-dimensionality
Descripción
This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/s10115-017-1140-3.
Versión del editor
Derechos
Todos os dereitos reservados. All rights reserved. Copyright © 2017, Springer-Verlag London Ltd., part of Springer Nature
ISSN
0219-1377
0219-3116
0219-3116