• A comparison of performance of K-complex classification methods using feature selection 

      Hernández-Pereira, Elena; Bolón-Canedo, Verónica; Sánchez-Maroño, Noelia; Álvarez-Estévez, Diego; Moret-Bonillo, Vicente; Alonso-Betanzos, Amparo (2016-01-20)
      [Abstract] The main objective of this work is to obtain a method that achieves the best accuracy results with a low false positive rate in the classification of K-complexes, a kind of transient waveform found in the ...
    • A novel framework for generic Spark workload characterization and similar pattern recognition using machine learning 

      Garralda-Barrio, Mariano; Eiras-Franco, Carlos; Bolón-Canedo, Verónica (Elsevier, 2024-07)
      [Abstract]: Comprehensive workload characterization plays a pivotal role in comprehending Spark applications, as it enables the analysis of diverse aspects and behaviors. This understanding is indispensable for devising ...
    • A review of green artificial intelligence: Towards a more sustainable future 

      Bolón-Canedo, Verónica; Morán-Fernández, Laura; Cancela, Brais; Alonso-Betanzos, Amparo (Elsevier B.V., 2024-09-28)
      [Abstract]: Green artificial intelligence (AI) is more environmentally friendly and inclusive than conventional AI, as it not only produces accurate results without increasing the computational cost but also ensures that ...
    • A scalable saliency-based feature selection method with instance-level information 

      Cancela, Brais; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo; Gama, João (Elsevier, 2019-11)
      [Abstract]: Classic feature selection techniques remove irrelevant or redundant features to achieve a subset of relevant features in compact models that are easier to interpret and so improve knowledge extraction. Most ...
    • Anomaly Detection on Natural Language Processing to Improve Predictions on Tourist Preferences 

      Meira, Jorge; Carneiro, João; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo; Novais, Paulo; Marreiros, Goreti (MDPI, 2022)
      [Abstract] Argumentation-based dialogue models have shown to be appropriate for decision contexts in which it is intended to overcome the lack of interaction between decision-makers, either because they are dispersed, they ...
    • CUDA acceleration of MI-based feature selection methods 

      Beceiro, Bieito; González-Domínguez, Jorge; Morán-Fernández, Laura; Bolón-Canedo, Verónica; Touriño, Juan (Elsevier, 2024-08)
      [Abstract]: Feature selection algorithms are necessary nowadays for machine learning as they are capable of removing irrelevant and redundant information to reduce the dimensionality of the data and improve the quality of ...
    • Data analysis and feature selection for predictive maintenance: A case-study in the metallurgic industry 

      Fernandes, Marta; Canito, Alda; Bolón-Canedo, Verónica; Conceição, Luís; Praça, Isabel; Marreiros, Goreti (Elsevier Ltd, 2019-06)
      [Abstract]: Proactive Maintenance practices are becoming more standard in industrial environments, with a direct and profound impact on the competitivity within the sector. These practices demand the continuous monitorization ...
    • Data-driven predictive maintenance framework for railway systems 

      Meira, Jorge; Veloso, Bruno; Bolón-Canedo, Verónica; Marreiros, Goreti; Alonso-Betanzos, Amparo; Gama, João (IOS Press, 2023)
      [Abstract]: The emergence of the Industry 4.0 trend brings automation and data exchange to industrial manufacturing. Using computational systems and IoT devices allows businesses to collect and deal with vast volumes of ...
    • Dealing with heterogeneity in the context of distributed feature selection for classification 

      Morillo-Salas, José Luis; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo (Springer, 2021)
      [Abstract]: Advances in the information technologies have greatly contributed to the advent of larger datasets. These datasets often come from distributed sites, but even so, their large size usually means they cannot be ...
    • Distributed classification based on distances between probability distributions in feature space 

      Montero Manso, Pablo; Morán-Fernández, Laura; Bolón-Canedo, Verónica; Vilar, José; Alonso-Betanzos, Amparo (Elsevier, 2019-09)
      [Abstract]: We consider a distributed framework where training and test samples drawn from the same distribution are available, with the training instances spread across disjoint nodes. In this setting, a novel learning ...
    • Do all roads lead to Rome? Studying distance measures in the context of machine learning 

      Blanco Mallo, Eva; Morán-Fernández, Laura; Remeseiro, Beatriz; Bolón-Canedo, Verónica (Elsevier Ltd, 2023-09)
      [Abstract]: Many machine learning and data mining tasks are based on distance measures, so a large amount of literature addresses this aspect somehow. Due to the broad scope of the topic, this paper aims to provide an ...
    • E2E-FS: An End-to-End Feature Selection Method for Neural Networks 

      Cancela, Brais; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo (IEEE, 2023-07)
      [Abstract]: Classic embedded feature selection algorithms are often divided in two large groups: tree-based algorithms and LASSO variants. Both approaches are focused in different aspects: while the tree-based algorithms ...
    • Ensembles for feature selection: A review and future trends 

      Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo (Elsevier, 2019)
      [Abstract]: Ensemble learning is a prolific field in Machine Learning since it is based on the assumption that combining the output of multiple models is better than using a single model, and it usually provides good ...
    • Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning 

      Meira, Jorge; Eiras-Franco, Carlos; Bolón-Canedo, Verónica; Marreiros, Goreti; Alonso-Betanzos, Amparo (Elsevier, 2022-08)
      [Abstract]: This paper presents LSHAD, an anomaly detection (AD) method based on Locality Sensitive Hashing (LSH), capable of dealing with large-scale datasets. The resulting algorithm is highly parallelizable and its ...
    • Feature selection for domain adaptation using complexity measures and swarm intelligence 

      Castillo-García, G.; Morán-Fernández, Laura; Bolón-Canedo, Verónica (Elsevier B.V., 2023-09-01)
      [Abstract]: Particle Swarm Optimization is an optimization algorithm that mimics the behaviour of a flock of birds, setting multiple particles that explore the search space guided by a fitness function in order to find the ...
    • Feature selection with limited bit depth mutual information for portable embedded systems 

      Morán-Fernández, Laura; Sechidis, Konstantinos; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo; Brown, Gavin (Elsevier, 2020-06)
      [Abstract]: Since wearable computing systems have grown in importance in the last years, there is an increased interest in implementing machine learning algorithms with reduced precision parameters/computations. Not only ...
    • Finding a needle in a haystack: insights on feature selection for classification tasks 

      Morán-Fernández, Laura; Bolón-Canedo, Verónica (Springer, 2024-04)
      [Abstract]: The growth of Big Data has resulted in an overwhelming increase in the volume of data available, including the number of features. Feature selection, the process of selecting relevant features and discarding ...
    • How Important Is Data Quality? Best Classifiers vs Best Features 

      Morán-Fernández, Laura; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo (Elsevier, 2021)
      [Abstract] The task of choosing the appropriate classifier for a given scenario is not an easy-to-solve question. First, there is an increasingly high number of algorithms available belonging to different families. And ...
    • Insights into distributed feature ranking 

      Bolón-Canedo, Verónica; Sechidis, Konstantinos; Sánchez-Maroño, Noelia; Alonso-Betanzos, Amparo; Brown, Gavin (Elsevier, 2019)
      [Abstract]: In an era in which the volume and complexity of datasets is continuously growing, feature selection techniques have become indispensable to extract useful information from huge amounts of data. However, existing ...
    • Low-Precision Feature Selection on Microarray Data: An Information Theoretic Approach 

      Morán-Fernández, Laura; Bolón-Canedo, Verónica; Alonso-Betanzos, Amparo (Springer, 2022)
      [Abstract] The number of interconnected devices, such as personal wearables, cars, and smart-homes, surrounding us every day has recently increased. The Internet of Things devices monitor many processes, and have the ...