Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Figueira-Domínguez, J. Guzmán
Remeseiro, Beatriz

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Figueira-Domínguez, J. G., B. Remeseiro, and V. Bolón-Canedo. 2026. “ Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies.” Expert Systems 43, no. 2: e70188. https://doi.org/10.1111/exsy.70188

Type of academic work

Academic degree

Abstract

[Abstract]: As benchmark image datasets expand in sample size and feature complexity, the challenge of managing increased dimensionality becomes apparent. Contrary to the expectation that more features equate to enhanced information and improved outcomes, the curse of dimensionality often hampers performance. This paper reviews existing literature on filter feature selection techniques applied to image features, highlighting their use in both classical and deep-learning-based feature extraction methods. Building on these findings, this study proposes a scalable approach for image feature extraction and selection using Big Data technologies, specifically Apache Spark, to efficiently process large and high-dimensional datasets. The proposed framework integrates filter-based feature selection methods within a distributed environment to evaluate their effectiveness in image analysis tasks. Several experiments were performed to compare the results using feature selection techniques with various reduction percentages. Results show that significant feature reduction can be achieved without compromising classification accuracy, demonstrating the potential of Spark-based distributed processing for large-scale image analytics.

Description

The data that support the findings of this study are openly available inImagenet Features Extracted with VGG-19 at https://zenodo.org/records/12791398

Rights

Attribution 4.0 International
Attribution 4.0 International

Except where otherwise noted, this item's license is described as Attribution 4.0 International