Use this link to cite:
https://hdl.handle.net/2183/47231 Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies
Loading...
Identifiers
Publication date
Authors
Advisors
Other responsabilities
Journal Title
Bibliographic citation
Figueira-Domínguez, J. G., B. Remeseiro, and V. Bolón-Canedo. 2026. “ Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies.” Expert Systems 43, no. 2: e70188. https://doi.org/10.1111/exsy.70188
Type of academic work
Academic degree
Abstract
[Abstract]: As benchmark image datasets expand in sample size and feature complexity, the challenge of managing increased dimensionality becomes apparent. Contrary to the expectation that more features equate to enhanced information and improved outcomes, the curse of dimensionality often hampers performance. This paper reviews existing literature on filter feature selection techniques applied to image features, highlighting their use in both classical and deep-learning-based feature extraction methods. Building on these findings, this study proposes a scalable approach for image feature extraction and selection using Big Data technologies, specifically Apache Spark, to efficiently process large and high-dimensional datasets. The proposed framework integrates filter-based feature selection methods within a distributed environment to evaluate their effectiveness in image analysis tasks. Several experiments were performed to compare the results using feature selection techniques with various reduction percentages. Results show that significant feature reduction can be achieved without compromising classification accuracy, demonstrating the potential of Spark-based distributed processing for large-scale image analytics.
Description
The data that support the findings of this study are openly available inImagenet Features Extracted with VGG-19 at https://zenodo.org/records/12791398
Editor version
Rights
Attribution 4.0 International








