Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning
Use this link to cite
http://hdl.handle.net/2183/34389
Except where otherwise noted, this item's license is described as Atribución-NoComercial-SinDerivadas 3.0 España
Collections
- GI-LIDIA - Artigos [64]
Metadata
Show full item recordTitle
Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuningAuthor(s)
Date
2022-08Citation
J. Meira, C. Eiras-Franco, V. Bolón-Canedo, G. Marreiros, y A. Alonso-Betanzos, «Fast anomaly detection with locality-sensitive hashing and hyperparameter autotuning», Information Sciences, vol. 607, pp. 1245-1264, ago. 2022, doi: 10.1016/j.ins.2022.06.035.
Abstract
[Abstract]: This paper presents LSHAD, an anomaly detection (AD) method based on Locality Sensitive Hashing (LSH), capable of dealing with large-scale datasets. The resulting algorithm is highly parallelizable and its implementation in Apache Spark further increases its ability to handle very large datasets. Moreover, the algorithm incorporates an automatic hyperparameter tuning mechanism so that users do not have to implement costly manual tuning. Our LSHAD method is novel as both hyperparameter automation and distributed properties are not usual in AD techniques. Our results for experiments with LSHAD across a variety of datasets point to state-of-the-art AD performance while handling much larger datasets than state-of-the-art alternatives. In addition, evaluation results for the tradeoff between AD performance and scalability show that our method offers significant advantages over competing methods.
Keywords
Anomaly detection
Unsupervised learning
AutoML
Scalability
Big data
Unsupervised learning
AutoML
Scalability
Big data
Editor version
Rights
Atribución-NoComercial-SinDerivadas 3.0 España CC BY-NC-ND 4.0
ISSN
0020-0255