Enhancing text recognition on Tor Darknet images

Use this link to cite
http://hdl.handle.net/2183/23704
Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 4.0
Collections
Metadata
Show full item recordTitle
Enhancing text recognition on Tor Darknet imagesDate
2019Citation
Blanco Medina, P., Alegre, E., Al-Nabki, Mhd., Chaves, D., Fidalgo Fernández, E. (2019). Enhancing text recognition on Tor Darknet images. En XL Jornadas de Automática: libro de actas, Ferrol, 4-6 de septiembre de 2019 (pp. 828-835). DOI capítulo: https://doi.org/10.17979/spudc.9788497497169.828. DOI libro: https://doi.org/10.17979/spudc.9788497497169
Abstract
[Abstract] Text Spotting can be used as an approach to retrieve
information found in images that cannot be
obtained otherwise, by performing text detection
rst and then recognizing the located text. Examples
of images to apply this task on can be
found in Tor network images, which contain information
that may not be found in plain text. When
comparing both stages, the latter performs worse
due to the low resolution of the cropped areas
among other problems. Focusing on the recognition
part of the pipeline, we study the performance
of ve recognition approaches, based on state-ofthe-
art neural network models, standalone OCR,
and OCR enhancements. We complement them
using string-matching techniques with two lexicons
and compare computational time on ve
di erent datasets, including Tor network images.
Our nal proposal achieved 39,70% precision of
text recognition in a custom dataset of images
taken from Tor domains
Keywords
Text Spotting
Text Recognition
OCR
Cybersecurity
Tor darknet
Text Recognition
OCR
Cybersecurity
Tor darknet
Editor version
Rights
Atribución-NoComercial-CompartirIgual 4.0
ISBN
978-84-9749-716-9