Oculus-Crawl, a software tool for building datasets for computer vision tasks

Use this link to cite
http://hdl.handle.net/2183/25870
Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 4.0 España
Collections
Metadata
Show full item recordTitle
Oculus-Crawl, a software tool for building datasets for computer vision tasksDate
2017Citation
Paz Centeno, I., Fidalgo, E., Alegre Gutiérrez, E., Al-Nabki, M. W. Oculus-Crawl, a software tool for building datasets for computer vision tasks. En Actas de las XXXVIII Jornadas de Automática, Gijón, 6-8 de Septiembre de 2017 (pp.991-998). DOI capítulo: https://doi.org/10.17979/spudc.9788497497749.0991 DOI libro: https://doi.org/10.17979/spudc.9788497497749
Abstract
[Abstract] Building datasets for computer vision tasks require a source of a large number of images, like the ones provided by the Internet search engines, joined with automated scraping tools, to construct them in a reasonable time. In this paper it is presented Oculus-Crawl, a tool designed to crawl and scrape images from the search engines Google and Yahoo Images to build datasets of pictures, that is modular, scalable and portable. It is also discussed a benchmark for this crawler and an internal feature for storing and sharing big datasets, that makes it suitable for computer vision and machine learning tasks. In our tests we were able to crawl and fetch 11.555 images in less than 14 minutes, including also their meta-data description, showing that it might be well-suited for retrieving large datasets.
Keywords
Crawler
Search engine
Dataset
Images
Computer vision
Search engine
Dataset
Images
Computer vision
Editor version
Rights
Atribución-NoComercial-CompartirIgual 4.0 España
ISBN
978-8416664-74-0 (UOV) 978-84-9749-774-9 (UDC electrónico)