Oculus-Crawl, a software tool for building datasets for computer vision tasks

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Paz Centeno, Iván de
Fidalgo, Eduardo
Alegre, Enrique
Al-Nabki, Mhd Wesam

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Paz Centeno, I., Fidalgo, E., Alegre Gutiérrez, E., Al-Nabki, M. W. Oculus-Crawl, a software tool for building datasets for computer vision tasks. En Actas de las XXXVIII Jornadas de Automática, Gijón, 6-8 de Septiembre de 2017 (pp.991-998). DOI capítulo: https://doi.org/10.17979/spudc.9788497497749.0991 DOI libro: https://doi.org/10.17979/spudc.9788497497749

Type of academic work

Academic degree

Abstract

[Abstract] Building datasets for computer vision tasks require a source of a large number of images, like the ones provided by the Internet search engines, joined with automated scraping tools, to construct them in a reasonable time. In this paper it is presented Oculus-Crawl, a tool designed to crawl and scrape images from the search engines Google and Yahoo Images to build datasets of pictures, that is modular, scalable and portable. It is also discussed a benchmark for this crawler and an internal feature for storing and sharing big datasets, that makes it suitable for computer vision and machine learning tasks. In our tests we were able to crawl and fetch 11.555 images in less than 14 minutes, including also their meta-data description, showing that it might be well-suited for retrieving large datasets.

Description

Rights

Atribución-NoComercial-CompartirIgual 4.0 España
Atribución-NoComercial-CompartirIgual 4.0 España

Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 4.0 España