Building High-Quality Datasets for Information Retrieval Evaluation at a Reduced Cost
Title
Building High-Quality Datasets for Information Retrieval Evaluation at a Reduced CostDate
2019-08-01Citation
OTERO, David, et al. Building High-Quality Datasets for Information Retrieval Evaluation at a Reduced Cost. En Multidisciplinary Digital Publishing Institute Proceedings. 2019. p. 33.
Abstract
[Abstract] Information Retrieval is not any more exclusively about document ranking. Continuously new tasks are proposed on this and sibling fields. With this proliferation of tasks, it becomes crucial to have a cheap way of constructing test collections to evaluate the new developments. Building test collections is time and resource consuming: it requires time to obtain the documents, to define the user needs and it requires the assessors to judge a lot of documents. To reduce the latest, pooling strategies aim to decrease the assessment effort by presenting to the assessors a sample of documents in the corpus with the maximum number of relevant documents in it. In this paper, we propose the preliminary design of different techniques to easily and cheapily build high-quality test collections without the need of having participants systems.
Keywords
Information retrieval
Evaluation
Datasets
Cost
Evaluation
Datasets
Cost
Editor version
Rights
Atribución 4.0 España
ISSN
2504-3900