The wisdom of the rankers: a cost-effective method for building pooled test collections without participant systems

UDC.coleccionInvestigaciónes_ES
UDC.conferenceTitleSAC ’21es_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.grupoInvInformation Retrieval Lab (IRlab)es_ES
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicaciónes_ES
dc.contributor.authorOtero, David
dc.contributor.authorParapar, Javier
dc.contributor.authorBarreiro, Álvaro
dc.date.accessioned2025-03-13T08:13:56Z
dc.date.available2025-03-13T08:13:56Z
dc.date.issued2021
dc.descriptionThis is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing, https://doi.org/10.1145/3412841.3441947es_ES
dc.description.abstract[Abstract]: Information Retrieval is an area where evaluation is crucial to validate newly proposed models. As the first step in the evaluation of models, researchers carry out offline experiments on specific datasets. While the field started around ad-hoc search, the number of new tasks is continuously growing. These tasks demand the development of new test collections (documents, information needs, and judgments). The construction of those datasets relies on expensive campaigns like TREC. Due to the size of modern collections, obtaining the relevance for each document-topic pair is infeasible. To reduce this cost, organizers usually apply a technique called pooling. When building pooled test collections, assessors only judge a portion of the documents selected among the participants' results. Although the judgments will not be exhaustive, they will be sufficiently complete and unbiased if pooling is done correctly. Therefore, researchers may safely use pooled collections to evaluate new models. However, the application of pooling depends on the existence of participant systems. This need is a handicap for tasks for which it is necessary to release training data before the celebration of the competition or for those with few participants. In this paper, we present a simple method for building pooled collections when such restrictions exist. Our proposal relies on two principles: the wisdom of the rankers and the application of pooling. By creating enough artificial participant systems, we can apply pooling on their results to select the documents that merit human assessment. Using an innovative approach to evaluate our method, we show that researchers may use it to produce high-quality collections on the absence of participant systems.es_ES
dc.description.sponsorshipThis work was supported by projects RTI2018-093336-B-C22 (MCIU & ERDF), and GPC ED431B 2019/03 (Xunta de Galicia & ERDF), and accreditation ED431G 2019/01 (Xunta de Galicia & ERDF).es_ES
dc.description.sponsorshipXunta de Galicia; ED431B 2019/03es_ES
dc.description.sponsorshipXunta de Galicia; ED431G 2019/01es_ES
dc.identifier.citationDavid Otero, Javier Parapar, and Álvaro Barreiro. 2021. The Wisdom of the Rankers: A Cost-Effective Method for Building Pooled Test Collections with-out Participant Systems. In The 36th ACM/SIGAPP Symposium on Applied Computing (SAC ’21), March 22–26, 2021, Virtual Event, Republic of Korea. ACM, New York, NY, USA, 9 pages.es_ES
dc.identifier.doi10.1145/3412841.3441947
dc.identifier.urihttp://hdl.handle.net/2183/41377
dc.language.isoenges_ES
dc.publisherACMes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-093336-B-C22/ES/TECNOLOGIAS PARA LA PREDICCION TEMPRANA DE SIGNOS RELACIONADOS CON TRASTORNOS PSICOLOGICOS (SUBPROYECTO UDC)es_ES
dc.relation.urihttps://doi.org/10.1145/3412841.3441947es_ES
dc.rights© 2021 Owner/Author | ACMes_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectInformation systemses_ES
dc.subjectInformation retrievales_ES
dc.subjectEvaluation of retrieval resultses_ES
dc.subjectTest collectionses_ES
dc.subjectPoolinges_ES
dc.titleThe wisdom of the rankers: a cost-effective method for building pooled test collections without participant systemses_ES
dc.typeconference outputes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication00d04042-9b75-419e-9aab-33fd14b201af
relation.isAuthorOfPublicationfef1a9cb-e346-4e53-9811-192e144f09d0
relation.isAuthorOfPublicationa3e43020-ee28-428d-8087-2f3c1e20aa2c
relation.isAuthorOfPublication.latestForDiscovery00d04042-9b75-419e-9aab-33fd14b201af

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Otero_David_2021_The_wisdom_of_the_rankers.pdf
Size:
4.35 MB
Format:
Adobe Portable Document Format
Description:
Versión aceptada