Relevance feedback for building pooled test collections

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.endPage18es_ES
UDC.grupoInvInformation Retrieval Lab (IRlab)es_ES
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicaciónes_ES
UDC.journalTitleJournal of Information Sciencees_ES
UDC.startPage1es_ES
dc.contributor.authorOtero, David
dc.contributor.authorParapar, Javier
dc.contributor.authorBarreiro, Álvaro
dc.date.accessioned2025-03-05T19:08:05Z
dc.date.available2025-03-05T19:08:05Z
dc.date.issued2023
dc.descriptionThis manuscript version of the article: Otero, D., Parapar, J., & Barreiro, Á. (2023). ‘Relevance feedback for building pooled test collections’ has been accepted for publication in Journal of Information Science, 2023, pp-1-18. DOI: https://doi.org/10.1177/0165551515598926.es_ES
dc.description.abstract[Abstract]: Offline evaluation of information retrieval systems depends on test collections. These datasets provide the researchers with a corpus of documents, topics and relevance judgements indicating which documents are relevant for each topic. Gathering the latter is costly, requiring human assessors to judge the documents. Therefore, experts usually judge only a portion of the corpus. The most common approach for selecting that subset is pooling. By intelligently choosing which documents to assess, it is possible to optimise the number of positive labels for a given budget. For this reason, much work has focused on developing techniques to better select which documents from the corpus merit human assessments. In this article, we propose using relevance feedback to prioritise the documents when building new pooled test collections. We explore several state-of-the-art statistical feedback methods for prioritising the documents the algorithm presents to the assessors. A thorough comparison on eight Text Retrieval Conference (TREC) datasets against strong baselines shows that, among other results, our proposals improve in retrieving relevant documents with lower assessment effort than other state-of-the-art adjudicating methods without harming the reliability, fairness and reusability.es_ES
dc.description.sponsorshipThe author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: this work has received support from: (1) project PLEC2021-007662 (grant no. MCIN/AEI/10.13039/501100011033, Ministerio de Ciencia e Innovación (MCIN), Agencia Estatal de Investigación, Plan de Recuperación, Transformación y Resiliencia, Unión Europea-NextGenerationEU), (2) Programa de Ayudas para la Formación de Profesorado Universitario, grant number FPU20/02659 (Ministerio de Universidades), (3) project PID2022-137061OB-C21 (Proyectos de Generación de Conocimiento, MCIN), (4) Consellería de Educación, Universidade e Formación Profesional (accreditation 2019-2022 ED431G 2019/01) and the European Regional Development Fund, which acknowledges the Centro de Investigación en Tecnologías de la Información y la Comunicación (CITIC) Research Centre in Information and Communications Technology (ICT) of the University of A Coruña as a Research Centre of the Galician University System and (5) project ED431-B 2022/33 (Xunta de Galicia/European Regional Development Fund (ERDF).es_ES
dc.description.sponsorshipXunta de Galicia; ED431-B 2022/33es_ES
dc.description.sponsorshipXunta de Galicia; ED431G 2019/01es_ES
dc.identifier.citationOtero, D., Parapar, J., & Barreiro, Á. (2023). Relevance feedback for building pooled test collections. Journal of Information Science, [online first], pp. 1-18. https://doi.org/10.1177/01655515231171085es_ES
dc.identifier.doi10.1177/01655515231171085
dc.identifier.issn0165-5515
dc.identifier.issn1741-6485
dc.identifier.urihttp://hdl.handle.net/2183/41306
dc.language.isoenges_ES
dc.publisherSAGE Publicationses_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2024/PLEC2021-007662/ES/BIG-eRISK: PREDICCIÓN TEMPRANA DE RIESGOS PERSONALES EN CONJUNTOS DE DATOS MASIVOS es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MECD/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/FPU20%2F02659/ES/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica, Técnica y de Innovación 2021-2023/PID2022-137061OB-C21/ES/BUSQUEDA, SELECCION Y ORGANIZACION DE CONTENIDOS PARA NECESIDADES DE INFORMACION RELACIONADAS CON LA SALUD - CONSTRUCCION DE RECURSOS Y PERSONALIZACIONes_ES
dc.relation.urihttps://doi.org/10.1177/01655515231171085es_ES
dc.rightsCopyright © 2023 The Authors. Article reuse guidelines: sagepub.com/journals-permissions.es_ES
dc.rightsAtribución-NoComercial-NoDerivates 4.0 Internacional
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectPoolinges_ES
dc.subjectRelevance feedbackes_ES
dc.subjectRerankinges_ES
dc.subjectTest collectionses_ES
dc.titleRelevance feedback for building pooled test collectionses_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublication00d04042-9b75-419e-9aab-33fd14b201af
relation.isAuthorOfPublicationfef1a9cb-e346-4e53-9811-192e144f09d0
relation.isAuthorOfPublicationa3e43020-ee28-428d-8087-2f3c1e20aa2c
relation.isAuthorOfPublication.latestForDiscovery00d04042-9b75-419e-9aab-33fd14b201af

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Otero_David_2023_Relevance_feedback_for_building_pooled_test_collections.pdf
Size:
752.23 KB
Format:
Adobe Portable Document Format
Description: