Keyword Embeddings for Query Suggestion
| UDC.coleccion | Investigación | es_ES |
| UDC.conferenceTitle | 45th European Conference on Information Retrieval, ECIR 2023 | es_ES |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | es_ES |
| UDC.endPage | 360 | es_ES |
| UDC.grupoInv | Information Retrieval Lab (IRlab) | es_ES |
| UDC.journalTitle | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | es_ES |
| UDC.startPage | 346 | es_ES |
| dc.contributor.author | Gabín, Jorge | |
| dc.contributor.author | Ares, M. Eduardo | |
| dc.contributor.author | Parapar, Javier | |
| dc.date.accessioned | 2023-05-11T18:55:50Z | |
| dc.date.available | 2023-05-11T18:55:50Z | |
| dc.date.issued | 2023-04 | |
| dc.description.abstract | [Abstract]: Nowadays, search engine users commonly rely on query suggestions to improve their initial inputs. Current systems are very good at recommending lexical adaptations or spelling corrections to users’ queries. However, they often struggle to suggest semantically related keywords given a user’s query. The construction of a detailed query is crucial in some tasks, such as legal retrieval or academic search. In these scenarios, keyword suggestion methods are critical to guide the user during the query formulation. This paper proposes two novel models for the keyword suggestion task trained on scientific literature. Our techniques adapt the architecture of Word2Vec and FastText to generate keyword embeddings by leveraging documents’ keyword co-occurrence. Along with these models, we also present a specially tailored negative sampling approach that exploits how keywords appear in academic publications. We devise a ranking-based evaluation methodology following both known-item and ad-hoc search scenarios. Finally, we evaluate our proposals against the state-of-the-art word and sentence embedding models showing considerable improvements over the baselines for the tasks. | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G/01 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431B 2022/33 | es_ES |
| dc.description.sponsorship | This work was supported by projects PLEC2021-007662 (MCIN/AEI/10.13039/501100011033, Ministerio de Ciencia e Innovación, Agencia Estatal de Investigación, Plan de Recuperación, Transformación y Resiliencia, Unión Europea-Next Generation EU) and RTI2018-093336-B-C22 (Ministerio de Ciencia e Innovación, Agencia Estatal de Investigación). The first and third authors also thank the financial support supplied by the Consellería de Cultura, Educación e Universidade Consellería de Cultura, Educación, Formación Profesional e Universidades (accreditation 2019-2022 ED431G/01, ED431B 2022/33) and the European Regional Development Fund, which acknowledges the CITIC Research Center in ICT of the University of A Coruña as a Research Center of the Galician University System. The fist author also acknowledges the support of grant DIN2020-011582 financed by the MCIN/AEI/10.13039/501100011033. | es_ES |
| dc.identifier.citation | Gabín, J., Ares, M.E., Parapar, J. (2023). Keyword Embeddings for Query Suggestion. In: , et al. Advances in Information Retrieval. ECIR 2023. Lecture Notes in Computer Science, vol 13980. Springer, Cham. https://doi.org/10.1007/978-3-031-28244-7_22 | es_ES |
| dc.identifier.isbn | 978-3-031-28244-7 | |
| dc.identifier.isbn | 978-3-031-28243-0 (print) | |
| dc.identifier.issn | 0302-9743 | |
| dc.identifier.uri | http://hdl.handle.net/2183/33069 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Springer, Cham | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-093336-B-C22/ES/TECNOLOGIAS PARA LA PREDICCION TEMPRANA DE SIGNOS RELACIONADOS CON TRASTORNOS PSICOLOGICOS (SUBPROYECTO UDC) | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2024/PLEC2021-007662/ES/BIG-eRISK: PREDICCIÓN TEMPRANA DE RIESGOS PERSONALES EN CONJUNTOS DE DATOS MASIVOS | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/DIN2020-011582/ES/ | es_ES |
| dc.relation.uri | 10.1007/978-3-031-28244-7_22 | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.subject | Keyword suggestion | es_ES |
| dc.subject | Keyword embeddings | es_ES |
| dc.subject | Negative sampling | es_ES |
| dc.subject | Academic search | es_ES |
| dc.title | Keyword Embeddings for Query Suggestion | es_ES |
| dc.type | conference output | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | fef1a9cb-e346-4e53-9811-192e144f09d0 | |
| relation.isAuthorOfPublication.latestForDiscovery | fef1a9cb-e346-4e53-9811-192e144f09d0 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Gabin_Jorge_2023_keywords_embeddings_query_suggestion.pdf
- Size:
- 423.3 KB
- Format:
- Adobe Portable Document Format
- Description:

