Mostrar o rexistro simple do ítem

dc.contributor.authorGarcía, Marcos
dc.contributor.authorGarcía Salido, Marcos
dc.contributor.authorAlonso-Ramos, Margarita
dc.date.accessioned2024-06-21T11:12:12Z
dc.date.available2024-06-21T11:12:12Z
dc.date.issued2019
dc.identifier.citationMarcos Garcia, Marcos García Salido, and Margarita Alonso-Ramos. 2019. A comparison of statistical association measures for identifying dependency-based collocations in various languages.. In Proceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019), pages 49–59, Florence, Italy. Association for Computational Linguistics.es_ES
dc.identifier.isbn9781950737260
dc.identifier.urihttp://hdl.handle.net/2183/37283
dc.description.abstract[Abstract] This paper presents an exploration of different statistical association measures to automatically identify collocations from corpora in English, Portuguese, and Spanish. To evaluate the impact of the association measures we manually annotated corpora with three different syntactic patterns of collocations (adjective-noun, verb-object and nominal compounds). We took advantage of the PARSEME 1.1 Shared Task corpora by selecting a subset of 155k tokens in the three referred languages, in which we annotated 1, 526 collocations with their Lexical Functions according to the Meaning-Text Theory. Using the resulting gold-standard, we have carried out a comparison between frequency data and several well-known association measures, both symmetric and asymmetric. The results show that the combination of dependency triples with raw frequency information is as powerful as the best association measures in most syntactic patterns and languages. Furthermore, and despite the asymmetric behaviour of collocations, directional approaches perform worse than the symmetric ones in the extraction of these phraseological combinations.es_ES
dc.description.sponsorshipMinisterio de Economía y Competitividad; FFI2016-78299-Pes_ES
dc.description.sponsorshipXunta de Galicia; ED431B-2017/01es_ES
dc.description.sponsorshipInstituto Juan de la Cierva; IJCI-2016-29598es_ES
dc.description.sponsorshipXunta de Galicia; ED481D 2017/009es_ES
dc.language.isoenges_ES
dc.publisherAssociation for Computational Linguistics (ACL)es_ES
dc.relation.urihttps://aclanthology.org/W19-5107es_ES
dc.rightsAtribución 4.0 Internacionales_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectCollocationses_ES
dc.subjectStatistical measureses_ES
dc.subjectLanguageses_ES
dc.subjectPortuguese languagees_ES
dc.subjectEnglish languagees_ES
dc.subjectSpanish languagees_ES
dc.subjectCorporaes_ES
dc.titleA comparison of statistical association measures for identifying dependency-based collocations in various languageses_ES
dc.typeinfo:eu-repo/semantics/conferenceObjectes_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.startPage49es_ES
UDC.endPage59es_ES
UDC.conferenceTitleProceedings of the Joint Workshop on Multiword Expressions and WordNet (MWE-WN 2019)es_ES


Ficheiros no ítem

Thumbnail
Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem