dc.contributor.author | Muñoz-Ortiz, Alberto | |
dc.contributor.author | Gómez-Rodríguez, Carlos | |
dc.contributor.author | Vilares, David | |
dc.date.accessioned | 2024-05-27T12:35:00Z | |
dc.date.available | 2024-05-27T12:35:00Z | |
dc.date.issued | 2022-05 | |
dc.identifier.citation | Alberto Muñoz-Ortiz, Carlos Gómez-Rodríguez, and David Vilares. 2022. Cross-lingual Inflection as a Data Augmentation Method for Parsing. In Proceedings of the Third Workshop on Insights from Negative Results in NLP, pages 54–61, Dublin, Ireland. Association for Computational Linguistics. | es_ES |
dc.identifier.uri | http://hdl.handle.net/2183/36647 | |
dc.description | Held 26 May 2022, Dublin, Ireland. | es_ES |
dc.description.abstract | [Absctract]: We propose a morphology-based method for low-resource (LR) dependency parsing. We train a morphological inflector for target LR languages, and apply it to related rich-resource (RR) treebanks to create cross-lingual (x-inflected) treebanks that resemble the target LR language. We use such inflected treebanks to train parsers in zero- (training on x-inflected treebanks) and few-shot (training on x-inflected and target language treebanks) setups. The results show that the method sometimes improves the baselines, but not consistently. | es_ES |
dc.description.sponsorship | This work is supported by a 2020 Leonardo Grant
for Researchers and Cultural Creators from the
FBBVA,3
as well as by the European Research
Council (ERC), under the European Union’s Horizon 2020 research and innovation programme
(FASTPARSE, grant agreement No 714150). The
work is also supported by ERDF/MICINN-AEI
(SCANNER-UDC, PID2020-113230RB-C21), by
Xunta de Galicia (ED431C 2020/11), and by Centro de Investigación de Galicia “CITIC” which is
funded by Xunta de Galicia, Spain and the European Union (ERDF - Galicia 2014–2020 Program),
by grant ED431G 2019/01. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Association for Computational Linguistics | es_ES |
dc.relation.uri | https://aclanthology.org/2022.insights-1.7/ | es_ES |
dc.rights | Atribución 3.0 España | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
dc.subject | Cross-lingual inflection | es_ES |
dc.subject | Morphological Inflection | es_ES |
dc.subject | Data augmentation | es_ES |
dc.subject | Dependency parsing | es_ES |
dc.subject | Low-resource languages | es_ES |
dc.subject | Syntactic data augmentation | es_ES |
dc.title | Cross-lingual Inflection as a Data Augmentation Method for Parsing | es_ES |
dc.type | conference output | es_ES |
dc.rights.accessRights | open access | es_ES |
UDC.journalTitle | Proceedings of the Third Workshop on Insights from Negative Results in NLP | es_ES |
UDC.startPage | 54 | es_ES |
UDC.endPage | 61 | es_ES |
UDC.conferenceTitle | Third Workshop on Insights from Negative Results in NLP (Insights 2022) | es_ES |
UDC.coleccion | Investigación | es_ES |
UDC.departamento | Letras | es_ES |
UDC.grupoInv | Lingua e Sociedade da Información (LYS) | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/714150 | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACIÓN LINGÜÍSTICA: SINTAXIS E INTEGRACIÓN MULTITAREA (SCANNER-UDC) | es_ES |