Cross-lingual Inflection as a Data Augmentation Method for Parsing

Muñoz-Ortiz, Alberto; Gómez-Rodríguez, Carlos; Vilares, David

dc.contributor.author	Muñoz-Ortiz, Alberto
dc.contributor.author	Gómez-Rodríguez, Carlos
dc.contributor.author	Vilares, David
dc.date.accessioned	2024-05-27T12:35:00Z
dc.date.available	2024-05-27T12:35:00Z
dc.date.issued	2022-05
dc.identifier.citation	Alberto Muñoz-Ortiz, Carlos Gómez-Rodríguez, and David Vilares. 2022. Cross-lingual Inflection as a Data Augmentation Method for Parsing. In Proceedings of the Third Workshop on Insights from Negative Results in NLP, pages 54–61, Dublin, Ireland. Association for Computational Linguistics.	es_ES
dc.identifier.uri	http://hdl.handle.net/2183/36647
dc.description	Held 26 May 2022, Dublin, Ireland.	es_ES
dc.description.abstract	[Absctract]: We propose a morphology-based method for low-resource (LR) dependency parsing. We train a morphological inflector for target LR languages, and apply it to related rich-resource (RR) treebanks to create cross-lingual (x-inflected) treebanks that resemble the target LR language. We use such inflected treebanks to train parsers in zero- (training on x-inflected treebanks) and few-shot (training on x-inflected and target language treebanks) setups. The results show that the method sometimes improves the baselines, but not consistently.	es_ES
dc.description.sponsorship	This work is supported by a 2020 Leonardo Grant for Researchers and Cultural Creators from the FBBVA,3 as well as by the European Research Council (ERC), under the European Union’s Horizon 2020 research and innovation programme (FASTPARSE, grant agreement No 714150). The work is also supported by ERDF/MICINN-AEI (SCANNER-UDC, PID2020-113230RB-C21), by Xunta de Galicia (ED431C 2020/11), and by Centro de Investigación de Galicia “CITIC” which is funded by Xunta de Galicia, Spain and the European Union (ERDF - Galicia 2014–2020 Program), by grant ED431G 2019/01.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Association for Computational Linguistics	es_ES
dc.relation	info:eu-repo/grantAgreement/EC/H2020/714150	es_ES
dc.relation	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113230RB-C21/ES/MODELOS MULTITAREA DE ETIQUETADO SECUENCIAL PARA EL RECONOCIMIENTO DE ENTIDADES ENRIQUECIDO CON INFORMACIÓN LINGÜÍSTICA: SINTAXIS E INTEGRACIÓN MULTITAREA (SCANNER-UDC)	es_ES
dc.relation.uri	https://aclanthology.org/2022.insights-1.7/	es_ES
dc.rights	Atribución 3.0 España	es_ES
dc.rights.uri	http://creativecommons.org/licenses/by/3.0/es/	*
dc.subject	Cross-lingual inflection	es_ES
dc.subject	Morphological Inflection	es_ES
dc.subject	Data augmentation	es_ES
dc.subject	Dependency parsing	es_ES
dc.subject	Low-resource languages	es_ES
dc.subject	Syntactic data augmentation	es_ES
dc.title	Cross-lingual Inflection as a Data Augmentation Method for Parsing	es_ES
dc.type	info:eu-repo/semantics/conferenceObject	es_ES
dc.type	info:eu-repo/semantics/conferenceObject	es_ES
dc.rights.access	info:eu-repo/semantics/openAccess	es_ES
UDC.journalTitle	Proceedings of the Third Workshop on Insights from Negative Results in NLP	es_ES
UDC.startPage	54	es_ES
UDC.endPage	61	es_ES
UDC.conferenceTitle	Third Workshop on Insights from Negative Results in NLP (Insights 2022)	es_ES

Ficheiros no ítem

Nome:: MuñozOrtiz_2022_Cross_lingual_ ...
Tamaño:: 193.8Kb
Formato:: PDF

Ver/abrir

Nome:: license_rdf
Tamaño:: 1.337Kb
Formato:: application/rdf+xml

Ver/abrir

Este ítem aparece na(s) seguinte(s) colección(s)

OpenAIRE [336]
GI-LYS - Congresos, conferencias, etc. [65]

Mostrar o rexistro simple do ítem