On the Logistical Difficulties and Findings of Jopara Sentiment Analysis
| UDC.coleccion | Investigación | es_ES |
| UDC.conferenceTitle | Fifth Workshop on Computational Approaches to Linguistic Code-Switching | es_ES |
| UDC.departamento | Letras | es_ES |
| UDC.endPage | 102 | es_ES |
| UDC.grupoInv | Lingua e Sociedade da Información (LYS) | es_ES |
| UDC.startPage | 95 | es_ES |
| dc.contributor.author | Agüero-Torales, Marvin M. | |
| dc.contributor.author | Vilares, David | |
| dc.contributor.author | López-Herrera, Antonio G. | |
| dc.date.accessioned | 2022-04-26T08:02:26Z | |
| dc.date.available | 2022-04-26T08:02:26Z | |
| dc.date.issued | 2021-06 | |
| dc.description.abstract | [Abstract] This paper addresses the problem of sentiment analysis for Jopara, a code-switching language between Guarani and Spanish. We first collect a corpus of Guarani-dominant tweets and discuss on the difficulties of finding quality data for even relatively easy-to-annotate tasks, such as sentiment analysis. Then, we train a set of neural models, including pre-trained language models, and explore whether they perform better than traditional machine learning ones in this low-resource setup. Transformer architectures obtain the best results, despite not considering Guarani during pre-training, but traditional machine learning models perform close due to the low-resource nature of the problem. | es_ES |
| dc.description.sponsorship | DV is supported by a 2020 Leonardo Grant for Researchers and Cultural Creators from the FBBVA. 15 DV also receives funding from MINECO (ANSWER-ASAP, TIN2017-85160-C2-1-R), from Xunta de Galicia (ED431C 2020/11), from Centro de Investigación de Galicia ‘CITIC’, funded by Xunta de Galicia and the European Union (European Regional Development Fund- Galicia 2014-2020 Program) by grant ED431G 2019/01 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2020/11 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2019/01 | |
| dc.description.uri | https://aclanthology.org/2021.calcs-1 | |
| dc.identifier.citation | Marvin Agüero-Torales, David Vilares, and Antonio López-Herrera. 2021. On the logistical difficulties and findings of Jopara Sentiment Analysis. In Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, pages 95–102, Online. Association for Computational Linguistics. | es_ES |
| dc.identifier.doi | 10.18653/v1/2021.calcs-1.12 | |
| dc.identifier.uri | http://hdl.handle.net/2183/30534 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Association for Computational Linguistics | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/TIN2017-85160-C2-1-R/ES/AVANCES EN NUEVOS SISTEMAS DE EXTRACCION DE RESPUESTAS CON ANALISIS SEMANTICO Y APRENDIZAJE PROFUNDO/ | |
| dc.relation.uri | https://doi.org/10.18653/v1/2021.calcs-1.12 | es_ES |
| dc.rights | Atribución 4.0 Internacional | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | * |
| dc.subject | Code-switching language | es_ES |
| dc.subject | Jopara | es_ES |
| dc.subject | Guarani language | es_ES |
| dc.subject | Spanish language | es_ES |
| dc.subject | Sentiment analysis | es_ES |
| dc.subject | Machine learning models | es_ES |
| dc.title | On the Logistical Difficulties and Findings of Jopara Sentiment Analysis | es_ES |
| dc.type | conference output | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 37dabbe9-f54f-43bb-960e-0bf3ac7e54eb | |
| relation.isAuthorOfPublication.latestForDiscovery | 37dabbe9-f54f-43bb-960e-0bf3ac7e54eb |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Aguero_Torales_Marvin_Vilares_David_Lopez_Herrera_Antonio_2021_Logistical_difficulties_Jopara_sentiment_Analysis.pdf
- Size:
- 392.51 KB
- Format:
- Adobe Portable Document Format
- Description:

