EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis
Use este enlace para citar
http://hdl.handle.net/2183/34966
A non ser que se indique outra cousa, a licenza do ítem descríbese como Atribución-NoComercial 4.0 Internacional
Coleccións
Metadatos
Mostrar o rexistro completo do ítemTítulo
EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment AnalysisData
2016-05Cita bibliográfica
David Vilares, Miguel A. Alonso, and Carlos Gómez-Rodríguez. 2016. EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4149–4153, Portorož, Slovenia. European Language Resources Association (ELRA).
Resumo
[Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community to evaluate the performance of sentiment classification techniques on this complex multilingual environment, proposing an English-Spanish corpus of tweets with code-switching (EN-ES-CS CORPUS). The tweets are labeled according to two well-known criteria used for this purpose: SentiStrength and a trinary scale (positive, neutral and negative categories). Preliminary work on the resource is already done, providing a set of baselines for the research community.
Palabras chave
Sentiment Analysis
Corpus Generation
Code-Switching
Corpus Generation
Code-Switching
Versión do editor
Dereitos
Atribución-NoComercial 4.0 Internacional
ISBN
978-2-9517408-9-1