Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora

Bibliographic citation

David Vilares, Miguel A. Alonso, and Carlos Gómez-Rodríguez. 2015. Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora. In Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 2–8, Lisboa, Portugal. Association for Computational Linguistics.

Type of academic work

Academic degree

Abstract

[Abstract]: We address the problem of performing po- larity classification on Twitter over differ- ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows the language in which the opinion is written, (2) a monolingual model that acts based on the decision provided by a language iden- tification tool and (3) a multilingual model trained on a multilingual dataset that does not need any language recognition step. Results show that multilingual models are even able to outperform the monolingual models on some monolingual sets. We introduce the first code-switching corpus with sentiment labels, showing the robust- ness of a multilingual approach.

Description

Rights

Atribución-NoComercial-CompartirIgual 3.0 Internacional
Atribución-NoComercial-CompartirIgual 3.0 Internacional

Except where otherwise noted, this item's license is described as Atribución-NoComercial-CompartirIgual 3.0 Internacional