Browsing by Author "Doval, Yerai"
-
Comparing neural- and N-gram-based language models for word segmentation
Doval, Yerai; Gómez-Rodríguez, Carlos (John Wiley and Sons Inc., 2019-02)[Abstract]: Word segmentation is the task of inserting or deleting word boundary characters in order to separate character sequences that correspond to words in some language. In this article we propose an approach based ... -
Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico
Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)[Abstract]: We describe here our participation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of task, we joined ... -
LyS at SemEval-2016 Task 4: Exploiting Neural Activation Values for Twitter Sentiment Classification and Quantification
Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2016)[Abstract]: In this paper we describe our deep learning approach for solving both two-, three- and five-class tweet polarity classification, and two- and five-class quantification. We first trained a convolutional neural ... -
LyS at TASS 2014: A Prototype for Extracting and Analysing Aspects from Spanish tweets
Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2014)[Abstract]: This paper describes our participation at the third edition of the workshop on Sentiment Analysis focused on Spanish tweets, TASS 2014. This year’s evaluation campaign includes four challenges: (1) global ... -
LyS at TASS 2015: Deep Learning Experiments for Sentiment Analysis on Spanish Tweets
Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (CEUR-WS Workshop Proceedings, 2015)[Abstract]: This paper describes the participation of the LyS group at TASS 2015. In this year’s edition, we used a long short-term memory neural network to address the two proposed challenges: (1) sentiment analysis at ... -
LyS: Porting a Twitter Sentiment Analysis Approach from Spanish to English
Vilares, David; Hermo, Miguel; Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Doval, Yerai (Association for Computational Linguistics, 2014)[Abstract]: This paper proposes an approach to solve message- and phrase-level polarity classification in Twitter, derived from an existing system designed for Spanish. As a first step, an ad-hoc preprocessing is performed. ... -
On the performance of phonetic algorithms in microtext normalization
Doval, Yerai; Vilares Ferro, Manuel; Vilares, Jesús (Elsevier, 2018-12-15)[Abstract]: User-generated content published on microblogging social networks constitutes a priceless source of information. However, microtexts usually deviate from the standard lexical and grammatical rules of the language, ... -
On the Processing and Analysis of Microtexts: From Normalization to Semantics
Doval, Yerai; Vilares, David (MDPI AG, 2018-09-18)[Abstract] User-generated content published on microblogging social platforms constitutes an invaluable source of information for diverse purposes: health surveillance, business intelligence, political analysis, etc. We ... -
Seeking robustness in a multilingual world: from pipelines to embeddings
Doval, Yerai (2019)[Abstract] In this dissertation, we study two approaches to overcome the challenges posed by processing user-generated non-standard multilingual text content as it is found on the Web nowadays. Firstly, we present a ... -
Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales
Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)[Abstract] Microblogging platforms abound in special tokens such as hashtags or mentions, in which a group of words is written together with no spacing between them; e.g.: #añobisiesto or ... -
Seguimiento y análisis automático de contenidos en redes sociales
Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, David; Doval, Yerai; Vilares, Jesús (Centro Universitario de la Defensa de Marín, 2015)[Abstract]: Opinion Mining is the discipline that deals with the automatic processing of the opinions contained in a text. It makes it possible, for example, to determine whether or not a text expresses an opinion, or whether the polarity ... -
Shallow Recurrent Neural Network for Personality Recognition in Source Code
Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)[Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ... -
Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval
Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)[Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ... -
Towards Robust Word Embeddings for Noisy Texts
Doval, Yerai; Vilares, Jesús; Gómez-Rodríguez, Carlos (MDPI, 2020)[Abstract] Research on word embeddings has mainly focused on improving their performance on standard corpora, disregarding the difficulties posed by noisy texts in the form of tweets and other types of non-standard writing ...