• Parsing as Pretraining 

      Vilares, David; Strzyz, Michalina; Søgaard, Anders; Gómez-Rodríguez, Carlos (2020)
      [Abstract] Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks for word vectors still do not report results on standard setups ...
    • Prototipado rápido de un sistema de normalización de tuitsuna aproximación léxica 

      Vilares, Jesús; Alonso, Miguel A.; Vilares, David (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Resumen]: Este trabajo describe el sistema de normalización de tuits en español desarrollado por el Grupo de Lengua Y Sociedad de la Información (LYS) de la Universidade da Coruña para el Tweet-Norm 2013. Se trata de un ...
    • Public Sentiment Analysis and Topic Modeling Regarding COVID-19’s Three Waves of Total Lockdown: A Case Study on Movement Control Order in Malaysia 

      Alamoodi, A.H.; Baker, Mohammed Rashad; Albahri, O.S.; Zaidan, B.B.; Zaidan, A.A.; Wong, Wing-Kwong; Garfan, Salem; Albahri, A.S.; Alonso, Miguel A.; Jasim, Ali Najm; Baqer, M.J. (KSII, 2022-07-31)
      [Abstract] The COVID-19 pandemic has affected many aspects of human life. The pandemic not only caused millions of fatalities and problems but also changed public sentiment and behavior. Owing to the magnitude of this ...
    • Restricted Non-Projectivity: Coverage vs. Efficiency 

      Gómez-Rodríguez, Carlos (2016-12)
      [Abstract] In the last decade, various restricted classes of non-projective dependency trees have been proposed with the goal of achieving a good tradeoff between parsing efficiency and coverage of the syntactic structures ...
    • Searching Four-Millennia-Old Documents: A Text Retrieval System for Egyptologists 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (2016-08)
      [Abstract] Progress made in recent years has led to a growing interest in Digital Heritage. This article focuses on Egyptology and, more specifically, the study and preservation of ancient Egyptian scripts. We present a ...
    • Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En las plataformas de microblogging abundan ciertos tokens especiales como los hashtags o las menciones en los que un grupo de palabras se escriben juntas sin espaciado entre ellas; p.ej.: #añobisiesto o ...
    • Seguimiento y análisis automático de contenidos en redes sociales 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, David; Doval, Yerai; Vilares, Jesús (Centro Universitario de la Defensa de Marín, 2015)
      [Abstract]: La Minería de Opiniones es la disciplina que aborda el tratamiento automático de las opiniones contenidas en un texto. Permite, por ejemplo, determinar si en un texto se está opinando o no, o si la polaridad ...
    • Semantic Relation Extraction. Resources, Tools and Strategies 

      García, Marcos (Springer, 2016-07)
      [Abstract] Relation extraction is a subtask of information extraction that aims at obtaining instances of semantic relations present in texts. This information can be arranged in machine-readable formats, useful for several ...
    • Sentiment Analysis for Fake News Detection 

      Alonso, Miguel A.; Vilares, David; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021)
      [Abstract] In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention of deception. The dissemination of this type of news poses a serious threat to cohesion ...
    • Sentiment analysis for reviews and microtexts based on lexico-syntactic knowledge 

      Vilares, David (BCS-IRSG, 2013)
      [Abstract]: We describe two methods to perform sentiment analysis both on long and short texts written in Spanish language. We first present an unsupervised method based on dependency parsing which calculates the semantic ...
    • Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2015)
      [Abstract]: We address the problem of performing po- larity classification on Twitter over differ- ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows ...
    • Sequence Tagging for Fast Dependency Parsing 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (2019)
      [Abstract] Dependency parsing has been built upon the idea of using parsing methods based on shift-reduce or graph-based algorithms in order to identify binary dependency relations between the words in a sentence. In this ...
    • Shallow Recurrent Neural Network for Personality Recognition in Source Code 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)
      [Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ...
    • Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval 

      Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)
      [Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ...
    • Supervised polarity classification of Spanish tweets based on linguistic knowledge 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computing Machinery, 2013)
      [Abstract]: We describe a system that classifies the polarity of Spanish tweets. We adopt a hybrid approach, which combines machine learning and linguistic knowledge acquired by means of NLP. We use part-of-speech tags, ...
    • Supervised sentiment analysis in multilingual environments 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Elsevier, 2017-05)
      [Abstract]: This article tackles the problem of performing multilingual polarity classification on Twitter, comparing three techniques: (1) a multilingual model trained on a multilingual dataset, obtained by fusing existing ...
    • Surfing the Modeling of pos Taggers in Low-Resource Scenarios 

      Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Ribadas Pena, Francisco José; Graña Gil, Jorge (MDPI, 2022-09-27)
      [Abstract] The recent trend toward the application of deep structured techniques has revealed the limits of huge models in natural language processing. This has reawakened the interest in traditional machine learning ...
    • The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance 

      Anderson, Mark; Gómez-Rodríguez, Carlos (The MIT Press, 2022)
      [Abstract] We contribute to the discussion on parsing performance in NLP by introducing a measurement that evaluates the differences between the distributions of edge displacement (the directed distance of edges) seen in ...
    • The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets 

      Vilares, David; Thelwall, Mike; Alonso, Miguel A. (SAGE Publications & CILIP, 2015)
      [Abstract]: Twitter is an important platform for sharing opinions about politicians, parties and political decisions. These opinions can be exploited as a source of information to monitor the impact of politics on society. ...
    • The scaling of the minimum sum of edge lengths in uniformly random trees 

      Esteban, Juan Luis; Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016-06)
      [Abstract] The minimum linear arrangement problem on a network consists of finding the minimum sum of edge lengths that can be achieved when the vertices are arranged linearly. Although there are algorithms to solve this ...