• Natural Language Parsing : Progress and Challenges 

      Gómez-Rodríguez, Carlos (Sociedad de Estadística e Investigación Operativa, 2018-07)
      [Abstract] Natural language parsing is the task of automatically obtaining the syntactic structure of sentences written in a human language. Parsing is a crucial step for language processing systems that need to extract ...
    • New Treebank or Repurposed? On the Feasibility of Cross-Lingual Parsing of Romance Languages with Universal Dependencies 

      García, Marcos; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Cambridge University Press, 2018-01)
      [Abstract] This paper addresses the feasibility of cross-lingual parsing with Universal Dependencies (UD) between Romance languages, analyzing its performance when compared to the use of manually annotated resources of the ...
    • On the Feasibility of Character n-Grams Pseudo-Translation for Cross-Language Information Retrieval Tasks 

      Vilares, Jesús; Vilares Ferro, Manuel; Alonso, Miguel A.; Oakes, Michael P. (2016-03)
      [Abstract] The field of Cross-Language Information Retrieval relates techniques close to both the Machine Translation and Information Retrieval fields, although in a context involving characteristics of its own. The present ...
    • On the Logistical Difficulties and Findings of Jopara Sentiment Analysis 

      Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Association for Computational Linguistics, 2021-06)
      [Abstract] This paper addresses the problem of sentiment analysis for Jopara, a code-switching language between Guarani and Spanish. We first collect a corpus of Guarani-dominant tweets and discuss on the difficulties of ...
    • On the performance of phonetic algorithms in microtext normalization 

      Doval, Yerai; Vilares Ferro, Manuel; Vilares, Jesús (Elsevier, 2018-12-15)
      [Abstract]: User–generated content published on microblogging social networks constitutes a priceless source of information. However, microtexts usually deviate from the standard lexical and grammatical rules of the language, ...
    • On the Processing and Analysis of Microtexts: From Normalization to Semantics 

      Doval, Yerai; Vilares, David (M D P I AG, 2018-09-18)
      [Abstract] User-generated content published on microblogging social platforms constitutes an invaluable source of information for diverse purposes: health surveillance, business intelligence, political analysis, etc. We ...
    • On the Use of Parsing for Named Entity Recognition 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021-01-25)
      [Abstract] Parsing is a core natural language processing technique that can be used to obtain the structure underlying sentences in human languages. Named entity recognition (NER) is the task of identifying the entities ...
    • On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Wiley, 2015-09)
      [Abstract]: Millions of micro texts are published every day on Twitter. Identifying the sentiment present in them can be helpful for measuring the frame of mind of the public, their satisfaction with respect to a product, ...
    • Optimality of syntactic dependency distances 

      Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos; Esteban, Juan Luis; Alemany-Puig, Lluís (American Physical Society, 2022-01)
      [Abstract]: It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts to quantify the degree of optimality of languages by means of an optimality ...
    • Parsing as Pretraining 

      Vilares, David; Strzyz, Michalina; Søgaard, Anders; Gómez-Rodríguez, Carlos (2020)
      [Abstract] Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks for word vectors still do not report results on standard setups ...
    • Prototipado rápido de un sistema de normalización de tuitsuna aproximación léxica 

      Vilares, Jesús; Alonso, Miguel A.; Vilares, David (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Resumen]: Este trabajo describe el sistema de normalización de tuits en español desarrollado por el Grupo de Lengua Y Sociedad de la Información (LYS) de la Universidade da Coruña para el Tweet-Norm 2013. Se trata de un ...
    • Public Sentiment Analysis and Topic Modeling Regarding COVID-19’s Three Waves of Total Lockdown: A Case Study on Movement Control Order in Malaysia 

      Alamoodi, A.H.; Baker, Mohammed Rashad; Albahri, O.S.; Zaidan, B.B.; Zaidan, A.A.; Wong, Wing-Kwong; Garfan, Salem; Albahri, A.S.; Alonso, Miguel A.; Jasim, Ali Najm; Baqer, M.J. (KSII, 2022-07-31)
      [Abstract] The COVID-19 pandemic has affected many aspects of human life. The pandemic not only caused millions of fatalities and problems but also changed public sentiment and behavior. Owing to the magnitude of this ...
    • Restricted Non-Projectivity: Coverage vs. Efficiency 

      Gómez-Rodríguez, Carlos (2016-12)
      [Abstract] In the last decade, various restricted classes of non-projective dependency trees have been proposed with the goal of achieving a good tradeoff between parsing efficiency and coverage of the syntactic structures ...
    • Searching Four-Millennia-Old Documents: A Text Retrieval System for Egyptologists 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (2016-08)
      [Abstract] Progress made in recent years has led to a growing interest in Digital Heritage. This article focuses on Egyptology and, more specifically, the study and preservation of ancient Egyptian scripts. We present a ...
    • Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En las plataformas de microblogging abundan ciertos tokens especiales como los hashtags o las menciones en los que un grupo de palabras se escriben juntas sin espaciado entre ellas; p.ej.: #añobisiesto o ...
    • Seguimiento y análisis automático de contenidos en redes sociales 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, David; Doval, Yerai; Vilares, Jesús (Centro Universitario de la Defensa de Marín, 2015)
      [Abstract]: La Minería de Opiniones es la disciplina que aborda el tratamiento automático de las opiniones contenidas en un texto. Permite, por ejemplo, determinar si en un texto se está opinando o no, o si la polaridad ...
    • Semantic Relation Extraction. Resources, Tools and Strategies 

      García, Marcos (Springer, 2016-07)
      [Abstract] Relation extraction is a subtask of information extraction that aims at obtaining instances of semantic relations present in texts. This information can be arranged in machine-readable formats, useful for several ...
    • Sentiment Analysis for Fake News Detection 

      Alonso, Miguel A.; Vilares, David; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021)
      [Abstract] In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention of deception. The dissemination of this type of news poses a serious threat to cohesion ...
    • Sentiment analysis for reviews and microtexts based on lexico-syntactic knowledge 

      Vilares, David (BCS-IRSG, 2013)
      [Abstract]: We describe two methods to perform sentiment analysis both on long and short texts written in Spanish language. We first present an unsupervised method based on dependency parsing which calculates the semantic ...
    • Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2015)
      [Abstract]: We address the problem of performing po- larity classification on Twitter over differ- ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows ...