• On the usefulness of lexical and syntactic processing in polarity classification of Twitter messages 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Wiley, 2015-09)
      [Abstract]: Millions of micro texts are published every day on Twitter. Identifying the sentiment present in them can be helpful for measuring the frame of mind of the public, their satisfaction with respect to a product, ...
    • Optimality of syntactic dependency distances 

      Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos; Esteban, Juan Luis; Alemany-Puig, Lluís (American Physical Society, 2022-01)
      [Abstract]: It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts to quantify the degree of optimality of languages by means of an optimality ...
    • Parsing as Pretraining 

      Vilares, David; Strzyz, Michalina; Søgaard, Anders; Gómez-Rodríguez, Carlos (2020)
      [Abstract] Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks for word vectors still do not report results on standard setups ...
    • Parsing linearizations appreciate PoS tags - but some are fussy about errors 

      Muñoz-Ortiz, Alberto; Anderson, Mark; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2022-11)
      [Absctract]: PoS tags, once taken for granted as a useful resource for syntactic parsing, have become more situational with the popularization of deep learning. Recent work on the impact of PoS tags on graph- and ...
    • Prototype of an Entity Recognition System for Antimicrobial Resistance Data Management 

      Prado-Valiño, Francisco; Santos-Ríos, Roi; Gómez-Rodríguez, Carlos; Vilares, Jesús (Universidade da Coruña, Servizo de Publicacións, 2023)
      [Abstract] Often, a study or research process requires the analysis of large volumes of information in the form of unstructured text. This task consumes a large amount of time and resources of the human experts in charge ...
    • Restricted Non-Projectivity: Coverage vs. Efficiency 

      Gómez-Rodríguez, Carlos (2016-12)
      [Abstract] In the last decade, various restricted classes of non-projective dependency trees have been proposed with the goal of achieving a good tradeoff between parsing efficiency and coverage of the syntactic structures ...
    • Segmentación de palabras en español mediante modelos del lenguaje basados en redes neuronales 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (Sociedad Española para el Procesamiento del Lenguaje Natural, 2016-09)
      [Resumen] En las plataformas de microblogging abundan ciertos tokens especiales como los hashtags o las menciones en los que un grupo de palabras se escriben juntas sin espaciado entre ellas; p.ej.: #añobisiesto o ...
    • Seguimiento y análisis automático de contenidos en redes sociales 

      Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Vilares, David; Doval, Yerai; Vilares, Jesús (Centro Universitario de la Defensa de Marín, 2015)
      [Abstract]: La Minería de Opiniones es la disciplina que aborda el tratamiento automático de las opiniones contenidas en un texto. Permite, por ejemplo, determinar si en un texto se está opinando o no, o si la polaridad ...
    • Sentiment Analysis for Fake News Detection 

      Alonso, Miguel A.; Vilares, David; Gómez-Rodríguez, Carlos; Vilares, Jesús (MDPI, 2021)
      [Abstract] In recent years, we have witnessed a rise in fake news, i.e., provably false pieces of information created with the intention of deception. The dissemination of this type of news poses a serious threat to cohesion ...
    • Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2015)
      [Abstract]: We address the problem of performing po- larity classification on Twitter over differ- ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows ...
    • Sequence Labeling Parsing by Learning across Representations 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)
      [Absctract]: We use parsing as sequence labeling as a common framework to learn across constituency and dependency syntactic abstractions. To do so, we cast the problem as multitask learning (MTL). First, we show that ...
    • Sequence Tagging for Fast Dependency Parsing 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (2019)
      [Abstract] Dependency parsing has been built upon the idea of using parsing methods based on shift-reduce or graph-based algorithms in order to identify binary dependency relations between the words in a sentence. In this ...
    • Shallow Recurrent Neural Network for Personality Recognition in Source Code 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)
      [Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ...
    • Supervised polarity classification of Spanish tweets based on linguistic knowledge 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computing Machinery, 2013)
      [Abstract]: We describe a system that classifies the polarity of Spanish tweets. We adopt a hybrid approach, which combines machine learning and linguistic knowledge acquired by means of NLP. We use part-of-speech tags, ...
    • Supervised sentiment analysis in multilingual environments 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Elsevier, 2017-05)
      [Abstract]: This article tackles the problem of performing multilingual polarity classification on Twitter, comparing three techniques: (1) a multilingual model trained on a multilingual dataset, obtained by fusing existing ...
    • The Fragility of Multi-Treebank Parsing Evaluation 

      Alonso-Alonso, Iago; Vilares, David; Gómez-Rodríguez, Carlos (International Committee on Computational Linguistics, 2022-10)
      [Absctract]: Treebank selection for parsing evaluation and the spurious effects that might arise from a biased choice have not been explored in detail. This paper studies how evaluating on a single subset of treebanks can ...
    • The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance 

      Anderson, Mark; Gómez-Rodríguez, Carlos (The MIT Press, 2022)
      [Abstract] We contribute to the discussion on parsing performance in NLP by introducing a measurement that evaluates the differences between the distributions of edge displacement (the directed distance of edges) seen in ...
    • The scaling of the minimum sum of edge lengths in uniformly random trees 

      Esteban, Juan Luis; Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016-06)
      [Abstract] The minimum linear arrangement problem on a network consists of finding the minimum sum of edge lengths that can be achieved when the vertices are arranged linearly. Although there are algorithms to solve this ...
    • Towards fast natural language parsing: FASTPARSE ERC Starting Grant 

      Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN), 2017-09)
      [Abstract:] The goal of the FASTPARSE project (Fast Natural Language Parsing for Large-Scale NLP), funded by the European Research Council (ERC), is to achieve a breakthrough in the speed of natural language syntactic ...
    • Towards Making a Dependency Parser See 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-11)
      [Absctract]: We explore whether it is possible to leverage eye-tracking data in an RNN dependency parser (for English) when such information is only available during training - i.e. no aggregated or token-level gaze features ...