• Sentiment Analysis on Monolingual, Multilingual and Code-Switching Twitter Corpora 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2015)
      [Abstract]: We address the problem of performing po- larity classification on Twitter over differ- ent languages, focusing on English and Spanish, comparing three techniques: (1) a monolingual model which knows ...
    • Sequence Labeling Parsing by Learning across Representations 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)
      [Absctract]: We use parsing as sequence labeling as a common framework to learn across constituency and dependency syntactic abstractions. To do so, we cast the problem as multitask learning (MTL). First, we show that ...
    • Sequence Tagging for Fast Dependency Parsing 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (2019)
      [Abstract] Dependency parsing has been built upon the idea of using parsing methods based on shift-reduce or graph-based algorithms in order to identify binary dependency relations between the words in a sentence. In this ...
    • Shallow Recurrent Neural Network for Personality Recognition in Source Code 

      Doval, Yerai; Gómez-Rodríguez, Carlos; Vilares, Jesús (CEUR Workshop Proceedings, 2016-12)
      [Abstract] Personality recognition in source code constitutes a novel task in the field of author profiling on written text. In this paper we describe our proposal for the PR-SOCO shared task in FIRE 2016, which is based ...
    • Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval 

      Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)
      [Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ...
    • Supervised polarity classification of Spanish tweets based on linguistic knowledge 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computing Machinery, 2013)
      [Abstract]: We describe a system that classifies the polarity of Spanish tweets. We adopt a hybrid approach, which combines machine learning and linguistic knowledge acquired by means of NLP. We use part-of-speech tags, ...
    • Supervised sentiment analysis in multilingual environments 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Elsevier, 2017-05)
      [Abstract]: This article tackles the problem of performing multilingual polarity classification on Twitter, comparing three techniques: (1) a multilingual model trained on a multilingual dataset, obtained by fusing existing ...
    • Surfing the Modeling of pos Taggers in Low-Resource Scenarios 

      Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Ribadas Pena, Francisco José; Graña Gil, Jorge (MDPI, 2022-09-27)
      [Abstract] The recent trend toward the application of deep structured techniques has revealed the limits of huge models in natural language processing. This has reawakened the interest in traditional machine learning ...
    • The Fragility of Multi-Treebank Parsing Evaluation 

      Alonso-Alonso, Iago; Vilares, David; Gómez-Rodríguez, Carlos (International Committee on Computational Linguistics, 2022-10)
      [Absctract]: Treebank selection for parsing evaluation and the spurious effects that might arise from a biased choice have not been explored in detail. This paper studies how evaluating on a single subset of treebanks can ...
    • The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance 

      Anderson, Mark; Gómez-Rodríguez, Carlos (The MIT Press, 2022)
      [Abstract] We contribute to the discussion on parsing performance in NLP by introducing a measurement that evaluates the differences between the distributions of edge displacement (the directed distance of edges) seen in ...
    • The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets 

      Vilares, David; Thelwall, Mike; Alonso, Miguel A. (SAGE Publications & CILIP, 2015)
      [Abstract]: Twitter is an important platform for sharing opinions about politicians, parties and political decisions. These opinions can be exploited as a source of information to monitor the impact of politics on society. ...
    • The scaling of the minimum sum of edge lengths in uniformly random trees 

      Esteban, Juan Luis; Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016-06)
      [Abstract] The minimum linear arrangement problem on a network consists of finding the minimum sum of edge lengths that can be achieved when the vertices are arranged linearly. Although there are algorithms to solve this ...
    • TIR over Egyptian Hieroglyphs 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (IEEE Computer Society Press, 2016-09)
      [Abstract] This work presents an Information Retrieval system specifically designed to manage Ancient Egyptian hieroglyphic texts taking into account their peculiarities both at lexical and at encoding level for its ...
    • Towards a Graded Dictionary of Spanish Collocations 

      García Salido, Marcos; García, Marcos; Alonso-Ramos, Margarita (Lexical Computing, 2019)
      [Abstract] Several recent studies have observed that texts of different quality and written by learners at different proficiency levels also vary in the lexical combinations they contain. Such variation can be operationalized ...
    • Towards fast natural language parsing: FASTPARSE ERC Starting Grant 

      Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN), 2017-09)
      [Abstract:] The goal of the FASTPARSE project (Fast Natural Language Parsing for Large-Scale NLP), funded by the European Research Council (ERC), is to achieve a breakthrough in the speed of natural language syntactic ...
    • Towards Making a Dependency Parser See 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-11)
      [Absctract]: We explore whether it is possible to leverage eye-tracking data in an RNN dependency parser (for English) when such information is only available during training - i.e. no aggregated or token-level gaze features ...
    • Towards Robust Word Embeddings for Noisy Texts 

      Doval, Yerai; Vilares, Jesús; Gómez-Rodríguez, Carlos (MDPI, 2020)
      [Abstract] Research on word embeddings has mainly focused on improving their performance on standard corpora, disregarding the difficulties posed by noisy texts in the form of tweets and other types of non-standard writing ...
    • Towards the Automatic Construction of a Multilingual Dictionary of Collocations using Distributional Semantics 

      García, Marcos; García Salido, Marcos; Alonso-Ramos, Margarita (Lexical Computing, 2019)
      [Abstract] This paper presents the method used to create a multilingual online dictionary of collocations of English, Portuguese, and Spanish. This resource is built automatically and contains three types of collocations: ...
    • Transition-based Semantic Dependency Parsing with Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2020-07)
      [Abstract]: Transition-based parsers implemented with Pointer Networks have become the new state of the art in dependency parsing, excelling in producing labelled syntactic trees and outperforming graph-based models in ...
    • Transition-based semantic role labeling with pointer networks 

      Fernández-González, Daniel (Elsevier, 2023-01-25)
      [Abstract] Semantic role labeling (SRL) focuses on recognizing the predicate–argument structure of a sentence and plays a critical role in many natural language processing tasks such as machine translation and question ...