• Studying the Effect and Treatment of Misspelled Queries in Cross-Language Information Retrieval 

      Vilares, Jesús; Alonso, Miguel A.; Doval, Yerai; Vilares Ferro, Manuel (2016-07)
      [Abstract] The performance of Information Retrieval systems is limited by the linguistic variation present in natural language texts. Word-level Natural Language Processing techniques have been shown to be useful in reducing ...
    • Supervised polarity classification of Spanish tweets based on linguistic knowledge 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computing Machinery, 2013)
      [Abstract]: We describe a system that classifies the polarity of Spanish tweets. We adopt a hybrid approach, which combines machine learning and linguistic knowledge acquired by means of NLP. We use part-of-speech tags, ...
    • Supervised sentiment analysis in multilingual environments 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Elsevier, 2017-05)
      [Abstract]: This article tackles the problem of performing multilingual polarity classification on Twitter, comparing three techniques: (1) a multilingual model trained on a multilingual dataset, obtained by fusing existing ...
    • Surfing the Modeling of pos Taggers in Low-Resource Scenarios 

      Vilares Ferro, Manuel; Darriba Bilbao, Víctor M.; Ribadas Pena, Francisco José; Graña Gil, Jorge (MDPI, 2022-09-27)
      [Abstract] The recent trend toward the application of deep structured techniques has revealed the limits of huge models in natural language processing. This has reawakened the interest in traditional machine learning ...
    • The Fragility of Multi-Treebank Parsing Evaluation 

      Alonso-Alonso, Iago; Vilares, David; Gómez-Rodríguez, Carlos (International Committee on Computational Linguistics, 2022-10)
      [Absctract]: Treebank selection for parsing evaluation and the spurious effects that might arise from a biased choice have not been explored in detail. This paper studies how evaluating on a single subset of treebanks can ...
    • The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance 

      Anderson, Mark; Gómez-Rodríguez, Carlos (The MIT Press, 2022)
      [Abstract] We contribute to the discussion on parsing performance in NLP by introducing a measurement that evaluates the differences between the distributions of edge displacement (the directed distance of edges) seen in ...
    • The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweets 

      Vilares, David; Thelwall, Mike; Alonso, Miguel A. (SAGE Publications & CILIP, 2015)
      [Abstract]: Twitter is an important platform for sharing opinions about politicians, parties and political decisions. These opinions can be exploited as a source of information to monitor the impact of politics on society. ...
    • The scaling of the minimum sum of edge lengths in uniformly random trees 

      Esteban, Juan Luis; Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016-06)
      [Abstract] The minimum linear arrangement problem on a network consists of finding the minimum sum of edge lengths that can be achieved when the vertices are arranged linearly. Although there are algorithms to solve this ...
    • TIR over Egyptian Hieroglyphs 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (IEEE Computer Society Press, 2016-09)
      [Abstract] This work presents an Information Retrieval system specifically designed to manage Ancient Egyptian hieroglyphic texts taking into account their peculiarities both at lexical and at encoding level for its ...
    • Towards a Graded Dictionary of Spanish Collocations 

      García Salido, Marcos; García, Marcos; Alonso-Ramos, Margarita (Lexical Computing, 2019)
      [Abstract] Several recent studies have observed that texts of different quality and written by learners at different proficiency levels also vary in the lexical combinations they contain. Such variation can be operationalized ...
    • Towards fast natural language parsing: FASTPARSE ERC Starting Grant 

      Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN), 2017-09)
      [Abstract:] The goal of the FASTPARSE project (Fast Natural Language Parsing for Large-Scale NLP), funded by the European Research Council (ERC), is to achieve a breakthrough in the speed of natural language syntactic ...
    • Towards Making a Dependency Parser See 

      Strzyz, Michalina; Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-11)
      [Absctract]: We explore whether it is possible to leverage eye-tracking data in an RNN dependency parser (for English) when such information is only available during training - i.e. no aggregated or token-level gaze features ...
    • Towards Robust Word Embeddings for Noisy Texts 

      Doval, Yerai; Vilares, Jesús; Gómez-Rodríguez, Carlos (MDPI, 2020)
      [Abstract] Research on word embeddings has mainly focused on improving their performance on standard corpora, disregarding the difficulties posed by noisy texts in the form of tweets and other types of non-standard writing ...
    • Towards the Automatic Construction of a Multilingual Dictionary of Collocations using Distributional Semantics 

      García, Marcos; García Salido, Marcos; Alonso-Ramos, Margarita (Lexical Computing, 2019)
      [Abstract] This paper presents the method used to create a multilingual online dictionary of collocations of English, Portuguese, and Spanish. This resource is built automatically and contains three types of collocations: ...
    • Transition-based Semantic Dependency Parsing with Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2020-07)
      [Abstract]: Transition-based parsers implemented with Pointer Networks have become the new state of the art in dependency parsing, excelling in producing labelled syntactic trees and outperforming graph-based models in ...
    • Transition-based semantic role labeling with pointer networks 

      Fernández-González, Daniel (Elsevier, 2023-01-25)
      [Abstract] Semantic role labeling (SRL) focuses on recognizing the predicate–argument structure of a sentence and plays a critical role in many natural language processing tasks such as machine translation and question ...
    • Tratamiento sintáctico de la negación en análisis del sentimiento monolingüe y multilingüe 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (2017-09-19)
      [Abstract] Dealing with negation in a proper way is a relevant factor in order to obtain high performance sentiment analysis systems. In this framework, we present a method for the treatment of negation in Spanish that ...
    • Uma Utilidade para o Reconhecimento de Topónimos em Documentos Medievais 

      Canosa, Xavier; Gamallo, Pablo; Varela, Xavier; Martínez Lema, Paulo; García, Marcos; Taboada, José Angel (Universidade do Minho, 2019)
      [Resumo] Este artigo apresenta o método de construçao duma ferramenta para a anotaçao de entidades geográficas mencionadas em textos medievais. A nova ferramenta foi desenvolvida a partir dos módulos de língua contemporanea ...
    • Universal, unsupervised (rule-based), uncovered sentiment analysis 

      Vilares, David; Gómez-Rodríguez, Carlos; Alonso, Miguel A. (Elsevier, 2017-02)
      [Abstract]: We present a novel unsupervised approach for multilingual sentiment analysis driven by compositional syntax-based rules. On the one hand, we exploit some of the main advantages of unsupervised algorithms: (1) ...
    • Unveiling factors influencing judgment variation in sentiment analysis with natural language processing and statistics 

      Kellert, Olga; Gómez-Rodríguez, Carlos; Zaman, Mahmud Uz (Public Library of Science, 2024-05)
      [Abstract]: TripAdvisor reviews and comparable data sources play an important role in many tasks in Natural Language Processing (NLP), providing a data basis for the identification and classification of subjective judgments, ...