• Discovering Topics in Twitter About the COVID-19 Outbreak in Spain 

      Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Sociedad Española de Procesamiento del Lenguaje Natural, 2021)
      [Resumen] En este trabajo, analizamos lo que los usuarios han estado discutiendo en Twitter durante el comienzo de la pandemia causada por el COVID-19. Concretamente, analizamos tres fases diferenciadas de la crisis del ...
    • EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)
      [Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...
    • Entity linking with distributional semantics 

      Gamallo, Pablo; García, Marcos (Springer, 2016-07)
      [Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...
    • Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Elsevier B.V., 2019-10)
      [Absctract]: An increasingly wide range of artificial intelligence applications rely on syntactic information to process and extract meaning from natural language text or speech, with constituent trees being one of the ...
    • From Partial to Strictly Incremental Constituent Parsing 

      Ezquerro, Ana; Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics, 2024-03)
      [Absctract]: We study incremental constituent parsers to assess their capacity to output trees based on prefix representations alone. Guided by strictly left-to-right generative language models and tree-decoding modules, ...
    • Global Transition-based Non-projective Dependency Parsing 

      Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)
      [Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...
    • Grounding the Semantics of Part-of-Day Nouns Worldwide using Twitter 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2018-06)
      [Absctract]: The usage of part-of-day nouns, such as ‘night’, and their time-specific greetings (‘good night’), varies across languages and cultures. We show the possibilities that Twitter offers for studying the semantics ...
    • Harry Potter and the Action Prediction Challenge from Natural Language 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-06)
      [Absctract]: We explore the challenge of action prediction from textual descriptions of scenes, a testbed to approximate whether text inference can be used to predict upcoming actions. As a case of study, we consider the ...
    • HEAD-QA: A Healthcare Dataset for Complex Reasoning 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)
      [Absctract]: We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and ...
    • How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis 

      Gómez-Rodríguez, Carlos; Alonso-Alonso, Iago; Vilares, David (Springer, 2019)
      [Abstract]: Syntactic parsing, the process of obtaining the internal structure of sentences in natural languages, is a crucial task for artificial intelligence applications that need to extract meaning from natural language ...
    • Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico 

      Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)
      [Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...
    • Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level 

      García, Marcos (European Language Resources Association (ELRA), 2016-05)
      [Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...
    • Increasing NLP Parsing Efficiency with Chunking 

      Anderson, Mark Dáibhidh; Vilares, David (M D P I AG, 2018-09-19)
      [Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...
    • Intelligent retrieval for biodiversity 

      Vilares Ferro, Manuel; Fernández, Milagros; Blanco, Adrián; Gómez-Rodríguez, Carlos (2016-02)
      [Abstract] A knowledge discovery and representation frame to mine contents in systems biology is described. It applies natural language processing to integrate linguistic and domain knowledge in a mathematical model for ...
    • Left-to-Right Dependency Parsing with Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2019)
      [Abstract]: We propose a novel transition-based algorithm that straightforwardly parses sentences from left to right by building n attachments, with n being the length of the input sentence. Similarly to the recent ...
    • Liberating language research from dogmas of the 20th century 

      Ferrer-i-Cancho, Ramon; Gómez-Rodríguez, Carlos (2016)
      [Abstract] A commentary on the article “Large -scale evidence of dependency length minimization in 37 languages” by Futrell, Mahowald & Gibson (PNAS 2015 112 (33) 10336-10341).
    • Lyapunov filtering of objectivity for Spanish sentiment model 

      Chaturvedi, Iti; Cambria, Erik; Vilares, David (IEEE, 2016-07)
      [Abstract] Objective sentences lack sentiments and, hence, can reduce the accuracy of a sentiment classifier. Traditional methods prior to 2001 used hand-crafted templates to identify subjectivity and did not generalize ...
    • LyS A Coruña at GUA-SPA@IberLEF2023. Multi-Task Learning with Large Language Model Encoders for Guarani-Spanish Code Switching Analysis 

      Muñoz Ortiz, Alberto; Vilares, David (2023)
      [Abstract] This paper introduces the LyS A Coruña proposal for the Guarani-Spanish Code Switching Analysis task at IberLEF2023. The shared task proposes to analyze Guarani-Spanish code-switched texts, focusing on language ...
    • LyS at SemEval-2016 Task 4: Exploiting Neural Activation Values for Twitter Sentiment Classification and Quantification 

      Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2016)
      [Abstract]: In this paper we describe our deep learning approach for solving both two-, three- and fiveclass tweet polarity classification, and twoand five-class quantification. We first trained a convolutional neural ...
    • LyS at TASS 2013: Analysing Spanish tweets by means of dependency parsing, semantic-oriented lexicons and psychometric word-properties 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Abstract]: This article describes the approach developed by our group in order to resolve the sentiment analysis at a global level, topic identification and political tendency classification tasks on Spanish tweets; ...