• Cognitive Constraints Built into Formal Grammars: Implications for Language Evolution 

      Gómez-Rodríguez, Carlos; Christiansen, Morten H.; Ferrer-i-Cancho, Ramon (Ravignani, A., Barbieri, C., Martins, M., Flaherty, M., Jadoul, Y., Lattenkamp, E., Little, H., Mudd, K., Verhoef, T., 2020-04-17)
      [Abstract] We study the validity of the cognitive independence assumption using an ensemble of artificial syntactic structures from various classes of dependency grammars. Our findings show that memory limitations have ...
    • Constituent Parsing as Sequence Labeling 

      Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics (ACL), 2018)
      [Absctract]: We introduce a method to reduce constituent parsing to sequence labeling. For each word wt, it generates a label that encodes: (1) the number of ancestors in the tree that the words wt and wt+1 have in common, ...
    • Cross-lingual Inflection as a Data Augmentation Method for Parsing 

      Muñoz-Ortiz, Alberto; Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics, 2022-05)
      [Absctract]: We propose a morphology-based method for low-resource (LR) dependency parsing. We train a morphological inflector for target LR languages, and apply it to related rich-resource (RR) treebanks to create ...
    • Detecting Perspectives in Political Debates 

      Vilares, David; He, Yulan (Association for Computational Linguistics, 2017-09)
      [Abstract]: We explore how to detect people’s perspectives that occupy a certain proposition. We propose a Bayesian modelling approach where topics (or propositions) and their associated perspectives (or viewpoints) are ...
    • Discontinuous Constituent Parsing as Sequence Labeling 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2020-11)
      [Absctract]: This paper reduces discontinuous parsing to sequence labeling. It first shows that existing reductions for constituent parsing as labeling do not support discontinuities. Second, it fills this gap and proposes ...
    • EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)
      [Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...
    • Entity linking with distributional semantics 

      Gamallo, Pablo; García, Marcos (Springer, 2016-07)
      [Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...
    • From Partial to Strictly Incremental Constituent Parsing 

      Ezquerro, Ana; Gómez-Rodríguez, Carlos; Vilares, David (Association for Computational Linguistics, 2024-03)
      [Absctract]: We study incremental constituent parsers to assess their capacity to output trees based on prefix representations alone. Guided by strictly left-to-right generative language models and tree-decoding modules, ...
    • Global Transition-based Non-projective Dependency Parsing 

      Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)
      [Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...
    • Grounding the Semantics of Part-of-Day Nouns Worldwide using Twitter 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2018-06)
      [Absctract]: The usage of part-of-day nouns, such as ‘night’, and their time-specific greetings (‘good night’), varies across languages and cultures. We show the possibilities that Twitter offers for studying the semantics ...
    • Harry Potter and the Action Prediction Challenge from Natural Language 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-06)
      [Absctract]: We explore the challenge of action prediction from textual descriptions of scenes, a testbed to approximate whether text inference can be used to predict upcoming actions. As a case of study, we consider the ...
    • HEAD-QA: A Healthcare Dataset for Complex Reasoning 

      Vilares, David; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2019-07)
      [Absctract]: We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and ...
    • Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico 

      Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)
      [Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...
    • Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level 

      García, Marcos (European Language Resources Association (ELRA), 2016-05)
      [Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...
    • Increasing NLP Parsing Efficiency with Chunking 

      Anderson, Mark Dáibhidh; Vilares, David (M D P I AG, 2018-09-19)
      [Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...
    • Left-to-Right Dependency Parsing with Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2019)
      [Abstract]: We propose a novel transition-based algorithm that straightforwardly parses sentences from left to right by building n attachments, with n being the length of the input sentence. Similarly to the recent ...
    • Lyapunov filtering of objectivity for Spanish sentiment model 

      Chaturvedi, Iti; Cambria, Erik; Vilares, David (IEEE, 2016-07)
      [Abstract] Objective sentences lack sentiments and, hence, can reduce the accuracy of a sentiment classifier. Traditional methods prior to 2001 used hand-crafted templates to identify subjectivity and did not generalize ...
    • LyS A Coruña at GUA-SPA@IberLEF2023. Multi-Task Learning with Large Language Model Encoders for Guarani-Spanish Code Switching Analysis 

      Muñoz Ortiz, Alberto; Vilares, David (2023)
      [Abstract] This paper introduces the LyS A Coruña proposal for the Guarani-Spanish Code Switching Analysis task at IberLEF2023. The shared task proposes to analyze Guarani-Spanish code-switched texts, focusing on language ...
    • LyS at SemEval-2016 Task 4: Exploiting Neural Activation Values for Twitter Sentiment Classification and Quantification 

      Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2016)
      [Abstract]: In this paper we describe our deep learning approach for solving both two-, three- and fiveclass tweet polarity classification, and twoand five-class quantification. We first trained a convolutional neural ...
    • LyS at TASS 2013: Analysing Spanish tweets by means of dependency parsing, semantic-oriented lexicons and psychometric word-properties 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Abstract]: This article describes the approach developed by our group in order to resolve the sentiment analysis at a global level, topic identification and political tendency classification tasks on Spanish tweets; ...