• Detecting Perspectives in Political Debates 

      Vilares, David; He, Yulan (Association for Computational Linguistics, 2017-09)
      [Abstract]: We explore how to detect people’s perspectives that occupy a certain proposition. We propose a Bayesian modelling approach where topics (or propositions) and their associated perspectives (or viewpoints) are ...
    • EN-ES-CS: An English-Spanish Code-Switching Twitter Corpus for Multilingual Sentiment Analysis 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (European Language Resources Association (ELRA), 2016-05)
      [Abstract]: Code-switching texts are those that contain terms in two or more different languages, and they appear increasingly often in social media. The aim of this paper is to provide a resource to the research community ...
    • Entity linking with distributional semantics 

      Gamallo, Pablo; García, Marcos (Springer, 2016-07)
      [Abstract] Entity Linking (EL) consists in linking name mentions in a given text with their referring entities in external knowledge bases such as DBpedia/Wikipedia. In this paper, we propose an EL approach whose main ...
    • Global Transition-based Non-projective Dependency Parsing 

      Fernández-González, Daniel; Shi, Tianze; Lee, Lillian (Association for Computational Linguistics (ACL), 2018)
      [Absctract]: Shi, Huang, and Lee (2017a) obtained state-of-the-art results for English and Chinese dependency parsing by combining dynamic-programming implementations of transition-based dependency parsers with a minimal ...
    • Identificación Automática del Idioma en Twitter: Adaptación de Identificadores del Estado del Arte al Contexto Ibérico 

      Doval, Yerai; Vilares, David; Vilares, Jesús (CEUR-WS.org, 2014)
      [Abstract]: We describe here our partipation in TweetLID. After having studied the problem of language identification, the resources available, and designed a text conflation approach for this kind of tasks, we joined ...
    • Incorporating lexico-semantic heuristics into coreference resolution sieves for named entity recognition at document-level 

      García, Marcos (European Language Resources Association (ELRA), 2016-05)
      [Abstract] This paper explores the incorporation of lexico-semantic heuristics into a deterministic Coreference Resolution (CR) system for classifying named entities at document-level. The highest precise sieves of a CR ...
    • Increasing NLP Parsing Efficiency with Chunking 

      Anderson, Mark Dáibhidh; Vilares, David (M D P I AG, 2018-09-19)
      [Abstract] We introduce a “Chunk-and-Pass” parsing technique influenced by a psycholinguistic model, where linguistic information is processed not word-by-word but rather in larger chunks of words. We present preliminary ...
    • Left-to-Right Dependency Parsing with Pointer Networks 

      Fernández-González, Daniel; Gómez-Rodríguez, Carlos (Association for Computational Linguistics (ACL), 2019)
      [Abstract]: We propose a novel transition-based algorithm that straightforwardly parses sentences from left to right by building n attachments, with n being the length of the input sentence. Similarly to the recent ...
    • Lyapunov filtering of objectivity for Spanish sentiment model 

      Chaturvedi, Iti; Cambria, Erik; Vilares, David (IEEE, 2016-07)
      [Abstract] Objective sentences lack sentiments and, hence, can reduce the accuracy of a sentiment classifier. Traditional methods prior to 2001 used hand-crafted templates to identify subjectivity and did not generalize ...
    • LyS A Coruña at GUA-SPA@IberLEF2023. Multi-Task Learning with Large Language Model Encoders for Guarani-Spanish Code Switching Analysis 

      Muñoz Ortiz, Alberto; Vilares, David (2023)
      [Abstract] This paper introduces the LyS A Coruña proposal for the Guarani-Spanish Code Switching Analysis task at IberLEF2023. The shared task proposes to analyze Guarani-Spanish code-switched texts, focusing on language ...
    • LyS at SemEval-2016 Task 4: Exploiting Neural Activation Values for Twitter Sentiment Classification and Quantification 

      Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Association for Computational Linguistics, 2016)
      [Abstract]: In this paper we describe our deep learning approach for solving both two-, three- and fiveclass tweet polarity classification, and twoand five-class quantification. We first trained a convolutional neural ...
    • LyS at TASS 2013: Analysing Spanish tweets by means of dependency parsing, semantic-oriented lexicons and psychometric word-properties 

      Vilares, David; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Abstract]: This article describes the approach developed by our group in order to resolve the sentiment analysis at a global level, topic identification and political tendency classification tasks on Spanish tweets; ...
    • LyS at TASS 2014: A Prototype for Extracting and Analysing Aspects from Spanish tweets 

      Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (Sociedad Española para el Procesamiento del Lenguaje Natural, 2014)
      [Abstract]: This paper describes our participation at the third edition of the work- shop on Sentiment Analysis focused on Spanish tweets, tass 2014. This year’s eval- uation campaign includes four challenges: (1) global ...
    • LyS at TASS 2015: Deep Learning Experiments for Sentiment Analysis on Spanish Tweets 

      Vilares, David; Doval, Yerai; Alonso, Miguel A.; Gómez-Rodríguez, Carlos (CEUR-WS Workshop Proceedings, 2015)
      [Abstract]: This paper describes the participation of the LyS group at tass 2015. In this year’s edition, we used a long short-term memory neural network to address the two proposed challenges: (1) sentiment analysis at ...
    • LyS: Porting a Twitter Sentiment Analysis Approach from Spanish to English 

      Vilares, David; Hermo, Miguel; Alonso, Miguel A.; Gómez-Rodríguez, Carlos; Doval, Yerai (Association for Computational Linguistics, 2014)
      [Abstract]: This paper proposes an approach to solve message- and phrase-level polarity classification in Twitter, derived from an existing system designed for Spanish. As a first step, an ad-hoc preprocessing is performed. ...
    • On the Logistical Difficulties and Findings of Jopara Sentiment Analysis 

      Agüero-Torales, Marvin M.; Vilares, David; López-Herrera, Antonio G. (Association for Computational Linguistics, 2021-06)
      [Abstract] This paper addresses the problem of sentiment analysis for Jopara, a code-switching language between Guarani and Spanish. We first collect a corpus of Guarani-dominant tweets and discuss on the difficulties of ...
    • On the Processing and Analysis of Microtexts: From Normalization to Semantics 

      Doval, Yerai; Vilares, David (M D P I AG, 2018-09-18)
      [Abstract] User-generated content published on microblogging social platforms constitutes an invaluable source of information for diverse purposes: health surveillance, business intelligence, political analysis, etc. We ...
    • Parsing as Pretraining 

      Vilares, David; Strzyz, Michalina; Søgaard, Anders; Gómez-Rodríguez, Carlos (2020)
      [Abstract] Recent analyses suggest that encoders pretrained for language modeling capture certain morpho-syntactic structure. However, probing frameworks for word vectors still do not report results on standard setups ...
    • Prototipado rápido de un sistema de normalización de tuitsuna aproximación léxica 

      Vilares, Jesús; Alonso, Miguel A.; Vilares, David (Sociedad Española para el Procesamiento del Lenguaje Natural, 2013)
      [Resumen]: Este trabajo describe el sistema de normalización de tuits en español desarrollado por el Grupo de Lengua Y Sociedad de la Información (LYS) de la Universidade da Coruña para el Tweet-Norm 2013. Se trata de un ...
    • Searching Four-Millennia-Old Documents: A Text Retrieval System for Egyptologists 

      Iglesias-Franjo, Estíbaliz; Vilares, Jesús (2016-08)
      [Abstract] Progress made in recent years has led to a growing interest in Digital Heritage. This article focuses on Egyptology and, more specifically, the study and preservation of ancient Egyptian scripts. We present a ...